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CHLAMYDIA PNEUMONIAE GENOME SEQUENCE 

CROSS-REFERENCES TO RELATED APPLICATIONS 
The present application is related to 60/128,606, filed April 8, 1999 and 
5 60/108,279, filed November 12, 1998, which are incorporated herein by reference. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 



10 FIELD OF THE INVENTION 

This invention relates to nucleic acids and polypeptides from Chlamydia 
pneumoniae and to their use in the diagnosis, prevention and treatment of diseases 
associated with C pneumoniae. 

15 BACKGROUND OF THE INVENTION 

Chlamydiaceae is a family of obligate intracellular parasite with a tropism 

for epithelial cells lining the mucus membranes. The bacteria have two morphologically 

distinct forms, "elementary body" and "reticulate body". The elementary body is the 

infectious form, and has a rigid cell wall, primarily of cross-linked outer membrane 
20 proteins. The reticulate body is the intracellular, metabolically active form. A unique 

developmental cycle between these two forms characterizes Chlamydia growth. 

G pneumoniae is a human respiratory pathogen that causes acute 

respiratory disease, and approximately 10% of community-acquired pneumonia. 

Antibody prevalence studies have shown that virtually everyone is infected with C. 
25 pneumoniae at some time, and that reinfection is common. In addition to respiratory 

disease, studies have shown an association of this organism with coronary artery disease. 

It has been demonstrated in atherosclerotic lesions of the aorta and coronary arteries by 

immunocytochemistry and by polymerase chain reaction (Kuo et al. (1993) J Infect Pis 

167(4):84 1-849). 

30 Recent reports have further demonstrated the presence of C. pneumoniae 

in the walls of abdominal aortic aneurysms (Juvonen et al (1 997) J Vase Surg 
25(3):499-505). Abdominal aortic aneurysms are frequently associated with 
atherosclerosis, and inflammation may be an important factor in aneurysmal dilatation. 
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C. pneumoniae may play a role in maintaining an inflammation and triggering the 
development of aortic aneurysms. 

Muhlestein et al (1996) JACC 27:1555-61, reported a differential 
incidence of Chlamydia species within the coronary artery wall of patients with 
5 atherosclerosis versus those with other forms of cardiovascular disease. The extremely 
high rate of possible infection inpatients with symptomatic atherosclerotic disease 
compared to the very low rate in patients with normal coronary arteries or coronary artery 
disease from chronic transplant rejection provides evidence for a direct link between the 
atherosclerotic process and Chlamydia infection. Because a history of chlamydial 

10 infection is so prevalent in the population, the issue of causality remains. On a 

physiologic and pathologic level, abnormal interactions among endothelial cells, platelets, 
macrophages and lymphocytes may lead to a cascade of events resulting in acute 
endothelial damage, thrombosis and repair, chronically leading to the development of 
atheroma in blood vessels. 

15 C. pneumoniae is related to other Chlamydia species, but the level of 

sequence similarity is relatively low. Very little is known about the biology of this 
organism, although it appears to be an important human pathogen. Allelic diversity and 
structural relationships between specific genes of Chlamydial species is described in 
Kaltenboeck et al (1993) J Bacteriol 175(2):487-502; Gaydos et al (1992) Infect Immun 

20 60(12):5319-5323; Everett et al (1997) Int JSvst Bacteriol 47(2):46 1-473; and 
Pudjiatmoko et al (1997) Int J Svst Bacteriol 47(2):425-431. 

A number of studies have been published describing methods for detection 
of C. pneumoniae, and for distinguishing between Chlamydial species. Such methods 
include PCR detection (Rasmussen et al (1992) Mol Cell Probes 6(5):389-394; Holland 

25 et al (1990) J Infect Pis 162(4):984-987); a simplified polymerase chain reaction-enzyme 
immunoassay (Wilson et al. (1996) J Appl Bacteriol 80(4):43 1-438); sequence 
determination and restriction endonuclease cleavage (Herrmann et al (1996) J Clin 
MiSEobioi 34(8):1897-1902). 

Antigenic and molecular analyses of different C. pneumoniae strains is 

30 described in Jantos et al (1997) J Clin Microbiol 35(3):620-623. Some genes of C 

pneumoniae have been isolated and sequenced. These include the Gro E operon (Kikuta 
et al. (1991) Infect Immun 59(12):4665-4669); the major outer membrane protein Perez et 
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al (1991) Infect Immun 59(6):2 195-2 199; the DnaK protein homolog (Kornak et al 
(1991) Infect Immun 59(2):721-725); as well as a number of ribosomal and other genes. 

5 

SUMMARY OF THE INVENTION 
This invention provides the genomic sequence of Chlamydia pneumoniae. 
The sequence information is useful for a variety of diagnostic and analytical methods. 
The genomic sequence may be embodied in a variety of media, including computer 
10 readable forms, or as a nucleic acid comprising a selected fragment of the sequence. 

Such fragments generally consist of an open reading frame, transcriptional or translation^ 
control elements, or fragments derived therefrom. Proteins encoded by the open reading 
frames are useful for diagnostic purposes, as well as for their enzymatic or structural 
activity. 

15 

DEFINITIONS 

The term "amino acid" refers to naturally occurring and synthetic amino 
acids, as well as amino acid analogs and amino acid mimetics that function in a manner 
similar to the naturally occurring amino acids. Naturally occurring amino acids are those 

20 encoded by the genetic code, as well as those amino acids that are later modified, e.g., 

hydroxyproline, y-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to 
compounds that have the same basic chemical structure as a naturally occurring amino 
acid, i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and 
an R group., e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl 

25 sulfonium Such analogs have modified R groups (e.g., norleucine) or modified peptide 
backbones, but retain the same basic chemical structure as a naturally occurring amino 
acid. Amino acid mimetics refers to chemical compounds that have a structure that is 
different from the general chemical structure of an amino acid, but that functions in a 
manner similar to a naturally occurring amino acid. 

30 Amino acids may be referred to herein by either their commonly known 

three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB 
Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by 
their commonly accepted single-letter codes. 

3 
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"Amplification" primers are oligonucleotides comprising either natural or 
analogue nucleotides that can serve as the basis for the amplification of a select nucleic 
acid sequence. They include, e.g., polymerase chain reaction primers and ligase chain 
reaction oligonucleotides. 
5 "Antibody" refers to an immunoglobulin molecule able to bind to a 

specific epitope on an antigen. Antibodies can be a polyclonal mixture or monoclonal. 
Antibodies can be intact immunoglobulins derived from natural sources or from 
recombinant sources and can be immunoreactive portions of intact immunoglobulins. 
Antibodies may exist in a variety of forms including, for example, Fv, F ab , and F(ab>2, as 
10 well as in single chains. Single-chain antibodies, in which genes for a heavy chain and a 
light chain are combined into a single coding sequence, may also be used. 

An "antigen" is a molecule that is recognized and bound by an antibody, 
e.g., peptides, carbohydrates, organic molecules, or more complex molecules such as 
glycolipids and glycoproteins. The part of the antigen that is the target of antibody 
15 binding is an antigenic determinant and a small functional group that corresponds to a 
single antigenic determinant is called a hapten. 

"Biological sample" refers to any sample obtained from a living or dead 
organism. Examples of biological samples include biological fluids and tissue specimens. 
Such biological samples can be prepared for analysis of the presence of C. pneumoniae 
20 nucleic acids, proteins, or antibodies specifically reactive with the proteins. 

The term "C. pneumoniae gene" shall be intended to mean the open 
reading frame encoding specific C. pneumoniae polypeptides, as well as adjacent 5' and 
3' non-coding nucleotide sequences involved in the regulation of expression, up to about 
2 kb beyond the coding region, but possibly further in either direction. The gene may be 
25 introduced into an appropriate vector for extrachromosomal maintenance or for 
integration into a host genome. 

"Conservatively modified variants" applies to both amino acid and nucleic 
acid sequences. With respect to particular nucleic acid sequences, conservatively 
modified variants refers to those nucleic acids which encode identical or essentially 
30 identical amino acid sequences, or where the nucleic acid does not encode an amino acid 
sequence, to essentially identical sequences. Specifically, degenerate codon substitutions 
may be achieved by generating sequences in which the third position of one or more 
selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues 

4 
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(Batzer et al, Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al. % J. Biol. Chem. 260:2605- 
2608 (1985); Rossolini et al. f MoL Cell Probes 8:91-98 (1994)). Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids 
encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all 
5 encode the amino acid alanine. Thus, at every position where an alanine is specified by a 
codon, the codon can be altered to any of the corresponding codons described without 
altering the encoded polypeptide. Such nucleic acid variations are "silent variations," 
which are one species of conservatively modified variations. Every nucleic acid sequence 
herein which encodes a polypeptide also describes every possible silent variation of the 

10 nucleic acid. One of ski ll will recognize that each codon in a nucleic acid (except AUG, 
which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only 
codon for tryptophan) can be modified to yield a functionally identical molecule. 
Accordingly, each silen: variation of a nucleic acid which encodes a polypeptide is 
implicit in each describ id sequence. 

15 As to amino acid sequences, one of skill will recognize that individual 

substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 

20 Conservative substitution tables providing functionally similar amino acids are well 
known in the art. Such conservatively modified variants are in addition to and do not 
exclude polymorphic variants, interspecies homologs, and alleles of the invention. 



The following groups each contain amino acids that are conservative 



substitutions for one another: 



25 



30 



1 ) Alanine (A), Glycine (G); 

2) Serine (S), Threonine (T); 

3) Aspartic acid (D), Glutamic acid (E); 

4) Asparagine (N), Glutamine (Q); 

5) Cysteine (C), Methionine (M); 

6) Arginine (R), Lysine (K), Histidine (H); 

7) Isoleucine (I), Leucine (L), Valine (V); and 

8) Phenylalanine (F), Tyrosine (Y), Tryptophan (W). 
see, e.g., Creighton, Proteins (1984)). 



5 
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The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences 
that are the same or have a specified percentage of amino acid residues or nucleotides that 
are the same, when compared and aligned for maximum correspondence over a 

5 comparison window, as measured using one of the following sequence comparison 
algorithms or by manual alignment and visual inspection. This definition also refers to 
the complement of a test sequence, which has a designated percent sequence or 
subsequence complementarity when the test sequence has a designated or substantial 
identity to a reference sequence. For example, a designated amino acid percent identity 

10 of 95% refers to sequences or subsequences that have at least about 95% amino acid 
identity when aligned for maximum correspondence over a comparison window as 
measured using one of the following sequence comparison algorithms or by manual 
alignment and visual inspection. Such sequences would then be said to have substantial 
identity, or to be substantially identical to each other. Preferably, sequences have at least 

1 5 about 70% identity, more preferably 80% identity, more preferably 90-95% identity and 
above. Preferably, the percent identity exists over a region of the sequence that is at least 
about 25 amino acids in length, more preferably over a region that is 50-100 amino acids 
in length. 

When percentage of sequence identity is used in reference to proteins or 
20 peptides, it is recognized that residue positions that are not identical often differ by 
conservative amino acid substitutions, where amino acids residues are substituted for 
other amino acid residues with similar chemical properties (e.g., charge or 
hydrophobicity) and therefore do not change the functional properties of the molecule. 
Where sequences differ in conservative substitutions, the percent sequence identity may 
25 be adjusted upwards to correct for the conservative nature of the substitution. Means for 
making this adjustment are well known to those of skill in the art. Typically this involves 
scoring a conservative substitution as a partial rather than a full mismatch, thereby 
increasing the percentage sequence identity. Thus, for example, where an identical amino 
acid is given a score of 1 and a non-conservative substitution is given a score of zero, a 
30 conservative substitution is given a score between zero and 1 . The scoring of 

conservative substitutions is calculated according to, e.g., the algorithm of Meyers & 
Miller, Computer Applic. Biol Sci. 4:1 1-17 (1988) e.g., as implemented in the program 
PC/GENE (Intelligenetics, Mountain View, California, USA).. 

6 
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For sequence comparison, typically one sequence acts as a reference 
sequence, to which test sequences are compared. When using a sequence comparison 
algorithm, test and reference sequences are entered into a computer, subsequence 
coordinates are designated, if necessary, and sequence algorithm program parameters are 

5 designated. Default program parameters can be used, or alternative parameters can be 
designated. The sequence comparison algorithm then calculates the percent sequence 
identity for the test sequence(s) relative to the reference sequence, based on the 
~~ designated or default program parameters. 

A comparison window includes reference to a segment of any one of the 

10 number of contiguous positions selected from the group consisting of from 25 to 600, 
usually about 50 to about 200, more usually about 100 to about 150 in which a sequence 
may be compared to a reference sequence of the same number of contiguous positions 
after the two sequences are optimally aligned. Methods of alignment of sequences for 
comparison are well-known in the art. Optimal alignment of sequences for comparison 

15 can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. 
Appl Math. 2:482 (1981), by the homology alignment algorithm of Needleman & 
Wunsch, J. Mol Biol. 48:443 (1970), by the search for similarity method of Pearson & 
Lipman, Proc. Natl Acad. Sci. USA 85:2444 (1988), by computerized implementations 
of these algorithms (GAP, BESTFIT, FAST A, and TFASTA in the Wisconsin Genetics 

20 Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by 
manual alignment and visual inspection (see, e.g., Ausubel et ai, supra). 

One example of a useful algorithm is PILEUP. PILEUP creates a multiple 
sequence alignment from a group of related sequences using progressive, pairwise 
alignments to show relationship and percent sequence identity. It also plots a tree or 

25 dendogram showing the clustering relationships used to create the alignment. PILEUP 
uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol. 
Evol. 35:351-360 (1987). The method used is similar to the method described by Higgins 
& Sharp, CABIOS 5:151-153 (1989). The program can align up to 300 sequences, each 
of a maximum length of 5,000 nucleotides or amino acids. The multiple alignment 

30 procedure begins with the pairwise alignment of the two most similar sequences, 

producing a cluster of two aligned sequences. This cluster is then aligned to the next 
most related sequence or cluster of aligned sequences. Two clusters of sequences are 
aligned by a simple extension of the pairwise alignment of two individual sequences. The 

7 
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final alignment is achieved by a series of progressive, pairwise alignments. The program 
is run by designating specific sequences and their amino acid or nucleotide coordinates 
for regions of sequence comparison and by designating the program parameters. Using 
PILEUP, a reference sequence is compared to other test sequences to determine the 

5 percent sequence identity relationship using the following parameters: default gap weight 
(3.00), default gap length weight (0. 10), and weighted end gaps. PILEUP can be obtained 
from the GCG sequence analysis software package, e.g, version 7.0 (Devereaux et ai t 
Nuc. Acids Res. 12:387-395(1984). 

Another example of algorithm that is suitable for determining percent 

10 sequence identity (i.e., substantial similarity or identity) is the BLAST algorithm, which 
is described in Altschul et ai, J. Mol Biol. 215:403-410 (1990). Software for performing 
BLAST analyses is publicly available through the National Center for Biotechnology 
Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying 
high scoring sequence pairs (HSPs) by identifying short words of length W in the query 

1 5 sequence, which either match or satisfy some positive-valued threshold score T when 
aligned with a word of the same length in a database sequence. T is referred to as the 
neighborhood word score threshold (Altschul et al, supra). These initial neighborhood 
word hits act as seeds for initiating searches to find longer HSPs containing them. The 
word hits are then extended in both directions along each sequence for as far as the 

20 cumulative alignment score can be increased. Cumulative scores are calculated using, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues, always < 0). For amino acid 
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the 
word hits in each direction are halted when: the cumulative alignment score falls off by 

25 the quantity X from its maximum achieved value; the cumulative score goes to zero or 
below, due to the accumulation of one or more negative-scoring residue alignments; or 
the end of either sequence is reached. The BLAST algorithm parameters W, T, and X 
determine the sensitivity and speed of the alignment. The BLASTN program (for 
nucleotide sequences) uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, 

30 M=5, N=4, and a comparison of both strands. For amino acid sequences, the BLASTP 
program uses as default parameters a wordlength (W) of 3, an expectation (E) of 10, and 
the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 
89:10915 (1989)). 

8 



WO0027994 [file:/A\dcwas03\firmdata\lp\FolevPat\PatentDocuments\WO0027994.c pc1 



Page 1 1 of 330 



WO 00/27994 PCI7US99/26923 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'L Acad. Sci. USA 
90:5873-5787 (1993)). One measure of similarity provided by the BLAST algorithm is 
the smallest sum probability (P(N))> which provides an indication of the probability by 
5 which a match between two nucleotide or amino acid sequences would occur by chance. 
For example, a nucleic acid is considered similar to a reference sequence if the smallest 
sum probability in a comparison of the test nucleic acid to the reference nucleic acid is 
less than about 0. 1, more preferably less than about 0.01, and most preferably less than 
about 0.001. 

1 0 An indication that two nucleic acid sequences or polypeptides are 

substantially identical is that the polypeptide encoded by the first nucleic acid is 
immunologically cross .reactive with the antibodies raised against the polypeptide 
encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically 
substantially identical tu a second polypeptide, for example, where the two peptides differ 

15 only by conservative substitutions. Another indication that two nucleic acid sequences 
are substantially identical is that the two molecules or their complements hybridize to 
each other under stringent conditions, as described below. 

Another indication that polynucleotide sequences are substantially 
identical is if two molecules hybridize to each other under stringent conditions. Stringent 

20 conditions are sequence dependent and will be different in different circumstances. 
Generally, stringent conditions are selected to be about 5°C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm 
is the temperature (under defined ionic strength and pH) at which 50% of the target 
sequence hybridizes to a perfectly matched probe. Typically stringent conditions for a 

25 Southern blot protocol involve hybridizing in a buffer comprising 5x SSC, 1% SDS at 
65°C or hybridizing in a buffer containing 5x SSC and 1% SDS at 42°C and washing at 
65°C with a 0.2x SSC, 0.1% SDS wash. 

A "label" is a composition detectable by spectroscopic, photochemical, 
biochemical, immunochemical, or chemical means. For example, useful labels include 

30 32 P, fluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an 
ELISA), biotin, dioxigenin, or haptens and proteins for which antisera or monoclonal 
antibodies are available. 



9 
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The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides 
and polymers thereof in either single- or double-stranded form. The term encompasses 
nucleic acids containing known nucleotide analogs or modified backbone residues or 
linkages, which are synthetic, naturally occurring, and non-naturally occurring, which 
5 have similar binding properties as the reference nucleic acid, and which are metabolized 
in a manner similar to the reference nucleotides. Examples of such analogs include, 
without limitation, phosphorothioates, phosphoramidates, methyl phosphonates, 
chiral-methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids (PNAs). 
Unless otherwise indicated, a particular nucleic acid sequence also 

10 implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon 
substitutions) and complementary sequences, as well as the sequence explicitly indicated. 
The term nucleic acid is used interchangeably with gene, cDNA, mRNA, oligonucleotide, 
and polynucleotide. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 

1 5 nucleic acid capable of binding to a target nucleic acid of complementary sequence 
through one or more types of chemical bonds, usually through complementary base 
pairing, usually through hydrogen bond formation. As used herein, a probe may include 
natural (i.e., A, G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In 
addition, the bases in a probe may be joined by a linkage other than a phosphodiester 

20 bond, so long as it does not interfere with hybridization. Thus, for example, probes may 
be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather 
than phosphodiester linkages. It will be understood by one of skill in the art that probes 
may bind target sequences lacking complete complementarity with the probe sequence 
depending upon the stringency of the hybridization conditions. The probes are preferably 

25 directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly 
labeled such as with biotin to which a streptavidin complex may later bind. By assaying 
for the presence or absence of the probe, one can detect the presence or absence of the 
select sequence or subsequence. 

A labeled nucleic acid probe or oligonucleotide is one that is bound, either 

30 covalently, through a linker, or through ionic, van der Waals or hydrogen bonds to a label 
such that the presence of the probe may be detected by detecting the presence of the label 
bound to the probe. 

10 
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"Pharmaceutically acceptable" means a material that is not biologically or 
otherwise undesirable, i.e., the material can be administered to an individual along with a 
Chlamydia antigen without causing any undesirable biological effects or interacting in a 
deleterious manner with any of the other components of the pharmaceutical composition. 
5 The terms "polypeptide," "peptide" and "protein" are used interchangeably 

herein to refer to a polymer of amino acid residues. The terms apply to amino acid 
polymers in which one or more amino acid residue is an analog or mimetic of a 
~~ corresponding naturally occurring amino acid, as well as to naturally occurring amino 

acid polymers. 

10 The phrase "specifically or selectively hybridizing to," refers to 

hybridization between a probe and a target sequence in which the probe binds 
substantially only to the target sequence, forming a hybridization complex, when the 
target is in a heterogeneous mixture of polynucleotides and other compounds. Such 
hybridization is determinative of the presence of the target sequence. Although the probe 

1 5 may bind other unrelated sequences, at least 90%, preferably 95% or more of the 
hybridization complexes formed are with the target sequence. 

The term "recombinant" when used with reference to a cell, or nucleic 
acid, or vector, indicates that the cell, or nucleic acid, or vector, has been modified by the 
introduction of a heterologous nucleic acid or the alteration of a native nucleic acid, or 

20 that the cell is derived from a cell so modified. Thus, for example, recombinant cells 
express genes that are not found within the native (non-recombinant) form of the cell or 
express native genes that are otherwise abnormally expressed, under expressed or not 
expressed at all. 

The phrase "specifically immunoreactive with", when referring to a protein 
25 or peptide, refers to a binding reaction between the protein and an antibody which is 

determinative of the presence of the protein in the presence of a heterogeneous population 
of proteins and other compounds. Thus, under designated immunoassay conditions, the 
specified antibodies bind to a particular protein and do not bind in a significant amount to 
other proteins present in the sample. Specific binding to an antibody under such 
30 conditions may require an antibody that is selected for its specificity for a particular 

protein. A variety of immunoassay formats may be used to select antibodies specifically 
immunoreactive with a particular protein and are described in detail below. 

11 
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The phrase "substantially pure" or "isolated" when referring to a 
Chlamydia peptide or protein, means a chemical composition which is free of other 
subcellular components of the Chlamydia organism. Typically, a monomeric protein is 
substantially pure when at least about 85% or more of a sample exhibits a single 

5 polypeptide backbone. Minor variants or chemical modifications may typically share the 
same polypeptide sequence. Depending on the purification procedure, purities of 85%, 
and preferably over 95% pure are possible. Protein purity or homogeneity may be 
indicated by a number of means well known in the an, such as polyacrylamide gel 
electrophoresis of a protein sample, followed by visualizing a single polypeptide band on 

10 a polyacrylamide gel upon silver staining. For certain purposes high resolution will be 
needed and HPLC or a similar means for purification utilized. 

DETAILED DESCRIPTION 
The present invention provides the nucleotide sequence of the C. 

15 pneumoniae genome SEQ ID NO: 1 or a representative fragment thereof, in a form which 
can be readily used, analyzed, and interpreted by a skilled artisan. As used herein, a 
"representative fragment" of the nucleotide sequence depicted in SEQ ID NO: 1 refers to 
any portion which is not presently represented within a publicly available database. 
Preferred representative fragments of the present invention are open reading frames, 

20 expression modulating fragments, uptake modulating fragments, and fragments which can 
be used to diagnose the presence of C. pneumoniae in sample. Using the information 
provided in the present application, together with routine cloning and sequencing 
methods, one of ordinary skill in the art will be able to clone and sequence all 
"representative fragments" of interest including open reading frames (ORFs) encoding a 

25 large variety of C. pneumoniae proteins. A non-limiting identification of such preferred 
representative fragments is provided in Tables 2 and 3. 

Diagnostic use of C. pneumoniae nucleic acids 

Hvbridization-based assays 

Using the nucleic acids disclosed here, one of skill can design nucleic acid 
30 hybridization-based assays for the detection of C pneumoniae. Any of a number of well 
known techniques for the specific detection of target nucleic acids can be used. 
Exemplary hybridization-based assays include, but are not limited to, traditional "direct 

12 
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probe" methods such as Southern Blots, dot blots, in situ hybridization (e.g., FISH), PCR, 
and the like. The methods can be used in a wide variety of formats including, but not 
limited to substrate- {e.g. membrane or glass) bound methods or array-based approaches 
as described below. As noted above, this invention also embraces methods for detecting 
5 the presence of Chlamydia DNA or RNA in biological samples. These sequences can be 
used to detect Chlamydia in biological samples from patients suspected of being infected. 
A variety of methods of specific DNA and RNA measurement using nucleic acid 
hybridization techniques are known to those of skill in the an {see Sambrook et at., 
supra). 

10 In situ hybridization assays are well known {e.g.. Angerer (1987) Meth. 

Enzymol 152: 649). Generally, in situ hybridization comprises the following major steps: 
(1) fixation of tissue or biological structure to analyzed; (2) prehybridization treatment of 
the biological structure to increase accessibility of target DNA, and to reduce nonspecific 
binding; (3) hybridizatic n of the mixture of nucleic acids to the nucleic acid in the 

1 5 biological structure or tissue; (4) post-hybridization washes to remove nucleic acid 

fragments not bound in the hybridization and (5) detection of the hybridized nucleic acid 
fragments. The reagent used in each of these steps and the conditions for use vary 
depending on the particular application. 

In a typical in situ hybridization assay, cells are fixed to a solid support, 

20 typically a glass slide. If a nucleic acid is to be probed, the cells are typically denatured 
with heat or alkali. The cells are then contacted with a hybridization solution at a 
moderate temperature to permit annealing of labeled probes specific to the nucleic acid 
sequence encoding the protein. The targets {e.g., cells) are then typically washed at a 
predetermined stringency or at an increasing stringency until an appropriate signal to 

25 noise ratio is obtained. 

The nucleic acids of this invention are particularly well suited to array- 
based hybridization formats. Arrays are a multiplicity of different "probe" or "target" 
nucleic acids (or other compounds) attached to one or more surfaces {e.g., solid, 
membrane, or gel). In a preferred embodiment, the multiplicity of nucleic acids (or other 

30 moieties) is attached to a single contiguous surface or to a multiplicity of surfaces 
juxtaposed to each other. 

In an array format a large number of different hybridization reactions can 
be run essentially "in parallel." This provides rapid, essentially simultaneous, evaluation 

13 
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of a number of hybridizations in a single "experiment". Methods of performing 
hybridization reactions in array based formats are well known to those of skill in the art 
(see. e.g., Pastinen (1997) Genome Res. 7: 606-614; Jackson (1996) Nature 
Biotechnology 14:1685; Chee (1995) Science 274: 610; WO 96/17958. 
5 Arrays, particularly nucleic acid arrays can be produced according to a 

wide variety of methods well known to those of skill in the art. For example, in a simple 
embodiment, "low density" arrays can simply be produced by spotting {e.g. by hand using 
a pipette) different nucleic acids at different locations on a solid support (e.g. a glass 
surface, a membrane, etc.). 

10 This simple spotting, approach has been automated to produce high 

density spotted arrays (see, e.g., U.S. Patent No: 5,807,522). This patent describes the 
use of an automated systems that taps a microcapillary against a surface to deposit a small 
volume of a biological sample. The process is repeated to generate high density arrays. 
Arrays can also be produced using oligonucleotide synthesis technology. Thus, for 

15 example, U.S. Patent No. 5,143,854 and PCT patent publication Nos. WO 90/15070 and 
92/10092 teach the use of light-directed combinatorial synthesis of high density 
oligonucleotide arrays. 

Many methods for immobilizing nucleic acids on a variety of solid 
surfaces are known in the art. A wide variety of organic and inorganic polymers, as well 

20 as other materials, both natural and synthetic, can be employed as the material for the 
solid surface. Illustrative solid surfaces include, e.g., nitrocellulose, nylon, glass, quartz, 
diazotized membranes (paper or nylon), silicones, polyformaldehyde, cellulose, and 
cellulose acetate. In addition, plastics such as polyethylene, polypropylene, polystyrene, 
and the like can be used. Other materials which may be employed include paper, 

25 ceramics, metals, metalloids, semiconductive materials, cermets or the like. In addition, 
substances that form gels can be used. Such materials include, e.g., proteins (e.g., 
gelatins), lipopolysaccharides, silicates, agarose and polyacrylamides. Where the solid 
surface is porous, various pore sizes may be employed depending upon the nature of the 
system. 

30 In preparing the surface, a plurality of different materials may be 

employed, particularly as laminates, to obtain various properties. For example, proteins 
(e.g., bovine serum albumin) or mixtures of macromolecules (e.g., Denhardt's solution) 
can be employed to avoid non-specific binding, simplify covalent conjugation, enhance 

14 
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signal detection or the like. If covalent bonding between a compound and the surface is 
desired, the surface will usually be polyfunctional or be capable of being 
polyfunctionalized. Functional groups which may be present on the surface and used for 
linking can include carboxylic acids, aldehydes, amino groups, cyano groups, ethylenic 
5 groups, hydroxyl groups, mercapto groups and the like. The manner of linking a wide 
variety of compounds to various surfaces is well known and is amply illustrated in the 
literature. 

For example, methods for immobilizing nucleic acids by introduction of 
various functional groups to the molecules is known (see, e.g., Bischoff (1987) Anal. 

10 Biochem., 164: 336-344; Kremsky (1987) Nucl. Acids Res. 15: 2891-2910). Modified 
nucleotides can be placed on the target using PCR primers containing the modified 
nucleotide, or by enzymatic end labeling with modified nucleotides. Use of glass or 
membrane supports (e.g., nitrocellulose, nylon, polypropylene) for the nucleic acid arrays 
of the invention is advantageous because of well developed technology employing 

1 5 manual and robotic methods of arraying targets at relatively high element densities. Such 
membranes are generally available and protocols and equipment for hybridization to 
membranes is well known. 

Target elements of various sizes, ranging from 1 mm diameter down to 1 
Urn can be used. Smaller target elements containing low amounts of concentrated, fixed 

20 probe DNA are used for high complexity comparative hybridizations since the total 

amount of sample available for binding to each target element will be limited. Thus it is 
advantageous to have small array target elements that contain a small amount of 
concentrated probe DNA so that the signal that is obtained is highly localized and bright. 
Such small array target elements are typically used in arrays with densities greater than 

25 1 0 4 /cm 2 . Relatively simple approaches capable of quantitative fluorescent imaging of 1 
cm 2 areas have been described that permit acquisition of data from a large number of 
target elements in a single image (see, e.g., Wittrup (1994) Cytometry 16:206-213). 

If fluorescently labeled nucleic acid samples are used, arrays on solid 
surface substrates with much lower fluorescence than membranes, such as glass, quartz, 

30 or small beads, can achieve much better sensitivity. Substrates such as glass or fused 
silica are advantageous in that they provide a very low fluorescence substrate, and a 
highly efficient hybridization environment. Covalent attachment of the target nucleic 
acids to glass or synthetic fused silica can be accomplished according to a number of 

15 
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known techniques (described above). Nucleic acids can be conveniently coupled to glass 
using commercially available reagents. For instance, materials for preparation of 
silanized glass with a number of functional groups are commercially available or can be 
prepared using standard techniques (see, e.g., Gait (1984) Oligonucleotide Synthesis: A 
5 Practical Approach, IRL Press, Wash., D.C.). Quartz cover slips, which have at least 10- 
fold lower auto fluorescence than glass, can also be silanized. 

Alternatively, probes can also be immobilized on commercially available 
coated beads or other surfaces. For instance, biotin end-labeled nucleic acids can be 
bound to commercially available avidin-coated beads. Streptavidin or anti-digoxigenin 

1 0 antibody can also be attached to silanized glass slides by protein-mediated coupling using 
e.g., protein A following standard protocols (see, e.g., Smith (1992) Science 258: 1 122- 
1 126). Biotin or digoxigenin end-labeled nucleic acids can be prepared according to 
standard techniques. Hybridization to nucleic acids attached to beads is accomplished by 
suspending them in the hybridization mix, and then depositing them on the glass substrate 

15 for analysis after washing. Alternatively, paramagnetic particles, such as ferric oxide 
particles, with or without avidin coating, can be used. 

A variety of other nucleic acid hybridization formats are known to those 
skilled in the art. For example, common formats include sandwich assays and 
competition or displacement assays. Hybridization techniques are generally described in 

20 Hames and Higgins (1 985) Nucleic Acid Hybridization, A Practical Approach, IRL Press; 
Gall and Pardue (1969) Proc. Natl. Acad. Sci. USA 63: 378-383; and John et al. (1969) 
Nature 223: 582-587. 

Sandwich assays are commercially useful hybridization assays for 
detecting or isolating nucleic acid sequences. Such assays utilize a "capture" nucleic acid 

25 covalently immobilized to a solid support and a labeled "signal" nucleic acid in solution. 
The sample will provide the target nucleic acid. The "capture" nucleic acid and "signal" 
nucleic acid probe hybridize with the target nucleic acid to form a "sandwich" 
hybridization complex. To be most effective, the signal nucleic acid should not hybridize 
with the capture nucleic acid. 

30 Detection of a hybridization complex may require the binding of a signal 

generating complex to a duplex of target and probe polynucleotides or nucleic acids. 
Typically, such binding occurs through ligand and anti-ligand interactions as between a 
ligand-conjugated probe and an anti-ligand conjugated with a signal. 

16 
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The sensitivity of the hybridization assays may be enhanced through use of 
a nucleic acid amplification system that multiplies the target nucleic acid being detected. 
Examples of such systems include the polymerase chain reaction (PCR) system and the 
ligase chain reaction (LCR) system. Other methods recently described in the art are the 

5 nucleic acid sequence based amplification (NASBAO, Cangene, Mississauga, Ontario) 
and Q Beta Replicase systems. 

Nucleic acid hybridization simply involves providing a denatured probe 
and target nucleic acid under conditions where the probe and its complementary target 
can form stable hybrid duplexes through complementary base pairing. The nucleic acids 

10 that do not form hybrid duplexes are then washed away leaving the hybridized nucleic 
acids to be detected, typically through detection of an attached detectable label. It is 
generally recognized that nucleic acids are denatured by increasing the temperature or 
decreasing the salt concentration of the buffer containing the nucleic acids, or in the 
addition of chemical agents, or the raising of the pH. Under low stringency conditions 

1 5 (e.g., low temperature and/or high salt and/or high target concentration) hybrid duplexes 
(e.g., DNArDNA, RNA:RNA, or RNA:DNA) will form even where the annealed 
sequences are not perfectly complementary. Thus specificity of hybridization is reduced 
at lower stringency. Conversely, at higher stringency (e.g., higher temperature or lower 
salt) successful hybridization requires fewer mismatches. 

20 One of skill in the art will appreciate that hybridization conditions may be 

selected to provide any degree of stringency. In a preferred embodiment, hybridization is 
performed at low stringency to ensure hybridization and then subsequent washes are 
performed at higher stringency to eliminate mismatched hybrid duplexes. Successive 
washes may be performed at increasingly higher stringency (e.g., down to as low as 0.25 

25 X SSPE-T at 37°C to 70°C) until a desired level of hybridization specificity is obtained. 
Stringency can also be increased by addition of agents such as formamide. Hybridization 
specificity may be evaluated by comparison of hybridization to the test probes with 
hybridization to the various controls that can be present. 

In general, there is a tradeoff between hybridization specificity 

30 (stringency) and signal intensity. Thus, in a preferred embodiment, the wash is performed 
at the highest stringency that produces consistent results and that provides a signal 
intensity greater than approximately 10% of the background intensity. Thus, in a 
preferred embodiment, the hybridized array may be washed at successively higher 

17 
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stringency solutions and read between each wash. Analysis of the data sets thus produced 
will reveal a wash stringency above which the hybridization pattern is not appreciably 
altered and which provides adequate signal for the particular probes of interest. 

Methods of optimizing hybridization conditions are well known to those of 
5 skill in the art (see, e.g., Tijssen (1993) Laboratory Techniques in Biochemistry and 
Molecular Biology, Vol 24: Hybridization With Nucleic Acid Probes, Elsevier, N.Y.). 

Labeling and detection of nucleic acids. 

In a prefeired embodiment, the hybridized nucleic acids are detected by 
detecting one or more labels attached to the sample or probe nucleic acids. The labels 

10 may be incorporated by any of a number of means well known to those of skill in the art. 
Means of attaching labels to nucleic acids include, for example nick translation or end- 
labeling (e.g. with a labeled RNA) by kinasing of the nucleic acid and subsequent 
attachment (ligation) of a nucleic acid linker joining the sample nucleic acid to a label 
(e.g., a fluorophore). A wide variety of linkers for the attachment of labels to nucleic 

15 acids are also known. In addition, intercalating dyes and fluorescent nucleotides can also 
be used. 

Detectable labels suitable for use in the present invention include any 
composition detectable by spectroscopic, photochemical, biochemical, immunochemical, 
electrical, optical or chemical means. Useful labels in the present invention include biotin 

20 for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads™), 
fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and 
the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., 3 H, 125 I, 
35 S, 14 C, or 32 P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others 
commonly used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold 

25 particles in the 40 -80 nm diameter size range scatter green light with high efficiency) or 
colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. Patents 
teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 
3,996,345; 4,277,437; 4,275,149; and 4,366,241. 

A fluorescent label is preferred because it provides a very strong signal 

30 with low background. It is also optically detectable at high resolution and sensitivity 
through a quick scanning procedure. The nucleic acid samples can all be labeled with a 
single label, e.g., a single fluorescent label. Alternatively, in another embodiment, 
different nucleic acid samples can be simultaneously hybridized where each nucleic acid 

18 
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sample has a different label. For instance, one target could have a green fluorescent label 
and a second target could have a red fluorescent label. The scanning step will distinguish 
cites of binding of the red label from those binding the green fluorescent label. Each 
nucleic acid sample (target nucleic acid) can be analyzed independently from one another. 

5 Suitable chromogens which can be employed include those molecules and 

compounds which absorb light in a distinctive range of wavelengths so that a color can be 
observed or, alternatively, which emit light when irradiated with radiation of a particular 
wave length or wave length range, e.g., fluorescers. 

Desirably, fluorescers should absorb light above about 300 nm, preferably 

10 about 350 nm, and more preferably above about 400 nm, usually emitting at wavelengths 
greater than about 1 0 nm higher than the wavelength of the light absorbed. It should be 
noted that the absorption and emission characteristics of the bound dye can differ from 
the unbound dye. Therefore, when referring to the various wavelength ranges and 
characteristics of the dyes, it is intended to indicate the dyes as employed and not the dye 

15 which is unconjugated and characterized in an arbitrary solvent. 

Fluorescers are generally preferred because by irradiating a fluorescer with 
light, one can obtain a plurality of emissions. Thus, a single label can provide for a 
plurality of measurable events. 

Detectable signal can also be provided by chemiluminescent and 

20 bioluminescent sources. Chemiluminescent sources include a compound which becomes 
electronically excited by a chemical reaction and can then emit light which serves as the 
detectable signal or donates energy to a fluorescent acceptor. Alternatively, luciferins can 
be used in conjunction with luciferase or lucigenins to provide bioluminescence. 
Spin labels are provided by reporter molecules with an unpaired electron spin which can 

25 be detected by electron spin resonance (ESR) spectroscopy. Exemplary spin labels 
include organic free radicals, transitional metal complexes, particularly vanadium, 
copper, iron, and manganese, and the like. Exemplary spin labels include nitroxide free 
radicals. 

The label may be added to the target (sample) nucleic acid(s) prior to, or 
30 after the hybridization. So called "direct labels" are detectable labels that are directly 

attached to or incorporated into the target (sample) nucleic acid prior to hybridization. In 
contrast, so called "indirect labels" are joined to the hybrid duplex after hybridization. 
Often, the indirect label is attached to a binding moiety that has been attached to the 
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target nucleic acid prior to the hybridization. Thus, for example, the target nucleic acid 
may be biotinylated before the hybridization. After hybridization, an avidin-conjugated 
fluorophore will bind the biotin bearing hybrid duplexes providing a label that is easily 
detected. For a detailed review of methods of labeling nucleic acids and detecting labeled 
5 hybridized nucleic acids see Laboratory Techniques in Biochemistry and Molecular 

Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed. Elsevier, N.Y., 
(1993)). 

Fluorescent labels are easily added during an in vitro transcription 
reaction. Thus, for example, fluorescein labeled UTP and CTP can be incorporated into 

10 the RNA produced in an in vitro transcription. 

The labels can be attached directly or through a linker moiety. In general, 
the site of label or linker-label attachment is not limited to any specific position. For 
example, a label may be attached to a nucleoside, nucleotide, or analogue thereof at any 
position that does not interfere with detection or hybridization as desired. For example, 

15 certain Label-ON Reagents from Clontech (Palo Alto, CA) provide for labeling 

interspersed throughout the phosphate backbone of an oligonucleotide and for terminal 
labeling at the 3' and 5' ends. As shown for example herein, labels can be attached at 
positions on the ribose ring or the ribose can be modified and even eliminated as desired. 
The base moieties of useful labeling reagents can include those that are naturally 

20 occurring or modified in a manner that does not interfere with the purpose to which they 
are put. Modified bases include but are not limited to 7-deaza A and G, 7-deaza-8-aza A 
and G, and other heterocyclic moieties. 

It will be recognized that fluorescent labels are not to be limited to single 
species organic molecules, but include inorganic molecules, multi-molecular mixtures of 

25 organic and/or inorganic molecules, crystals, heteropolymers, and the like. Thus, for 
example, CdSe-CdS core-shell nanocrystals enclosed in a silica shell can be easily 
derivatized for coupling to a biological molecule (Bruchez et al. (1998) Science, 281: 
2013-2016). Similarly, highly fluorescent quantum dots (zinc sulfide-capped cadmium 
selenide) have been covalently coupled to biomolecules for use in ultrasensitive 

30 biological detection (Warren and Nie (1998) Science, 28 1 : 20 1 6-201 8). 

Amplification-based assays. 

In another embodiment, amplification-based assays can be used to detect 
nucleic acids. In such amplification-based assays, the nucleic acid sequences act as a 

20 
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template in an amplification reaction (e.g. Polymerase Chain Reaction (PCR). Detailed 
protocols for quantitative PCR are provided in Innis et ai (1990) PCR Protocols, A Guide 
to Methods and Applications, Academic Press, Inc. N.Y.). 

Other suitable amplification methods include, but are not limited to ligase 
5 chain reaction (LCR) {see Wu and Wallace (1989) Genomics 4: 560, Landegren et ai 
(1988) Science 241: 1077, and Barringer et ai (1990) Gene 89: 1 17, transcription 
amplification (Kwoh et ai (1989) Proc. NatL Acad. ScL USA 86: 1 173), and self- 
sustained sequence replication (Guatelli et ai (1990) Proc. Nat. Acad. Sci. USA 87: 
1874). 

10 Detection of C. pneumoniae gene expression 

The nuch:ic acids of the invention can also be used to C. pneumoniae 

detect gene transcripts. Methods of detecting and/or quantifying gene transcripts using 

nucleic acid hybridization techniques are known to those of skill in the art (see Sambrook 

et ai supra). For example , a Northern transfer may be used for the detection of the 

15 desired mRNA directly. In brief, the mRNA is isolated from a given cell sample using, 
for example, an acid guanidinium-phenol-chloroform extraction method. The mRNA is 
then electrophoresed to separate the mRNA species and the mRNA is transferred from the 
gel to a nitrocellulose membrane. As with the Southern blots, labeled probes are used to 
identify and/or quantify the target mRNA. 

20 In another preferred embodiment, the gene transcript can be measured 

using amplification (e.g. PCR) based methods as described above for directly assessing 
copy number of the target sequences. 

Expression of C. pneumoniae proteins 

The nucleic acids disclosed here can be used for recombinant expression 

25 of the proteins. In these methods, the nucleic acids encoding the proteins of interest are 
introduced into suitable host cells, followed by induction of the cells to produce large 
amounts of the protein. The invention relies on routine techniques in the field of 
recombinant genetics, well known to those of ordinary skill in the art. A basic text 
disclosing the general methods of use in this invention is Sambrook et ai, Molecular 

30 Cloning, A Laboratory Manual (2nd ed. 1 989). 

Standard transfection methods are used to produce prokaryotic, 
mammalian, yeast or insect cell lines which express large quantities of the desired 
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polypeptide, which is then purified using standard techniques (see. e.g., Colley ex alj. 
Biol Chem. 264:17619-17622, 1989; Guide to Protein Purification, supra). 

The nucleotide sequences used to transfect the host cells can be modified 
to yield Chlamydia polypeptides with a variety of desired properties. For example, the 

5 polypeptides can vary from the naturally-occurring sequence at the primary structure 

level by amino acid, insertions, substitutions, deletions, and the like. These modifications 
can be used in a number of combinations to produce the final modified protein chain. 

The amino acid sequence variants can be prepared with various objectives 
in mind, including facilitating purification and preparation of the recombinant 

1 0 polypeptide. The modified polypeptides are also useful for modifying plasma half life, 
improving therapeutic efficacy, and lessening the severity or occurrence of side effects 
during therapeutic use. The amino acid sequence variants are usually predetermined 
variants not found in nature but exhibit the same immunogenic activity as naturally 
occurring protein. In general, modifications of the sequences encoding the polypeptides 

1 5 may be readily accomplished by a variety of well-known techniques, such as site-directed 
mutagenesis (see Gillman & Smith, Gene 8:81-97 (1979); Roberts et a/., Nature 328:731- 
734 (1987)). One of ordinary skill will appreciate that the effect of many mutations is 
difficult to predict. Thus, most modifications are evaluated by routine screening in a 
suitable assay for the desired characteristic. For instance, the effect of various 

20 modifications on the ability of the polypeptide to elicit a protective immune response can 
be easily determined using in vitro assays. For instance, the polypeptides can be tested 
for their ability to induce lymphoproliferation, T cell cytotoxicity, or cytokine production 
using standard techniques. 

The particular procedure used to introduce the genetic material into the 

25 host cell for expression of the polypeptide is not particularly critical. Any of the well 
known procedures for introducing foreign nucleotide sequences into host cells may be 
used. These include the use of calcium phosphate transfection, spheroplasts, 
electroporation, liposomes, microinjection, plasmid vectors, viral vectors and any of the 
other well known methods for introducing cloned genomic DNA, cDNA, synthetic DNA 

30 or other foreign genetic material into a host cell (see Sambrook et aL, supra). It is only 
necessary that the particular procedure utilized be capable of successfully introducing at 
least one gene into the host cell which is capable of expressing the gene. 

22 



WO0Q27994 (file:/A\dcwas03\firmdata\lp\FolevPat\PatentDocuments\WO0027994.c pc] 



Page 25 of 330 



WO 00/27994 PCT/US99/26923 

Any of a number of well known cells and cell lines can be used to express 
the polypeptides of the invention. For instance, prokaryotic cells such as E. coli can be 
used. Eukaryotic cells include, yeast, Chinese hamster ovary (CHO) cells, COS cells, and 
insect cells. 

5 The particular vector used to transport the genetic information into the cell 

is also not particularly critical. Any of the conventional vectors used for expression of 
recombinant proteins in prokaryotic and eukaryotic cells may be used. Expression 
vectors for mammalian cells typically contain regulatory elements from eukaryotic 
viruses. 

1 o The expression vector typically contains a transcription unit or expression 

cassette that contains all the elements required for the expression of the polypeptide DNA 
in the host cells. A typical expression cassette contains a promoter operably linked to the 
DNA sequence encoding a polypeptide and signals required for efficient polyadenylation 
of the transcript. The terni "operably linked" as used herein refers to linkage of a 

15 promoter upstream from a DNA sequence such that the promoter mediates transcription 
of the DNA sequence. The promoter is preferably positioned about the same distance 
from the heterologous transcription start site as it is from the transcription start site in its 
natural setting. As is known in the art, however, some variation in this distance can be 
accommodated without loss of promoter function. 

20 Following the growth of the recombinant cells and expression of the 

polypeptide, the culture medium is harvested for purification of the secreted protein. The 
media are typically clarified by centrifugation or filtration to remove cells and cell debris 
and the proteins are concentrated by adsorption to any suitable resin or by use of 
ammonium sulfate fractionation, polyethylene glycol precipitation, or by ultrafiltration. 

25 Other routine means known in the art may be equally suitable. Further purification of the 
polypeptide can be accomplished by standard techniques, for example, affinity 
chromatography, ion exchange chromatography, sizing chromatography, His 6 tagging and 
Ni-agarose chromatography (as described in Dobeli et a/., Mol andBiochem. Parasit. 
41:259-268 (1990)), or other protein purification techniques to obtain homogeneity. The 

30 purified proteins are then used to produce pharmaceutical compositions, as described 
below. 

An alternative method of preparing recombinant polypeptides useful as 
vaccines involves the use of recombinant viruses (e.g., vaccinia). Vaccinia virus is grown 
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in suitable cultured mammalian cells such as the HeLa S3 spinner cells, as described by 
Mackett et a/., in DNA cloning Vol II: A practical approach, pp. 191-211 (Glover, ed.). 

Antibody Production 

The proteins of the present invention can be used to produce antibodies 
5 specifically reactive with C pneumoniae antigens. If isolated proteins are used, they may 
be recombinantly produced or isolated from Chlamydia cultures. Synthetic peptides 
made using the protein sequences may also be used. 

Methods of production of polyclonal antibodies are known to those of skill 
in the art. In brief, an immunogen, preferably a purified protein, is mixed with an 
1 0 adjuvant and animals are immunized. When appropriately high titers of antibody to the 
immunogen are obtained, blood is collected from the animal and antisera is prepared. 
Further fractionation of the antisera to enrich for antibodies reactive to Chlamydia 
proteins can be done if desired {see Harlow & Lane, Antibodies: A Laboratory Manual 
(1988)). 

15 Polyclonal antisera are used to identify and characterize Chlamydia in the 

tissues of patients using, for instance, in situ techniques and immunoperoxidase test 
procedures described in Anderson et al. J A VMA 198:241 (1991) andBarr et al Vet. 
Pathol 28:110-116(1991). 

Monoclonal antibodies may be obtained by various techniques familiar to 

20 those skilled in the art. Briefly, spleen cells from an animal immunized with a desired 
antigen are immortalized, commonly by fusion with a myeloma cell {see Kohler & 
Milstein, Eur. 1 Immunol 6:51 1-519 (1976)). Alternative methods of immortalization 
include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other 
methods well known in the art. Colonies arising from single immortalized cells are 

25 screened for production of antibodies of the desired specificity and affinity for the 

antigen, and yield of the monoclonal antibodies produced by such cells may be enhanced 
by various techniques, including injection into the peritoneal cavity of a vertebrate host. 

Monoclonal antibodies produced in such a manner are used, for instance, 
in ELISA diagnostic tests, immunoperoxidase tests, immunohistochemical tests, for the in 

30 vitro evaluation of spirochete invasion, to select candidate antigens for vaccine 

development, protein isolation, and for screening genomic and cDNA libraries to select 
appropriate gene sequences. 
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Ipmunodiagonostic detection of C. pneu moniae infections 

The present invention also provides methods for detecting the presence or 
absence of C pneumoniae, or antibodies reactive with it, in a biological sample. For 
instance, antibodies specifically reactive with Chlamydia can be detected using either 

5 Chlamydia proteins or the isolates described here. The proteins and isolates can also be 
used to raise specific antibodies (either monoclonal or polyclonal) to detect the antigen in 
a sample. In addition, the nucleic acids disclosed and claimed here can be used to detect 
Chlamydia-specific sequences using standard hybridization techniques. 

For a review of immunological and immunoassay procedures in general, 

1 0 see Basic and Clinical immunology (Stites & Terr ed., 7th ed. 1 99 1 )). The immunoassays 
of the present invention can be performed in any of several configurations, which are 
reviewed extensively in Enzyme Immunoassay (Maggio, ed., 1980); Tijssen, Laboratory 
Techniques in Biochemistry and Molecular Biology (1985)). For instance, the proteins 
and antibodies disclose 1 here are conveniently used in ELISA, immunoblot analysis and 

1 5 agglutination assays. 

In brief, immunoassays to measure znti-Chlamydia antibodies or antigens 
can be either competitive or noncompetitive binding assays. In competitive binding 
assays, the sample analyte (e.g., anti-Chlamydia antibodies) competes with a labeled 
analyte (e.g., anti-Chlamydia monoclonal antibody) for specific binding sites on a capture 

20 agent (e.g., isolated Chlamydia protein) bound to a solid surface. The concentration of 
labeled analyte bound to the capture agent is inversely proportional to the amount of free 
analyte present in the sample. 

Noncompetitive assays are typically sandwich assays, in which the sample 
analyte is bound between two analyte-specific binding reagents. One of the binding 

25 agents is used as a capture agent and is bound to a solid surface. The second binding 
agent is labelled and is used to measure or detect the resultant complex by visual or 
instrument means. 

A number of combinations of capture agent and labelled binding agent can 
be used. For instance, an isolated Chlamydia protein or culture can be used as the 
30 capture agent and labelled anti-human antibodies specific for the constant region of 

human antibodies can be used as the labelled binding agent. Goat, sheep and other non- 
laiman antibodies specific for human immunoglobulin constant regions (e.g., y or \i) are 
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well known in the art. Alternatively, the anti-human antibodies can be the capture agent 
and the antigen can be labelled. 

Various components of the assay, including the antigen, dXiix-Chlamydia 
antibody, or anti-human antibody, may be bound to a solid surface. Many methods for 
5 immobilizing biomolecules to a variety of solid surfaces are known in the art. For 
instance, the solid surface may be a membrane (e.g., nitrocellulose), a microtiter dish 
(e.g., PVC or polystyrene) or a bead. The desired component may be covalently bound or 
noncovalently attached through nonspecific bonding. 

Alternatively, the immunoassay may be carried out in liquid phase and a 
10 variety of separation methods may be employed to separate the bound labeled component 
from the unbound labelled components. These methods are known to those of skill in the 
art and include immunoprecipitation, column chromatography, adsorption, addition of 
magnetizable particles coated with a binding agent and other similar procedures. 

An immunoassay may also be carried out in liquid phase without a 
1 5 separation procedure. Various homogeneous immunoassay methods are now being 
applied to immunoassays for protein analytes. In these methods, the binding of the 
binding agent to the analyte causes a change in the signal emitted by the label, so that 
binding may be measured without separating the bound from the unbound labelled 
component. 

20 Western blot (immunoblot) analysis can also be used to detect the presence 

of antibodies to Chlamydia in the sample. This technique is a reliable method for 
confirming the presence of antibodies against a particular protein in the sample. The 
technique generally comprises separating proteins by gel electrophoresis on the basis of 
molecular weight, transferring the separated proteins to a suitable solid support, (such as a 

25 nitrocellulose filter, a nylon filter, or derivatized nylon filter), and incubating the sample 
with the separated proteins. This causes specific target antibodies present in the sample 
to bind their respective proteins. Target antibodies are then detected using labeled anti- 
human antibodies. 

The immunoassay formats described above employ labelled assay 

30 components. The label may be coupled directly or indirectly to the desired component of 
the assay according to methods well known in the art. A wide variety of labels may be 
used. The component may be labelled by any one of several methods. Traditionally a 
radioactive label incorporating 3 H, 125 1, 35 S, I4 C, or 32 P was used. Non-radioactive labels 
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include ligands which bind to labelled antibodies, fluorophores, chemiluminescent agents, 
enzymes, and antibodies which can serve as specific binding pair members for a labelled 
ligand. The choice of label depends on sensitivity required, ease of conjugation with the 
compound, stability requirements, and available instrumentation. 

5 Enzymes of interest as labels will primarily be hydrolases, particularly 

phosphatases, esterases and glycosidases, or oxidoreductases, particularly peroxidases. 
Fluorescent compounds include fluorescein and its derivatives, rhodamine and its 
derivatives, dansyl, umbelliferone, etc. Chemiluminescent compounds include luciferin, 
and 2,3-dihydrophthalazinediones, e.g., luminol. For a review of various labelling or 

10 signal producing systems which may be used, see U.S. Patent No. 4,391,904, which is 
incorporated herein by reference. 

Non-radioactive labels are often attached by indirect means. Generally, a 
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds 
to an anti-ligand (e.g., streptavidin) molecule which is either inherently detectable or 

15 covalently bound to a signal system, such as a detectable enzyme, a fluorescent 

compound, or a chemiluminescent compound. A number of ligands and anti-ligands can 
be used. Where a ligand has a natural anti-ligand, for example, biotin, thyroxine, and 
Cortisol, it can be used in conjunction with the labelled, naturally occurring anti-ligands. 
Alternatively, any haptenic or antigenic compound can be used in combination with an 

20 antibody. 

Some assay formats do not require the use of labelled components. For 
instance, agglutination assays can be used to detect the presence of the target antibodies. 
In this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, none of the components need be labelled and the presence of 
25 the target antibody is detected by simple visual inspection. 

Pharmaceutical Compositions 

The peptides or antibodies (typically monoclonal antibodies) of the present 
invention and pharmaceutical compositions thereof are useful for administration to 
mammals, particularly humans, to treat and/or prevent Chlamydia infections. Suitable 
30 formulations are found in Remington's Pharmaceutical Sciences, Mack Publishing 
Company, Philadelphia, PA, 17th ed. (1985). 

27 



WO0027994ffile:/A\dcwas03Vfirmdata\lp\FoleyPat\PatentDocurTients\WO0027994.c pc] . Page 30 of 330 

WO 00/27994 PCT/US99/26923 

The immunogenic peptides or antibodies of the invention are administered 
prophylactically or to an individual already suffering from the disease. The peptide 
compositions are administered to a patient in an amount sufficient to elicit an effective 
immune response to Chlamydia. An effective immune response is one that inhibits 

5 infection. An amount adequate to accomplish this is defined as "therapeutically effective 
dose 1 ' or "immunogenically effective dose." Amounts effective for this use will depend 
on, e.g., the peptide composition, the manner of administration, the stage and severity of 
the disease being treated, the weight and general state of health of the patient, and the 
judgment of the prescribing physician, but generally range for the initial immunization 

10 (that is for therapeutic or prophylactic administration) from about 0.1 mg to about 1.0 mg 
per 70 kilogram patient, more commonly from about 0.5 mg to about 0.75 mg per 70 kg 
of body weight. Boosting dosages are typically from about 0.1 mg to about 0.5 mg of 
peptide using a boosting regimen over weeks to months depending upon the patient's 
response and condition. A suitable protocol would include injection at time 0, 4, 2, 6, 10 

15 and 14 weeks, followed by further booster injections at 24 and 28 weeks. 

For therapeutic use, administration should begin at the first sign of 
infection. This is followed by boosting doses until at least symptoms are substantially 
abated and for a period thereafter. In some circumstances, loading doses followed by 
boosting doses may be required. The resulting immune response helps to cure or at least 

20 partially arrest symptoms and/or complications. Vaccine compositions containing the 
peptides are administered prophylactically to a patient susceptible to or otherwise at risk 
of the infection. 

The pharmaceutical compositions (containing either peptides or 
antibodies) are intended for parenteral or oral administration. Preferably, the 

25 pharmaceutical compositions are administered parenterally, e.g., subcutaneously, 
intradennally, or intramuscularly. Thus, the invention provides compositions for 
parenteral administration which comprise a solution of the immunogenic polypeptides 
dissolved or suspended in an acceptable carrier, preferably an aqueous carrier. A variety 
of aqueous carriers may be used, e.g., water, buffered water, 0.4% saline, 0.3% glycine, 

30 hyaluronic acid and the like. These compositions may be sterilized by conventional, well 
known sterilization techniques, or may be sterile filtered. The resulting aqueous solutions 
may be packaged for use as is, or lyophilized, the iyophilized preparation being combined 
with a sterile solution prior to administration. The compositions may contain 
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pharmaceutical ly acceptable auxiliary substances as required to approximate 
physiological conditions, such as buffering agents, tonicity adjusting agents, wetting 
agents and the like, for example, sodium acetate, sodium lactate, sodium chloride, 
potassium chloride, calcium chloride, sorbitan monolaurate, triethanolamine oleate, etc. 

5 The compositions may also comprise carriers to enhance the immune 

response. Useful carriers are well known in the art, and include, e.g., KLH, 
thyroglobulin, albumins such as human serum albumin, tetanus toxoid, polyamino acids 
such as poly(lysine:glutamic acid), influenza, hepatitis B virus core protein, hepatitis B 
virus recombinant vaccine and the like. 

10 For solid compositions, conventional nontoxic solid carriers may be used 

which include, for example, pharmaceutical grades of mannitol, lactose, starch, 
magnesium stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium 
carbonate, and the like. For oral administration, a pharmaceutically acceptable nontoxic 
composition is formed t y incorporating any of the normally employed excipients, such as 

15 those carriers previously listed, and generally 10-95% of active ingredient, that is, one or 
more peptides of the invention, and more preferably at a concentration of 25%-75%. 

As noted above, the peptide compositions are intended to induce an 
immune response to Chlamydia. Thus, compositions and methods of administration 
suitable for maximizing the immune response are preferred. For instance, peptides may 

20 be introduced into a host, including humans, linked to a carrier or as a homopolymer or 
heteropolymer of active peptide units from various Chlamydia proteins disclosed here. 
Alternatively, a "cocktail" of polypeptides can be used. A mixture of more than one 
polypeptide has the advantage of increased immunological reaction and, where different 
peptides are used to make up the polymer, the additional ability to induce antibodies to a 

25 number of epitopes. 

The compositions also include an adjuvant. As used here, number of 
adjuvants are well known to one skilled in the art. Suitable adjuvants include incomplete 
Freund's adjuvant, alum, aluminum phosphate, aluminum hydroxide, 
N-acetyl-muramyUL-threonyl-D-isoglutamine (thr-MDP), 

30 N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 1 1637, referred to as nor-MDP), 
N-acetylmuramyl-Ldanyl-D-isoglutaminyl-L-alanine-2-( 1 '^-dipalmitoyl-sn- 
g;ycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 1983 5 A, referred to as MTP-PE), 
and RIBI, which contains three components extracted from bacteria, monophosphoryl 
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lipid A, trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2% 
squalene/Tween 80 emulsion. The effectiveness of an adjuvant may be determined by 
measuring the amount of antibodies directed against the immunogenic peptide. 

The concentration of immunogenic peptides of the invention in the 

5 pharmaceutical formulations can vary widely, i.e. from less than about 0.1%, usually at 
or at least about 2% to as much as 20% to 50% or more by weight, and will be selected 
primarily by fluid volumes, viscosities, etc., in accordance with the particular mode of 
administration selected. 

The peptides of the invention can also be expressed by attenuated viral 

10 hosts, such as vaccinia or fowlpox. This approach involves the use of vaccinia virus as a 
vector to express nucleotide sequences that encode the peptides of the invention. Upon 
introduction into a host, the recombinant vaccinia virus expresses the immunogenic 
peptide, and thereby elicits an immune response. Vaccinia vectors and methods useful in 
immunization protocols are described in, e.g., U.S. Patent No. 4,722,848. Another vector 

15 is BCG (Bacille Calmette Guerin). BCG vectors are described in Stover et al. (Nature 
35 1 :456-460 (1991)). A wide variety of other vectors useful for therapeutic 
administration or immunization of the peptides of the invention, e.g., Salmonella typhi 
vectors and the like, will be apparent to those skilled in the art from the description 
herein. 

20 The DNA encoding one or more of the peptides of the invention can also 

be administered to the patient. This approach is described, for instance, in Wolff et ai y 
Science 247: 1465-1468 (1990) as well as U.S. Patent Nos. 5,580,859 and 5,589,466. 

In order to enhance serum half-life, the peptides may also be encapsulated, 
introduced into the lumen of liposomes, prepared as a colloid, or other conventional 

25 techniques may be employed which provide an extended serum half-life of the peptides. 
A variety of methods are available for preparing liposomes, as described in, e.g., Szoka et 
al., Ann. Rev, Biophys. Bioeng. 9:467 (1980), U.S. Pat. Nos. 4, 235,871, 4,501,728 and 
4,837,028. 

30 EXAMPLES 

The following examples are offered to illustrate, but no to limit the 
claimed invention. 
Example 1 : 

30 
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This example describes comparison of the C. pneumoniae genome 
disclosed here and the, previously sequenced, C. trachomatis genome (Stephens, et al 
Science 282:754-759 (1998)). 

The apparent low level of DNA homology between C. trachomatis and C 
5 pneumoniae (Campbell, et a/., J. Clin. Microbiol. 25:191 1-1916 (1987)) yet analogous 
cell structures and developmental cycles, predicts that comparative analysis of the two 
genomes will significantly enhance the understanding of both pathogens. Identification 
~~ of genes that are present in one species but not the other are of particular importance for 

the mutually exclusive biological, virulence and pathogenesis capabilities of each. 

10 Identification of genes shared between the two species strongly supports the requirement, 
for these capabilities in a biological system that has, over its long-term association with 
mammalian host cells, evolved to reduce the metabolic capacities while optimizing 
survival, growth and transmission of these unique pathogens. 

The previously sequenced C. trachomatis genome contains 1,042,519 

15 nucleotides and 875 likely protein-coding genes. Similarity searching permitted the 
inferred functional assignment of sequences 636 (60%) genes disclosed here and 251 
(23%) are similar to hypothetical genes for other bacterial organisms including those for 
C trachomatis. The remaining 186 (17%) genes are not homologous to sequences 
deposited in GenBank.. Seventy C. trachomatis genes are not represented in the C. 

20 pneumoniae genome. These are contained within blocks consisting of 2-17 genes and 19 
single genes. Of the 70 C. trachomatis genes without homologs in C. pneumoniae, 60 are 
classified as encoding hypothetical proteins. The remaining genes not represented in C. 
pneumoniae consist of the tryptophan operon (trpA t B t R), trpC, two predicted thiol 
protease genes, and 4 genes assigned to the phospholipase-D superfamily, 

25 It is evident that there is a high level of functional conservation between C 

pneumoniae and C. trachomatis as orthologs to C. trachomatis genes were identified for 
859 (80%) of the predicted coding sequences for C. pneumoniae. The level of similarity 
for individual encoded proteins spans a wide spectrum (22-95% amino acid identity) with 
an average of 62% amino acid identity between orthologs from the two species. The 

30 percent amino acid identity between orthologous chlamydial proteins is similar among 
functional groups with the highest for proteins associated with translation and the lowest 
for proteins whose function in chlamydiae is uncharacterized and not related to proteins 
encoded by other organisms. The gene order of the homologous set of genes in C. 
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pneumoniae shows reorganization relative to the genome of C trachomatis', however, 
there is a high level of synteny for the gene organization of the two genomes. We 
identified thirty-nine blocks of 2 or more genes whose gene organization is colinear with 
homologs to C. trachomatis, although some of these are inverted. The distribution of 
5 genome reorganization is not evenly distributed on the chromosome as the region 
between C pneumoniae coding sequences 0130-0300 contains substantially more 
reorganization than other areas of the genome. This region coincides with the predicted 
chromosome replication terminus. 

We identified orthologs of enzymes characterized in other bacteria that 

10 account for the essential requirements for DNA replication, repair, transcription and 
translation including two predicted DNA helicases of the Swi2/Snf2 family found in C 
trachomatis. Similar to C. trachomatis, alternative sigma subunits for RNA polymerase, 
ct 28 and a 54 , were identified in addition to anti-a regulatory system factors RsbV, a 
RsbW-like single-domain histidine kinase, and a RsbU-like protein phosphatase. These 

15 findings suggest that the fundamental mechanisms of transcriptional regulation are 
conserved among Chlamydia. The C. trachomatis proteins containing SET and SWIB 
domains, and a SWIB domain fused to the C-terminus of the chlamydial topoisomerase I, 
not identified outside eukaryotes, are found in C pneumoniae supporting their possible 
role in the chromatin condensation-decondensation characteristic of the biologically 

20 unique chlamydial developmental cycle. 

The central metabolic pathways inferred from the C. pneumoniae genome 
sequence are the same as those identified for C trachomatis C pneumoniae has a 
glycolytic pathway and a linked tricarboxylic acid cycle, although likely functional, is 
incomplete as genes for citrate synthase, aconitase, and isocitrate dehydrogenase were not 

25 identified. C pneumoniae has a complete glycogen synthesis and degradation system 
supporting a role for glycogen synthesis and utilization of glucose-derivatives in 
chlamydial metabolism. Genes encoding essential functions in aerobic respiration are 
present and electron flux may be supported by pyruvate, succinate, glycerol-3-phosphate, 
and NADH dehydrogenases, NADH-ubiquinone oxidoreductase and cytochrome oxidase. 

30 C pneumoniae also contains the V (vacuolar)-type ATPase operon and the two ATP 
translocases found in C trachomatis. 

The type-Ill secretion virulence system required for invasion by several 
pathogenic bacteria and found in the C trachomatis genome in three chromosomal 
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locationsis also present in the C pneumoniae genome. Each of the components is 
conserved and their relative genomic contexts are conserved. Genes such as a predicted 
serine/threonine protein kinase and other genes physically linked to genes encoding 
structural components of the type-Ill secretion apparatus, but without identified 
5 homologs, are also highly similar between the two species suggesting the functional roles 
in modifying cellular biology are fundamentally conserved. 

Chlamydia-encoded proteins that are not found in chlamydial organisms 
but localized to the intracellular chlamydial inclusion membrane are likely essential for 
the unique intracellular biology and perhaps differences in inclusion morphology 
10 observed between species of Chlamydia. Several such proteins, termed IncA,B&C, have 
been characterized for a C. psittaci strain (Rockey, et al. Mol Microbiol. 15:617-626 
(1995); Rockey et al. Inject. Immun. 62:106-1 12 (1994)). C pneumoniae and C. 
trachomatis encode orthologs to C psittaci IncB and IncC and C. trachomatis also 
contains an ortholog to Lie A. C. pneumoniae contains two genes that encode proteins 
15 with similarity to IncA (CPn0l86 and CPn0585), although the level of homology is low 
suggesting analogous but possibily altered functions. 

The tryptophan biosynthesis operon (trpA, trpB, trpR) and trpC identified 
in C. trachomatis is conspicuously missing in the C pneumoniae genome. This 
represents the entire repertoire of genes associated with tryptophan biosynthesis identified 
20 in C. trachomatis. Seventeen genes adjacent to the C trachomatis tryptophan operon also 
were not found in the C pneumoniae genome. This region is the single largest loss of a 
contiguous genomic segment and includes 4 HKD superfamily encoding genes that 
encompass a family of proteins related to endonuclease and phospholipase D. These 
findings may be important for the ability of Chlamydia to persist in their hosts and cause 
25 disease by eliciting potent, focal and persistent inflammatory responses thought to be 
essential for pathogenesis. 

The C. pneumoniae genome contains 187,71 1 additional nucleotides 
compared to the C. trachomatis genome, and the 214 coding sequences not found in C 
trachomatis account for most of the increased genome size. Eighty-eight of these genes 
30 are found in blocks of >10 genes (1 1-30 genes/block), 41 are single genes, and the 
remainder are partnered with at least one other gene. Based upon the observation that 
-70% of all the C pneumoniae genes have an identifiable homolog in GenBank, 
exclusive of C. trachomatis, it would be expected that over 150 of the 214 genes should 
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have a homolog in GenBank, many associated with a function. However, only 28 coding 
sequences have similarity to genes from other organisms. Thus the majority of the genes 
that are mutually exclusive of C. trachomatis (186 of 214), and the 60 of 70 C. 
trachomatis genes that lacked an identifiable homolog in C. pneumoniae, do not have 

5 detectable homologs to genes from other organisms. We predict that most of the unique 
genes are essential for specific attributes that define the differential biology, tropism and 
pathogenesis of C trachomatis and C. pneumoniae. Moreover, this suggests that C. 
pneumoniae has more unique biological (i.e., virulence) capacity than C. trachomatis. 
The ability of C. pneumoniae to be more invasive and survive in a broader range of host 

10 cell types than C trachomatis is consistent with this hypothesis. Not all of the 

differences in biological capacity may be associated with mutually exclusive genes. One 
explanation for the significantly lower level of homology between protein sequences 
assigned as having C. pneumoniae and C. trachomatis orthologs but no identifiable 
orthologs in other organisms is that this set of proteins is not only associated with 

15 biological requirements specific for Chlamydia but this polymorphism may account for 
differential biology between the two species. The determination of the genome sequence 
from a representative of the C. psittaci group will precisely delineate those genes that are 
mutually exclusive and specific for each species. 

The major functionally identifiable addition to the C. pneumoniae genome 

20 is a large expansion of genes encoding a new family of chlamydial polymorphic 

membrane proteins (Pmp), alone representing 22% of the increased coding capacity. 
While the C. trachomatis genome has 9 pmp genes, remarkably the C. pneumoniae 
genome contains 21 pmp genes. Most of these genes appear to be amplified in two 
regions of the genome with three stand-alone genes. Interestingly one of the stand-alone 

25 genes is most closely related to the C. trachomatis pmpD which is the only stand-alone 
pmp gene in the C trachomatis genome and it is located with the same relative genomic 
context, suggesting an essential and conserved function for this paralog. Six Pmp-coding 
genes are presumably not functional as five contain predicted coding frame-shifts and one 
is truncated. The amplification of this gene family and the confidently predicted frame- 

30 shifts suggest a specific molecular mechanism to promote functional or antigenic 

diversity. The biological role of this protein family remains enigmatic, although at least 
one of the proteins in C. psittaci related to this family is exposed on the chlamydial 
surface. 

34 



WO0027994 f file:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\WO0027994.c pc] 



Page 37 of 330 



WO 00/27994 PCT/US99/26923 

While a function could not be assigned for most of the unique C. 
pneumoniae genes, several have significant similarity to genes from other organisms. 
Functional assignments could be made for genes encoding GMP synthetase, IMP 
dehydrogenase, UMP synthase, uridine kinase, biotin synthase pathway proteins, 
5 methylthioadenosine nucleosidase, a DNA glycosylate and aromatic amino acid 
hydroxylase. Thus a complete pathway was identified for biotin biosynthesis. The 
additional purine and pyrimidine salvage pathway genes presumably reflect metabolic 
limitations in one of the cell types that C. pneumoniae infects or differences in the ability 
of C. pneumoniae to transport precursor nucleosides or nucleotides. 

10 The addition of aromatic amino acid hydroxylase in C. pneumoniae is 

intriguing especially in light of the loss of tryptophan biosynthetic genes and the inability 
to synthesize other amino acids including phenylalanine. Aromatic amino acid 
hyroxlyases include three distinct enzymes that function to receptively oxidize 
phenylalanine to tyrosine, tyrosine to Dopa, and tryptophan to 5-hydroxytryptophan and 

15 serotonin. Although the chlamydial protein is similar to proteins of this family and 

incrementally more closely related to tryptophan hydroxylase, its specific function could 
not be confidently predicted. We hypothesize that it may be involved in C pneumoniae 
virulence. Tryptophan hydroxylase has not been previously identified in bacteria and the 
origin of the chlamydial gene appears to be from eukaryotes. The functional role of an 

20 aromatic amino acid hydroxylase for C. pneumoniae is linked to the unique intracellular 
biology of this organism and may represent a key contribution to C. pneumoniae 
persistence and pathogenesis. 

It is understood that the examples and embodiments described herein are 
for illustrative purposes only and that various modifications or changes in light thereof 

25 will be suggested to persons skilled in the art and are to be included within the spirit and 
purview of this application and scope of the appended claims. All publications, patents, 
and patent applications cited herein are hereby incorporated by reference in their entirety 
for all purposes. 

Table 1 provides functional assignments of C. pneumoniae nonprotein- 
30 encoding genomic sequences. Table 2 provides functional assignments of protein coding 
sequences. Table 3 provides the amino acid sequences of the proteins corresponding to 
the coding sequences. 
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e 


SEQ!DNO:1 




start position 


Ori 


841664 


tmRNA 


138493 


pRNA 


607342 


rRNA 


1000564 


rRNA 


1002415 


rRNA 


1005393 


tRNA 


269070 


tRNA 


164318 


tRNA 


296224 


tRNA 


836191 


tRNA 


1030533 


tRNA 


784896 


tRNA 


781680 


tRNA 


961536 


tRNA 


999949 


tRNA 


268992 


tRNA 


672236 


tRNA 


680178 


tRNA 


715889 


tRNA 


739403 


tRNA 


1175863 


tRNA 


784994 


tRNA 


843926 % 


tRNA 


409922* 


tRNA 


631373 


tRNA 


677337 


tRNA 


807413 


tRNA 


877473 


tRNA 


462141 


tRNA 


1085605 


tRNA 


786780 


tRNA 


89728 


tRNA 


293477 


tRNA 


87522 


tRNA 


199301 


tRNA 


199390 


tRNA 


626904 


tRNA 


708359 


tRNA 


1142034 


tRNA 


1230028 


tRNA 


91070 


tRNA 


293399 


CRN A 


296147 


tRNA 


1137389 



PCT/US99/26923 

TABLE 1 



SEQ ID NO: 1 Gene 
end position 







1 J oU / 4 


( ft J CmrtNA 


OU / 049 


KlDOuUCicaSe fr fsT».r\ 


1U 


ICC rDMl 
10 b tivNA 


10 0 DZ7 0 








269142 


% 1 3 j» DMA 1 

Aid CKNA_1 




ASH CRNA 


ft ft ^ ^ CI 

296151 


(R) ASp tRNA 


83 6119 


V K ; Ala CKNA_z 


1 ft ft ft ^ft. ft 

1030603 


Cys CRNA 


•* O 4 ft ft ft 

784822 


(R) GlU CRNA 


781510 


(K) oly CRNA^l 


ft e ^ ft "T 

961607 


Gly CRNA_ 2 


« ft ft ft ft ft *i 
1000023 


HIS CRNA 


259065 


lie CRNA 


£ "T ft ft *1 O 

572318 


L6U CKWA_1 


680257 


Leu CKNA 4 


715971 


Leu CRIsIA_J 


739480 


LeU 


1175944 


T At t t- DMA 5 
LeU 


ft* q 4(JT5 


\ ft / Lys crux a 




rZTO CKNA_* 


409648 


(K) rTO CruNA_i 


fa J 1445 


Dft a t" DMA 

file CrCNA 


577254 


fn \ HOMA 0 

IK; Ary CKNA^Z 


OU / J41 


I ft ) Aty UfUM/V_ J 


Oti 400 


I ft ) Airy ufUN/\_ ti 


462214 


Arg tRNAJL 


10.85676 


Gin tRNA 


786708 


(R) Thr tRNA_3 


89657 


(R) Thr tRNA_l 


293405 


(R) Thr tRNA_2 


87450 


{R) Met tRNA_l 


199229 


(R) Met tRNA_2 


199317 


(R) Met tRNA_3 


626987 


Ser tRNA_l 


708440 


Ser tRNA_2 


1142117 


Ser tRNA_3 


1229945 


(R) Ser tRNA_4 


90999 


(R) Trp tRNA 


293317 


(R) Tyr tRNA 


296075 


(R) Val tRNA_l 


1137462 


VaL CRNA_2 
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CPnOQOl 


212 


4 


CPO0003 


973 


675 


CPnOOOJ 


899 


2370 


i-m)U>ll 


J 170 


: i : \ 




4127 


*4'J2 


CPn0006 


7293 


7141 


CPO00Q7 


7605 


10496 


CPnOOOS 


10973 


11665 


CPtlO0O9 


11915 


13119 


CPnOOlO 


13435 


14325 


CPnooio 


14379 


1S746 


CPnOOii 


1S892 


16614 


CPn0012 


16644 


16212 


cpnoais 


18S84 


21106 


CPn0014 


21392 


21922 


CPnOOlS 


21835 


24174 


CPn00l6 


24416 


26168 


CPtlO017 


26094 


27170 


CrawOlO 


27522 


29003 


qpn0019 


29007 


30356 


CPH0020 


32687 


30603 


CPB0021 


34410 


32707 


CPOC022 


34962 


34395 


CP&0023 


36603 


35014 


CPH0024 


37S96 


36661 


CPB0O2S 


36604 


37684 


CPn0026 


39625 


38762 


CPn0027 


42234 


39778 


CF&0028 


43325 


42543 


C9B0029 


43795 


43390 


CPB0030 


43891 


44529 


CPB0031 


44711 


44864 


CP&0032 


44923 


46098 


CPn0033 


46138 


46171 


CPn0034 


49457 


46210 


CPn003S 


51029 


49569 


CPn0036 


51002 


51796 


CPnO037 


51792 


52115 


CPn0038 


52119 


53631 


CPttO039 


54250 


53963 


CPn0040 


55643 


54318 


CPn0041 


55996 


57342 


CPnQ042 


57403 


56182 


CPH0043 


58447 


60372 


CPn0044 


60419 


60776 


CPn0045 


61069 


62790 


CPn0046 


62790 


63263 


CPn0047 


63455 


63652 


CPn0048 


63687 


6S8Q1 


CPn0049 


66296 


65617 


CPnOOSO 


66813 


66499 


CPnOOSl 


66833 


67111 


CPn0052 


68005 


67304 


CPnOOS3 


69344 


67966 


CP&00S4 


70023 


69313 


CPnOOSS 


70129 


70590 


CPnOOS* 


70953 


72746 


CPti0057 


72934 


73554 


CPn0058 


73639 


74562 


CPn0059 


T4616 


75050 


CPn0060 


75055 


75528 


cpnon< I 


75534 


76208 


cpnon«2 


76)06 


77690 


CPflOO<3 


78U2 


76267 


CPnO044 


74)46 


78576 


TPnO<»« 


749:4 


60651 


CPn0i)64 


H09:5 


8.2655 



TABLE 2 



CT001 hypochetical procain 

gaeC-Glu-tBHA Cln AmiaotransJsrsss tC subunit) • (CT002) 
a«A-Clu c*KA Gin Aaidotrsnsf erae-{CT003) 

.H1B IP»L12I -U'l rt'riA Olft Ami-iocfins^WS-* ( B Dublin i r. > - ICTOOI i 
p»p_i- Polymorphic Outer Meaoren* Protein C Feaiiy 



-shite with 0010 



pmp_2- Polymorphic Outer Membra ns Protein C family 
pnp.3 -Polymorphic Outer Membrane Protein G really 
pmp_3-PMF_3 (Crame-shife wich 0014) 
pmp_4 -Polymorphic Outer Membrane Protein G really 
pmp_4-PKP_4 (frame -shift with 0016) 
pmp_5 - Polymorphic Outer Hembrane Protein G really 
pmp_5-PKP_S (fraae-shift with 0018) 
Predicted OMP (leader (14) peptide t outer ■ 
Predicted OKP (leader (19) peptide] -(CT3 SO) 
ma£-(CT349) 

yjjx/alr-ABC Transporter Protein ATPaee- (CT348) 
xerC-xntegrase/recoabinaea- (CT347 ) 
elaC/atsA-Sulphohydrolase/Glycosulfatase- ( CT346) 
CT34S hypothetical protein- (CT345) 
lan-Lon ATP-dependenc Protease- (CT344) 



ane]-(CT351) 



gcp_l-0-Sieloglycoprocein £ndopeptid&**_l- (CT343) 
rs21-S21 Ribosoaal Protein- (CT3 42) 

dnaJ-Heat Shock Protein J-JCT341) 

pdhAAB/odbAaodbB- (pyruvate) Oxo i s oval era te Dehydrogenase 
Fusion- (CT3 40) 

CT339 hypothetical protein 

CT338 hypothetical protein 

ptsH-PTS phosphocarrier Protein Kpr-(CT337) 

ptsI-PTS PEP Phosphotransferase-(CT336) 

ybaB-(CT335) 

dnaX_l-ONA Pol XXZ Gaama and Tau.l-(CT334) 



Alpha a Beta 



"yqfr-Bs conserved hypothetical IK protein 



heaC- Porphobilinogen Deaminase- (CT299) 
sms-SfflJ Proceln-(CT296) 
rnc-Rihonuclease XIX-(CT297) 
CT296 hypothetical pro coin 
. mrsA-Phosphomannomutase- (CT29S) 
sodM- Superoxide Dismutase (Kn) -(CT294) 
accO-AcCoA Carboxylase /Transferase Beta-(CT293) 
dut-dUTP Nucleocidohydrolase-(CT292) 
pcsN.l-PTS IIA Protein- (CT291) 

ptsN_2-PTS IIA Procain • KTK DMA-Binding OomaLn- (CT290) 
CTZ89 hypothetical protein 



CT288 hypoeher.i.Mi pror.«in 
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CPn0067 


62953 


84053 


F 


CPn0068 


84903 


84331 


R 


CPn0069 


85236 


87086 


F 


CPnOO70 


87378 


67208 


R 


CPn007l 


88045 


87599 


R 


CPn0072 


8906X 


68057 


R 


CPn0073 


89356 


69574 




CPn0074 


89774 


90955 


p 


CPn0075 


91102 


91350 


F 


CPn0076 


9X358 


91903 


F 


CPn0077 


92013 


92435 


F 


CPn0078 


92465 


93160 


F 


CPn0079 


93179 


93688 


F 


CPnOOSO 


93735 


94121 


P 


CPnOOSl 


94261 


98016 


F 


CPnOOS2 


98043 


102221 


F 


CPn0083 


102332 


103312 


F 


CPn0084 


X03362 


103751 


P 


CPn0085 


X04506 


103766 




CPn0086 


104904 


10S527 


P 


CPn0087 


105579 


106376 


F 


CPnOOSB 


106373 


108145 


F 


CPn0089 


108153 


109466 


P 


ppn0090 


109454 


110060 


F 


CPn0091 


110074 


112053 


F 


CPn0092 


112151 


112573 




CPn0093 


112509 


113015 


F 


CPD0094 


113152 


115971 


F 


CPn0095 


116037 


118790 


F 


CPn0096 


124314 


118837 




CPn0097 


124555 


126006 


F 


CPn0098 


127491 


126091 




CPn0099 


127593 


127865 


P 


CPaOlOO 


129141 


127882 


R 


CPnOlOl 


129932 


129141 


R 


CPn0102 


130123 


131466 


F 


CPn0103 


131480 


132511 


F 


CPn0104 


133875 


132676 


R * 


CPnOIOS 


134847 


134029 


R 


CPn0106 


135091 


136374 


F 


CPn0107 


137162 


136392 


R 


CPnOIOS 


137857 


137303 


R 


CP&0109 


138655 


141783 


F 


CPnOilO 


143734 


141827 


R 


CPnOlll 


X44686 


143934 


R 


CPn0112 


144767 


145093 


F 


CPn0113 


145335 


146405 


F 


CPn0114 


146398 


147261 


F 


CPnOUS 


147279 


148622 


F 


CPn0116 


148616 


148972 


F 


CPn0117 


148989 


150071 


F 


cpnoxxs 


150102 


150464 


• F 


CPnOXX9 


150523 


151164 


F 


CP&0X20 


151164 


151778 


F 


CPnOX2X 


151778 


152068 


F 


CPnOX22 


152071 


153723 


F 


CPnOX23 


155969 


153774 


R 


CPI10X24 


1566X4 


158068 


F 


CPn0125 


X58096 


158605 


F 


CPnOX26 


X5B809 


161085 


F 


CPn0127 


X62X43 


161130 


R 


CPnOX28 


162277 


163053 


F 


CPnOX29 


1637X7 


163064 


R 


CPn0130 


X64245 


163751 


R 


CPn0131 


164549 


165580 


F 


CPnOX3 2 


165587 


166561 


F 


CPnOX33 


167334 


166564 


R 


CPn0134 


169098 


167467 


R 


CPBOX3 5 


169448 


169X43 


R 


CPnOX36 


171401 


X69569 


R 


CPnOX37 


172254 


17X502 


R 


CPnOX38 


174019 


172700 


R 



CT360 hypothetical protein 



C732S hypothetical protein 
CT324 hypothetical protein 
infA-Initiation Factor IF-1-(C7323) - 
tufA-Elongation Factor Tu-(CT322) 
secE-preprotein translocate- (CT321 ) 
nusG -Transcriptional Ant i termination- (CT320) 
rllX-Ul Ribosomal Protein* (CT319) 
rll-Ll Ribosomal Protein- (CT318 ) 
rllO-LlO Ribosomal Protein- (CT317) 
rl7-L7/L12 Ribosomal Protein- (CT31 6) 
rpoB-RNA Polymerase Beta-(CT315) 
rpoC-RNA Polymerase Beta' -(CT314) 
tal-Transaldolase- (CT313 ) 
predicted ferredoxin- (CT312) 
CT311 hypothetical protein 
atpE-ATP Synthase Subunit E-(CT310J 
CT309 hypothetical protein 
atpA-ATP Synthase Subunit A- (CT308) 
atpB-ATP Synthase Subunit B-<CT307) 
atpD-ATP Synthase Subunit D-(CT306) 
atpI-ATP Synthase Subunit I-(CT305) 
atpK-ATP Synthase Subunit K- (CT304) 
CT303 hypothetical protein 
valS-Valyl tRKA Synthetase- (CT3 02 ) 
pknD-S/T Protein Kinase- (CT3 01) 
uvrA-Excinuelease ABC Subunit A-CCT333) 
pyk-Pyruvate Kinase- <CT3 32) 
htrB-Acyl transferase- (CT010) 

CT011 hypothetical protein 

ybbP family hypothetical protein- (CT012) 

cydA -Cytochrome Oxidase Subunit I- (CT013) 

cydfi -Cytochrome Oxidase Subunit II- (CT014) 

CT017 hypothetical protein 

CT016 hypothetical protein 

phoH-ATPase-(CT015) 

CT058 hypothetical proteizv.1 

CT018 

UeS-Isoleucyl-tRNA Synthetase- (CT01 9) 

lepB-Signal Peptidase I-(CT020) 

CT021 hypothetical protein 

rl31-L31 Ribosomal Protein- (CT022) 

pfrA-Peptide Chain Releasing Factor (RF-1) - CCT023) 

hemK-A/G specific methylase- (CT024) 

ffh-Signal Recognition Particle GTPase- (CT025) 

rs!6-S16 Ribosomal Protein- (CT026) 

trmO-cRNA (guanine N-l ) -Methyl trans ferae e- (CT0 27 ) 

rll9-L19 Ribosomal Protein- (CT0 28) 

rnhB.l-Ribonuclease HII_i- (CT029) 

gmk-GMP Kinase- (CT030) 

CT031 hypothetical protein 

metG-Methionyl-tRNA Synthetase- (CT032) 

recO.l-Exodeoxyribonuc lease V (Alpha Subunit) _1- (CT03 3 ) 



ytfF-Cationic Amino Acid Transporter- (CT03 4) 
bpll-Biotin Protein Ligase- (CT035) 
similarity to CT036 



CKLPS hypothetical protein- (CT109) 
groEL_l-KSF-60_l- (CT110) 
groES-lOKOa Chaperonin- (CT111) 
pepF-Oligopeptidase- (CT112) 
ybgl-ACR family- (CT108) 

hemL-Glutamate-l-semialdehyde-2, 1-aminomutase- (CT210) 
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CPn0139 


174636 


174093 


R 


yqgE-(CT210) 


CFA0140 


173110 


174673 


R 


yQjdX- (CT2121 


CPB0141 


173602 


173110 


R 


rplA-Rlbooo-5-P Xsomerese A-ICT213) 


CPn0142 


176091 


179916 


R 




crnoi43 


177333 


176214 


R 


*yxjGLBs_l Hypothetical Pro to in 


CFn0144 


177963 


180360 


F 


clpa-Clp Proteaee ATPeso- (CT113 ) 


CPnOliS 


180777 


182369 


p 


CT1 14 hypothetical protein 


CPn0146 


1826X3 


183093 


r 




CPn0147 


183223 


183671 


F 




CPnOX48 


183846 


183702 


F 


pkni-S/T Protein Kinase- (CT145) 


CPn0149 


1897X5 


187700 


T 


dnU-OHA Ligese- (CT146) 


CPftOXSO 


187634 


192444 


F 


CT147 hypothetical protein 


CPnoisi 


194X42 


192625 


R 


rahpA-*ionooxygenase- (CTX48 ) 


CPn0152 


193265 


1943X8 


R 


CTX49 hypothetical protein 


TPn01S3 


193433 


197892 


F 


leuS-Leucyl cRKA synthetase- (CT2 09) 


CPO0XS4 


197892 


199202 


F 


goeA-KDO Transferees- (CT208) 


CPn0155 


199691 


199488 


R 




CPn01S6 


200X17 


199770 


R 




CPn0137 


200723 


200298 


R 




cpooise 


20X430 


200694 


R 




CPB0159 


20X772 


20X467 


R 




CPnOl60 


203791 


202X27 


R 


p f 1 - F rue tot e - 6 - P Phospnotrsas t erase_l - i CT2 07 > 


CPO0X61 


204622 


203798 


R 


predicted acyltrans feres o family- (CT206) 


CPnOX62 


203828 


204603 


R 




CPH0X63 


206026 


206394 


P 




C9H0X64 


206498 


206998 


P 




CF&0163 


206998 


207982 


F 




CPn0166 


207630 


207962 


F 




CPD0X67 


208306 


207977 


R 




CPaOX68 


208641 


209417 


R 




CPn0169 


209301 


208710 


R 




CPB0X70 


2XX026 


210025 


R 




CF&0171 


2X2435 


2X1149 


R 


•guaA-GKP synthase 


CPn0172 


2X3X77 


2X2440 


R 


• otttB / impP- Inoe ine 5 * -nonophosphase dehydrogenase (CO OH 










only) 


CPnOX73 


2X3987 


2X37X5 


R 




CPnOX74 


2X4237 


2X4724 


F 




CPn0175 


2X4698 


91 CMC 

219275 


F* 




CP&0176 


2X3286 


2X69X8 


F 


CTX53 hypothetical protein 


CP&0177 


Z17€37 


210008 


R 




CPD0X78 


218092 


91 9"9B O 


R 




CPn0179 


91 ■ini 


91 IAKC 
21003b 


R 




CPnOlOO 


91 1 

210031 


91 A1KC 

210293 


R 




CPnOlSl 


91 ai?c 
2191/9 


51 e755 


R 






120093 


91 Dili 


R 


■ aiaaf~ _tl I mm i fm »- < tmmmmm ■ a | 

eccc-Biotm carooxy lose- twri2 u 


creois j 


22119? 


129073 


R 


eccB-Biotin Carhoxyl Carrier Protein* (CT1 23 ) 


CPHU1B4 


22177? 


551551 
• lavlll 


R 


. o£p_i-£i.ongstion rector r^.i- ten 2 2 j 


wrnoio9 


9994C1 


2217* < ? 


p 

i\ 


*pe# erap^siDuioso^r spunrue* igtaixj 


creoioo 


222197 


9 9 4 ACQ 
22990 o 


F 


• at 4 anl 1 »J mwmm mtmt l~ f _ A « « / «BM a n I 

similarity to cps incA_i - (cnisi 


cnvu # 


22 0200 


259.01* 


» 
r 


preoicxeo metnyiese*(CTU2; 


»BmA1 at G 
CTO01BB 


95«1 1 1 

229111 


59Cinn 

12Q9UU 


F 


CTX32 hypo the ticel protein 


urnoioy 






r 


wTUi nrunoxog- tyossioic TransaanaDcsxaO rtwww 


rPnOlOfl 


220919 
• •Wat 7 


211271 


• 

r 




crooiyi 


511991 


511514 


R 


f«SQ"ABC Amino Acia Treneporter ATrift»iuiiJwj 






231981 

•JilO* 


*> 


nlnDalTtT iajiaiaj aVP>4 al t»niiiM>| i ay pajaauM at ai / CT12 9 ) 


CPn0X93 


233X26 


232686 


R 


•aroR-Aroinine Repressor 


CPTI0X94 


2332X0 


234241 




gcp^-O-Sialoclycoprotein Cndopeptidase.2- (CTX97) 


CPB0X95 


234X90 


233785 




oppA^l -Oligopeptide Binding Protein_l 


CP&0X96 


233939 


237919 




oppA^2- Oligopeptide Binding Protein^- (CT19 8) 


CP&0X97 


237378 


238982 




oppA.3 -Oligopeptide Binding ProteinJJ 


CPn0198 


239X69 


240746 




oppA.4 -Oligopeptide Binding Protein.4 


CFB0X99 


24X042 


24X98) 




oppn.1 -Oligopeptide Pernease.1- (CT199) 


CPn0200 


2420X7 


242968 




oppt^X -Oligopeptide PezxMaoe.l- (CT200) 


CPD0201 


242664 


243715 




oppD -Oligopeptide Transport A?Pese-(CT201) 


CPn0202 


2437X3 


244300 




oppP-Ollgopeptide Transport ATPese- (CT202) 


CP&0203 - 


243008 


243802 






CPH0204 


243817 


246002 






CPnO205 


246X33 


246327 






CP&0206 


246409 


247161 




CT203 hypothetical protein 


CPH0207 


247208 


248617 




ybhX/eodiTl-Oxogluterste/Melete Trans locator- (CT204) 


cre020e 


248933 


230602 




p CaA.2- Fructose- 6- P Pnospho trans in roe e^2- (CT20S) 


CPH0209 


231036 


251272 







•terminal regioc 
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CPn0210 
CPn021l 
CPn02l2 
CPn0213 
CPn0214 
CPn0215 
CPO0216 
CPI10217 
CPn021B 
CPa0219 
CPn0220 
CPn0221 
CPn0222 
CPn0223 
CPn0224 
CPn0225 
CPn0226 
CPn0227 
CPH0228 
CPn0229 
CPn0230 
CPn0231 
CPn0232 

CPn0233 

CP&0234 

CPn0235 

CPn0236 

CPn0237 

CPn0238 

CPn0239 

CPn0240 

CPn0241 

CPn0242 

CPn0243 

CPn0244 

CPn0245 

CPn0246 

CPn0247 

CPn024 8 

CPn0249 

CPn02SO 

CPn0251 

CPn0252 

CPn0253 

CPn0254 

CPn0255 

CPn0256 

CPn0257 

CPn0258 

CPn0259 

CPn0260 

CPn0261 

CPZ10262 

CPn0263 

CPn0264 

CPn0265 

CPn0266 

CPn0267 

CPn0268 

CPn0269 

CPn0270 

CPn0271 

CPn0272 

CPn0273 

CPn0274 

CPn0275 

CPn0276 

CPn0277 

CPn0278 

CPn0279 

CPn0280 



252364 

252756 

254066 

254342 

255657 

257015 

257608 

257896 

259058 

259357 

260696 

261657 

262504 

262956 

263435 

263873 

264566 

265416 

266110 

266328 

268253 

268957 

270122 

270424 

271240 

271416 

272156 

273762 

274303 

275899 

277861 

279354 

279916 

260555 

260918 

261645 

262952 

283415 

284327 

265841 

286057 

286060 

288112 

288456 

289262 

290165 

291264 

292127 

292534 

292986 

294045 

294302 

295091 

296249 

297730 

298620 

299184 

300122 

30093S 

302450 

303325 

303634 

305233 

305644 

3083S3 

310786 

311137 

311910 

312875 

313537 

314572 



251440 

252463 

252688 

254190 

254446 

255759 

257174 

258579 

258582 

260472 

261236 

262064 

262842 

263333 

263674 

264541 

264967 

265009 

265412 

267560 

267576 

268253 

269232 

270248 

270548 

272177 

273766 

274214 

275838 

276672 

276696 

278203 

279487 

280133 

281556 

282499 

262551 

282969 

283650 

284333 

285902 

287559 

287576 

287950 

288459 

289329 

290398 

291267 

292133 

292441 

293548 

295033 

295933 

297136 

297155 

297730 

299876 

300910 

301318 

301476 

302468 

304362 

304340 

305227 

305852 

308372 

310793 

311404 

312060 

312875 

313550 



R 
R 
R 
R 
R 
R 
R 
F 
R 
F 



R 
F 
R 
R 
R 

R 
R 
F 
F 
F 
F 
F 
R 
R 

R 

R 

F 

F 

R % 

R 

R 

R 

R 

F 

R 

R 

R 

R 

R 

R 

R 

R 

R 

F 

F 

F 

R 

R 

F 

F 
F 
R 
R 
F 
R 
R 
R 
R 
R 
R 
R 
R 
R 



ypdP-(CT140) 



tgt-Oueuine tRNA Ribosyl Transferase- (CT193 ) 



•weak similarity to Bacteriophage CHP1 (Orf4) 



dsbB-Disulfide bond Oxidoreductase- (CT176) 
dsbG-Disulfide Bond Chape rone- (CT177) 
CT178 hypothetical protein 
CT179 hypothetical protein 

tauB-ABC Transport ATPase (Nitrate/FeJ - (CT180) 
•similarity to 5 ■ -Methyl thioadenosine / S-Adenosylhoraocysteine 
Nucleosidase 

CT181 hypothetical protein 

kdsB-deoxyoctulonosic Acid Synthetase- (CT182) 

pyrG-CTP Synthetase- (CT183) 

ygoF Family- {CT184} 

rwf-Glucose-6-P Dehyroffenase-(CT185) 

devB-Glucose-6-P Dehyrogenase (DevB f amily) - (CT186) 



adk-Adenylate Kinase- (CT128) 

ydhO- Polysaccharide Hydrolase- Invasin Repeat Family- (CT127) 

rs9-S9 Ribosomal Protein- (CT12 6) 

rll3-L13 Ribosomal Protein- (CT12 5) 

ycfV/ybbA-ABC Transporter ATPase- (CT152) 

CT1S1 hypothetical protein 

rl33-L33 Ribosomal Protein- (CT1 50) 

•conserved hypothetical protein 

CT144 hypothetical protein (frame-shift with 0253?) 

CT144 hypothetical protein_l 

CT143 hypothetical protein^! 

CT142 hypothetical proteio_l 

CT144 hypothetical protein_2 

CT143 hypothetical protein_2 

CT1 4 2 hypothetical protein (frame-shift with 0259?) 

CT142 hypothetical protein_2 

secA_l- Protein Translocase Subunit_l- (CT141 ) 

ydaO-PP-Loop Superfamily ATPase- (CT217) 

surE-Sur£-like Acid Phosphatase- (CT2 18) 

yqfU hypothetical protein- (CT221) 

ubiD- Phenyl aery late Decarboxylase - (CT220) 

ubiA-Benxoate Octaphenyl transferase- (CT219) 



Dipeptidase- (CT138) 

ywlC-SuAS Superfamily- related Protein- (CT13 7) 
Lysophospholipase esterase- (CT136) 
dnaX_2-DNA Pol XIX Gamma and Tau_2- (CT187) 
tdk-Thymidylate Kinase- (CT18 8) 
gyrA_l-DNA Cyras e Subunit A_l- (CT189) 
cyrB_l-DHA Cyras e Subunit B_l- (CT190) 
CT191 hypothetical protein 

•conserved outer membrane lipoprotein protein 
•Possible ABC Transporter Permease Protein 
dppr-Dipeptide Transporter ATPase- (CT669) 
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CPn0281 

CPn0282 
CPa0283 
CPn0284 
CPH0285 
CPn0286 
CPn0287 
CPn0288 
CPn0289 
CPnO290 
CPn0291 
CPn0292 
CPn0293 
CPn0294 
CPn.0295 
CPn0296 
CPn0297 
CPH0298 
CPn0299 
CPn0300 
CPn0301 
CPn0302 
CPn0303 
CPn0304 
CPn030S 
CPn0306 
CPnO307 

cpnosoe 

CPnO309 

CPH0310 

CPn03U 

CPn0312 

CPn0313 

CPn03l4 

CPn0315 

CPn0316 

CPH03X7 

CPn0318 

CPn0319 

CPn0320 

CPn0321 

CPn0322 

CPn0323 

CPn0324 

CPn0325 

CPn0326 

CPn0327 

CPn032B 

CPn0329 

CPn0330 

CPn033l 

CPn0332 

CPn0333 

CPn0334 

CPn0335 

CPn0336 

CPn0337 

CPn033B 

CPn0339 

CPn0340 

CPn034i 

CPn0342 

CPn034 3 

CPn0344 

CPn0345 

CPn0346 

CPn0347 

CPn0348 

CPn0349 

CPn033D 

CPn035l 



31S0S7 

316126 

318497 

319045 

320595 

322059 

324221 

325716 

325812 

327042 

328667 

329228 

329949 

333092 

333863 

33476S 

335697 

336721 

336816 

337783 

340250 

340767 

342958 

343133 

344154 

345145 

348986 

349234 

350974 

353433 

354438 

354524 

354990 

35628S 

356977 

358620 

360081 

362767 

363175 

363860 

365858 

366249 

367331 

369492 

370708 

371148 

372945 

373241 

375086 

376675 

378437 

37865S 

379090 

379311 

379817 

380650 

382027 

382278 

383420 

383842 

384160 

384622 

.84999 

387420 

388572 

389675 

391021 

391803 

392770 

393181 

393686 



316103 

317529 

317532 

318551 

319051 

320650 

322089 

324571 

326996 

328523 

329194 

329836 

332723 

333502 

333627 

334022 

334774 

335717 

337415 

340152 

340762 

341866 

341921 

344158 

345137 

346431 

346515 

349596 

349595 

351049 

353575 

354976 

355355 

355353 

358716 

360121 

362750 

363126 

363879 

364783 

364767 

367328 

369460 

370668 

371148 

372725 

373211 

374992 

376146 

376202 

376701 

378536 

378800 

379823 

380674 

381591 

381575 

383375 

384034 

364156 

364495 

385062 

385595 

385558 

387436 

388704 

389678 

391027 

391790 

393684 

395432 



r 

R 
R 
R 
R 
R 
R 

r 

F 

r 

F 
F 
F 

R 
R 

K 
V 
I 

r 
r 

IS 

} 
i 

i 

1: 
r 

R 
R 
R 



R 
R 
R 
R 
F 
F 
F 
R 
F 
F 
F 
F 
F 
F 
R 
R 
R 
R 
R 
R 
F 

r 



dhnA- Predicted 1.6 -Fructose Bipftosph*..* Aldolase Idehydrin family) - 
{CT215I 

xasA/gadC-Aaino Acid Transporter- (CT2 16 J 



rogtE-Mg*- Transporter (CBS Domain) - (CT19 4) 
CT195 hypothetical protein 

aaaT~Neut;al Amino Acid (Glutamate) Transporter- (CT230) 
Na -dependent Transporter- (CT231) 
incB-lnclusion Membrane Protein B-(CT232) 
incc-lnclusion Membrane Protein C-(CT233) 
CT234 hypothetical protein 

CAMP -Dependent Protein Kinase Regulatory Subunit- (CT235) 

acpP-Acyl Carrier Protein- (CT23 6) 

fabC-Oxoacyl (Carrier Protein) Reductase- (CT237) 

fahD-Malonyl Acyl Carrier Transcyclase- (CT236) 

fahH-Oxoacyl carrier Protein Synthase III-(CT239) 

recR-Recntnhl nation Protein- (CT240) 

yaeT-Ompes Analog- (CT241) 

(OespH-Like Outer Membrane Protein) - (CT242) 

lpxO-UDP Glucosamine N-Acyitrans f erase- (CT2 43) 

CT244 hypothetical protein 

pdhA/odpA- Pyruvate Dehydrogenase Alpha- (CT245) 
pdhB/odpB- Pyruvate Dehydrogenase Beta- (CT246) 
pdhC-Dihydrolipoamide Acetyl trans f erase- (CT247) 
glgP-Glycogcn Phosphorylase- (CT348) 
similarity to CT249 

dnaA_l -Replication Initiation Protein_l- (CT250) 

60IM-60kDa Inner Membrane Protein- (CT251) 

Igt-Prolipoprotein Diacylglycerol Transferase- (CT252) 

CT101 hypothetical protein 

acpS-Acyl -carrier Protein Synthase- (CT100) 

trxB-Thioredoxin Reductase- (CT0 99) 

rsl-Sl Ribosomal Protein- (CT0 96) 

nusA-N Utilization protein A-(CT097) 

infB-Znitiation Factor-2- (CT096) 

rbfA-Rinosome Binding Factor A-(CT095) 

truB-tRNA Pseudouridine Synthase- (CT094 ) 

ribF-FAD Synthase- (CT0 93) 

ychF-CTP Binding Protein- (CT092) 

yscU-YopS Translocation Protein U -(CT091) 

lcro- Low Calcium Response D-(CT090) 

IcrE- Low Calcium Response E- (CT089) 

syc£- Secretion Chaperone- (CT088) 

malQ-Clucanotransf erase- (CT087) 

rl28-L28 Ribosomal Protein- (CT086 ) 

CT085 hypothetical protein 

Phopholipase D Superfamily [leader (33) peptide] - (CT08 4) 

CT083 hypothetical protein 

CT082 hypothetical protein 

CHLTR T2 Protein- (CT081) 

ltuB-(CT080> 

CT079 similarity 

folD-Methylene Tetrahydro folate Dehydrogenase- (CT07 8) 
yo)L-(CT077) 

smpB- Small Protein B- (CT076) 
dnaM-DNA Pol IZI (beta chain) - (CT075) 
recF-ABC superfamily ATPase- (CT074) 
(frame-shift with 0339) 
(frame-shift with 0340) 

predicted OMP (leader (19) peptide] - (CT073 ) 
(frame-shift with 03427) 
yaeL-Metailoprotease- (CT072) 
yaeM- (CT071) 

troo/ytgo- Integral Membrane Protein- (CT070) 
troc/ytgc- Integral Membrane Protein- (CT069) 
troB/ytgB-ABC transporter ATPase- (CT068) 
eroA/ycgA-Solute protein Binding Family- (CT067) 
CT066 hypothetical protein 
adt.l-ADP/ATP Translocase_l-(CT065) 
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CPn0352 

CPn0353 

CPn0354 

CPn0355 

CPn0356 

CPn03S7 

CPn0358 

CPn0359 

CPn0360 

CPn0361 

CPn0362 

CPn0363 

CPn0364 

CPn0365 

CPn0366 

CPn0367 

CPn0368 

CPn0369 

CPn0370 

CPH0371 

CPn0372 

CPn0373 

CPn0374 

CPn037 5 

CPn0376 

CPnP377 

CPU0378 

CPH0379 

CPn0380 

CPH0381 

CPn0382 

CPn0383 

CPn0384 

CPn0385 

CPn0386 

CPa0387 

CPn0388 

CPn0389 

CPn0390 

CPn0391 

CPn0392 

CPn0393 

CPn0394 

CPn0395 

CPn0396 

CPn0397 

CPn0398 

CPn0399 

CPn0400 

CPn0401 

CPn0402 

CPn04O3 

CPn0404 

CPn0405 

CPn0406 

CPn0407 

CPn0408 

CPn0409 

CPn0410 

CPn0411 

CPn0412 

CPn0413 

CPn0414 

CPn0415 

CPn0416 

CPn0417 

CPnCUlS 

CPn0419 

CPn0420 

CPn0421 

CPn0422 

CPn0423 



395574 

396893 

3971fi7 

399889 

400459 

401317 

401751 

402012 

405356 

406647 

407825 

409688 

409966 

410528 

411976 

413102 

413790 

414351 

415600 

417147 

417687 

418380 

420218 

421121 

421654 

423438 

426168 

426322 

426758 

429809 

430749 

431693 

432377 

434018 

434525 

435196 

435329 

438134 

439144 

439692 

439814 

440379 

440736 

441964 

444353 

445115 

445533 

445879 

446536 

447864 

448994 

449015 

450887 

451739 

451969 

453742 

454105 

454645 

455123 

455833 

456590 

459203 

460143 

461498 

461856 

463035 

464401 

466834 

467108 

467998 

4£6242 

468791 



396830 

397135 

398507 

398591 

400109 

400469 

401578 

403817 

403922 

405382 

407055 

407943 

410238 

411544 

412440 

413836 

414107 

415562 

416912 

417503 

418001 

420218 

420961 

421615 

422294 

422347 

423445 

426765 

427876 

428037 

430036 

430749 

431862 

432522 

434046 

434699 

437320 

437319 

438134 

439510 

440383 

440723 

441966 

443175 

443241 

444381 

445700 

446523 

447306 

447495 

447888 

449710 

449871 

450966 

45286S 

452858 

454581 

455127 

455833 

456609 

457246 

457227 

459172 

460221 

461557 

462244 

462953 

464876 

466824 

467108 

46JB784 

469216 



F 

F 

F 

R 

R 

R 

R 

F 

R 

R 

R 

R 

F 

F 

F 

F 

F 

F 

F 

P 

F 

F 

F 

F 

F 

R 

R 

F 

F 

R 

R 

R 

R 

R 

R 

R 

F 

R* 

R 

R 

F 

F 

F 

F 

R 

R 

F 

F 

F 

R 

R 

F 
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R 

F 

R 

F 

F 

F 

F 

F 

R 

R 

R 

R 

R 

R 

R 

R 

R 

F 

F 



lepA-GTPase- (CT064) 

gnd-6-Phosphogluconate Dehydrogenase- (CT063 > 
tyrS-tyrosyl tRNA Synthetase- (CT062) 
fliA/rpsD-Sigma-28/WhiC Family- (CT061) 
flhA-Flagellar Secretion Protein- (CT060) 
fer4-Ferredoxin IV-(CT059» 



CT058 hypothetical protein.2 
CT058 hypothetical proteixO 



gcp£-(CT057) 

CT056 hypothetical protein 



•ucB_l-Dihydrolipoaaide Succiuyltransferaee_l- (CT0S5) 
sucA-Oxoolutarate Dehydrogenase- (CT05 4) 
CT053 hypothetical protein 

hemN_l -Coproporphyrinogen III Oxidase_l- (CT052) 
CT326 similarity 

yabC/yraL- SAM- Dependent Kethy transferase- (CT048) 
CT047 hypothetical protein 
hctB-His tone- like Protein 2-<CT046> 
pepA-Leucyl Aminopeptidase A-(CT045) 
ssb-ss DMA Binding Protein- (CT044) 
CT043 hypothetical protein 

glgx -Glycogen Hydrolase (debranching) - (CT042) 
CT041 hypothetical protein 
ruvB-Holliday Junction Helicase- {CT040) 

dcd-dCT? Deaminase- (CT03 9) 
CT038 hypothetical protein 

ClyC_l-CBS Domain protein (Hemolysin Homolog)_l- (CT256) 
CT257 hypothetical protein 
yhfO-NifS-related protein- CCT258) 
PP2C phosphatase family- (C725 9) 

CT253 hypothetical protein 
CT254 hypothetical protein 
CT255 hypothetical protein 
mutY-Adenine Glycosylate- (CT107) 

ycec-predicted pseudouridine synthetase family- (CT1 06) 
CT105 hypothetical protein 

fabl-Enoyl-Acyl -Carrier Protein Reductase- (CT10 4) 

HAD superfamily hydrolase/phosphatase- (CT1 03 ) 

CT102 hypothetical protein 

CT260 hypothetical protein 

dnaQ_l-DNA Pol III Epsilon Chain_l-(CT261) 

CT262 hypothetical protein 

CT263 hypothetical protein 

msbA- Transport ATP Binding Protein- (CT2 64) 

accA-AcCoA Carboxylase /Transferase Alpha- (CT265) 

CT266 hypothetical protein 

himD/ihfA-Integration Host Factor Alpha- (CT267) 

amiA-M-Acetylmuramoyl Alanine Ami das e- (CT268) 

murE-N-Acetylmuramoylalanylglutamyl DAP Ligase- (CT269) 

pbp3 - transglycolase/ transpeptidase- (CT270 ) 

CT271 hypothetical protein 

yabC-PBP2B Family .methylcransf erase- (CT272 ) 

CT273 hypothetical protein 

CT274 hypothetical protein 
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CPn0424 


469612 


470961 


F 


CPn042S 


470980 


471564 


r 


CPn0426 


472111 


471536 


R 


CPn0427 


472207 


473715 


r 


CPn0428 


473722 


474681 


F 


CPn0429 


474681 


475319 


F 


CPn043Q 


475326 


476093 


F 


CPn0431 


476483 


476151 


R 


CPn0432 


476616 


476514 


R 


CPn0433 


477273 


476929 


R 


CPn0434 


479462 


477276 


R 


CPn0435 


480902 


479475 


R 


CPn0436 


481618 


480902 


R 


CPB0437 


481816 


484350 


F 


CPC043B 


485416 


484334 


R 


CPn0439 


48S5S3 


486077 


F 


CPn0440 


486105 


486740 


F 


CF&0441 


486891 


487638 


F 


CPH0442 


488013 


488528 


F 


CPH0443 


486729 


489979 


F 


CPn0444 


490287 


494507 


F 


CPH0445 


494772 


497579 


F 


CPn0446 


497626 


500415 


F 


CPn0447 


500568 


503351 


F 


CPn0448 


504810 


503698 


R 


CPH0449 


507231 


505330 


R 


CPn04S0 


508112 


507180 


R 


CPH04S1 


508275 


511058 


F 


CPn0452 


511319 


512B60 


F 


CPn0453 


513234 


516152 


F 


CPn0454 


516182 


519115 


F 


CPn0455 


520348 


519458 


R 


CPn0456 


521532 


520327 


R 


CPn04S7 


523865 


522120 


R 


CPn0458 


526320 


524236 


R 


CPn0459 


527005 


526619 


R 


CPn0460 


527840 


526992 


R 

R* 


CPn0461 


528638 


527844 


CPn0462 


531052 


529037 


R 


CPn0463 


532357 


531191 


R 


CPn0464 


532842 


532366 


R 


CPn0465 


533212 


532871 


R 


CPn0466 


533724 


536537 


F 


CPn04£7 


S36633 


539434 


F 


CPn0468 


539632 


S40432 


F 


CPn0469 


540399 


541460 


F 


CPn0470 


541357 


542532 


F 


CPn0471 


542564 


545401 


F 


CPn0472 


547905 


545581 


R 


CPn0473 


549593 


548070 


R 


CPH0474 


551573 


549807 


R 


CPH0475 


553844 


551685 


' R 


CPn0476 


554844 


553858 


R 


CPn0477 


556106 


S54844 


R 


CPn047B 


557625 


55S210 


R 


CFH0479 


558425 


557616 


R 


CPn04B0 


559303 


S56650 


R 


CPU0481 


560946 


559339 


R 


CPn0482 


561737 


560961 


R 


CPn0483 


561836 


564964 


F 


CPH04B4 


564970 


565824 


F 


CPn04B5 


566038 


566229 


F 


CPn04B6 


567784 


566405 


R 


CPH0487 


569740 


568112 


R 


CPn0486 


570096 


569767 


R 


CPn0489 


570965 


570096 


R 


CPn0490 


571279 


573333 


F 


CPn0491 


574352 


573336 


R 


CPn0492 


574652 


574804 


F 


CPn0493 


575004 


574855 


R 


CPn04S4 


575364 


575146 


R 


C?n049S 


575603 


576793 


F 



dnaA_2 -Replication Initiation Factory- (CT27 5) 
CT276 hypothetical proteins 
CT277 similarity 

nqr2-NA0H (Ubiquinone) Dehydrogenase- (CT27B ) 
nqr3-NAfiH (Ubiquinone) Oxidoreductaee, Gamma- (CT279) 
nqr4-NADH (Ubiquinone) Reductase 4-JCT280) 
nqrS-NADH (Ubiquinone) Reductase 5-(CT281) 



gesH-Glyeine Cleavage System H Protein- (CT282) 
CT283 hypothetical protein 

Phospholipase 0 superfamily (uncleavable leader peptide] - (CT284) 
lplA-Lipoete Protein Ligase-Like Protein- (CT285) 
clpC-ClpC Protease- (CT28 6) 
ycbF-pp-loop superfamily ATPase- <CT2B7) 



CT007 hypothetical protein 
CT006 hypothetical protein 
CT005 hypothetical protein 

pmp.6- Polymorphic Outer Membrane Protein G/Z Family 

pmp.7- Polymorphic Outer Membrane Protein xs Family 

pnp-8- Polymorphic Outer Membrane Protein G Family 

pmp_9- Polymorphic Outer Membrane Protein G/Z Family 

•yxjGjBs_2 Hypothetical Protein 

pmp_10-PMP_10 ( Frame-shi ft with 0451) 

pmp_10- Polymorphic Outer Membrane Protein G Family 

pmp_ll- Polymorphic Outer Membrane Protein G Family 

pmp_12 -Polymorphic Outer Membrane Protein A/I Family (truncated) 

pmp_13 -Polymorphic Outer Membrane Protein G Family 

pmp_14 -Polymorphic Outer Membrane Protein H Family 



pmp_l 5 -Polymorphic Outer Membrane Protein E Family 

pmp.l 6 -Polymorphic Outer Membrane Protein E Family 

pmp_l 7 -Polymorphic Outer Membrane Protein E Family 

pmp_l 7 -Polymorphic Outer Membrane Protein (Frame- shift with 0469) 

pmp_l 7 -Polymorphic outer Membrane Protein (Frame-shift with 0470) 

pmp_l 8 -Polymorphic Outer Membrane Protein E/F Family 



CT365 hypothetical protein 
glgB-Glucan Branching Enzyme- (CTB66) 
CT665 hypothetical protein 
•yqeV_Bs Hypothetical Protein 
hflx-GTP Binding Protein- (CT37 9) 
phnP-Metal Dependent Hydrolase- (CT3 80) 
CT363 hypothetical protein 

artJ-Arginine Periplasmic Binding Protein- (CT381) 

aroC-Deoxyheptonate Aldolase- (CT3 82) 
CT362.1 hypothetical protein 
•hypothetical proline permease 
CT384 hypothetical protein 
hitA-KZT Family Hydrolase- (CT3 85) 
CT386 hypothetical protein 
CT387 hypothetical protein 
CT369 hypothetical protein 



aspC-Asparcace Aminotransf erase- (CT390) 
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CPH0496 


578793 


577812 


r 


C7391 hypothetical protein 


CPn0497 


578089 


577820 


R 


CT38B hypothetical protein 


CPn0498 


579035 


578085 


R 




CPn0499 


580359 


579205 


R 




CPnOSOO 


580639 


582362 


r 


pros- Prolyl tRNA Synthetase- (C7393) 


CPnOSOl 


582457 


583630 


F 


hrcA-KTH Transcriptional Repressor- (CT3 94 ) 


CPnOS02 


583650 


584201 


r 


grpE-HSP-70 Cof actor- (CT3J5) 


CPn0303 


584234 


586213 


r 


dneJC-HSP-70-<CT396) 


CPnOS04 


586487 


588514 


r 


vacB-ribonucleaee family- (CT397) 


CPn0305 


588319 


589106 


F 


M-methyiadenine OKA Glycosylate 


CPn0506 


589172 


589840 


r 


CT421 hypothetical protein 


CP&0S07 


589961 


590122 


? 


CT421.1 hypothetical protein 


CPnOSOB 


590142 


590300 


F 


CT421.2 hypothetical protein 


C7&0S09 


590335 


590808 


F 


(predicted Ketalloenryme) -(CT422) 


CPnOSlO 


590813 


591973 


F 


ClyCJ2-CBS Domains (Hemolysin homolog)_2- (CT423) 


CPnOSU 


592141 


592488 


F 


rsbV.l-Sigme Regulatory Factor.l- (CT424 ) 


c?nosi: 


592533 


594412 


F 


CT425 hypothetical protein 


CPnOSU 


594647 


593753 


F 


Fe-S oxidoreductase_l-tcr426) 


cpnOSl4 


593729 


S96S20 


F 


CT427 hypothetical protein 


CPn0315 


596492 


597181 


F 


ublE-Ubiquinone Methyl transfer aee- (CT42 8) 


CP&0S16 


598814 


597235 


R 




CP&0S17 


599631 


598795 


R 




CPnOSIS 


600803 


599832 


R 


CT429 hypothetical protein 


CF&0319 


601674 


600904 


R 


dapF-Diaminopimelate Bpimeraae- (CT430) 


CP&0520 


602216 


601646 


R 


clpP-CLP Protease- (CT431) 


CFB0321 


603797 


602241 


R 


glyA-Serine Hydroxymethyltransferase- (CT432) 


CPn0522 


603967 


604655 


F 


CT433 hypothetical protein 


CPB0523 


604723 


603052 


F 




CP&0S24 


603103 


606179 


F 




CPH0325 


606322 


607283 


F 


CT39B hypothetical protein 


CPn0526 


608696 


607710 


R 


yrhH-GucQ/KpsF Family Sugar- P Zsomerase- (CT399) 


CPnOS27 


609904 


608726 


R 


suc8_2 -Dihyorolipoamide Succiny 1 trans f eras e_2- (CT400) 


C9&0S28 


611162 


609921 


R 


gltT-Glutaaate Symport- (CT401) 


CPB0S29 


612239 


611165 


R 


ycaH-AT Pase- (C7402 ) 


CPH0S30 


613254 


612460 


R 


epoO_l-rRMA Methylase_l-(CT403) 


CfnOS31 


614069 


613245 


R 


SAM dependent methyl transferase- (CT404) 


CPB0332 


614674 


614075 


R 


ribC/risA-Ribof levin Synthase- (CT405) 


CPnOS33 


614930 


613385 


F' 


CT406 hypothetical protein 


CPD0334 


619413 


61S7B4 


F 


dksA-OnaX Suppressor- (CT407) 


CP&0535 


615793 


616296 


F 


lspA-Lipoprotein Signal Peptidase- (CT408) 


CPH0336 


616345 


617691 


F 


degA_l-D-Ale/Gly Per»eaae_l- (CT409) 


CP»0337 


617833 


618189 


F 


CT814.1 hypothetical protein 


C9ttOS38 


618212 


618511 


F 


CT814 hypothetical protein 


CPttOS39 


618705 


621545 


F 


pmp.19 -polymorphic outer membrane protein A Family -(CT412) 


CPB0340 


621694 


626862 


F 


pmp_20 -polymorphic outer membrane protein B Family- (CT4 13 ) 


CPaOS41 


627170 


62B003 


F 


Solute binding protein (-yebL-Synechocystis Adhesin Honolog) -(CT41S) 


CP&0S42 


628003 


628737 


F 


ABC Transporter ATPase- (CT416) 


CP&0S43 


628725 


629603 


F 


(Metal Transport Protein) -(CT417) 


CPB0544 


630529 


62952S 


R 


yhbl-CTP binding protein- (CT418) 


CPn054 5 


630884 


630633 


R 


rl27-L27 ribosomal protein- (CT419) 


CPnOS46 


631229 


630912 


R 


rl2l-L21 Ribosomal Protein- (CT420) 


CPnOS47 


631661 


632188 


• F 


ygbB family- (CT434) 


CPn0548 


633231 


632191 


R 


cysj-Sulfite Reductase- (CT43 5) 


CPnOS49 


633369 


• 633255 


R 


ralO-SlO Ribosomal Protein- (CT43 6) 


CPaOSSO 


633661 


633380 


R 


f usA- Elongation Factor C-(CT437) 


CPn0551 


636168 


633698 


R 


rs7-S7 Ribosomal Protein- (CT43B) 


CPnOSS2 


636387 


636219 


R 


rsl2-S12 Ribosomal Protein- (CT43 9) 


CPB0SS3 


637747 


636812 


R 




CPnOS34 


637834 


638141 


F 


CT440 hypothetical protein 


CPnOSSS 


638298 


640241 


F 


tsp-Tail-Specific Protease- (CT441) 


CPn0556 


640912 


640325 


R 


crpA-lSkDa Cysteine-Rich Protein- (CT442 ) 


CM>SS7 


642861 


641194 


R 


ome8-60kPa Cysteine-Rich Outer Membrane Complex Protein- (CT443 ) 


CPnOSSB 


643300 


643031 


R 


omcA-9kDa-cysteine-Rich Outer Membrane Complex Lipoprotein- (CT444) 


CPn0359 


643742 


643927 


F 


CT441.1 hypothetical protein 


C9&0360 


643612 


644098 


R 


gltX-Giutamyl-tRKA Synthetase- (CT445) 


CP&0361 


646404 


645871 


R 


euo-CKLPS Euo Protein- (CT44 6) 


CPtt0362 


648036 


646916 


R 


*CKLP5 43 kOa protein horaoloc_l 


CPnOS63 


630036 


646293 


R 


recJ-ssONA Exonucleese-(CT447) 


CPnOS64 


634330 


630145 


R 


secOasecF- Protein Export Proteins SecO/SeeF (fusion)- (CT448) 


CPn0565 


633630 


654333 


R 


CT449 hypothetical- protein 


CP&0S66 


636141 


656890 


F 


yaeS family- (CT4S0) 


CPO0567 


636894 


657817 


F 


cdsA-Phosphatidate cy tidy ly trans f erase- (CT4 31) 
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CP»0S68 


657817 


658464 


r 


CPX10S69 


6S8464 


659099 


F 


CFn0S70 


6S9107 


660789 


r 


CPn0S71 


662122 


660749 


R 


CPn0572 


662352 


664616 


F 


CPn0373 


663404 


664691 


R 


CPB0S74 


66S94S 


665394 


R 


CPB0S7S 


666494 


665983 


R 


CPn0576 


667343 


666494 


R 


CP&0S76 


667396 


667530 


R 


CPn0577 


667895 


668155 


p 


CPn0578 


668406 


669365 


p 


CPnOS79 


669361 


669993 


p 




669993 


670793 


p 


CPnOSBl 


671434 


670745 


R 


CPnOS82 


671303 


672177 


p 


CPnOS83 


672400 


672717 


p 


CPnOS84 


672707 


673798 


P 


CPnOSBS 


673817 


673865 


J; 


C?nOS86 


676026 


677183 


f 


C980367 


677441 


678124 


p 


CPnOSBS 


678084 


678626 


p 


CPn05B9 


678640 


679395 


} 


CPnO590 


680112 


679516 


f 


CPB0S91 


680373 


681020 


f 


CPH0S92 


681133 


661461 


} 


CP&0593 


682476 


681391 


J. 


CPnOS94 


682383 


684958 


1 


CP&0S9S 


684936 


685926 


f 


CPttOS9€ 


683939 


686457 


p 


CPnOS97 


688213 


686479 


R 


CPn059B 


689697 


688219 


R 


CPB0S99 


691602 


689682 


R 


CPn0600 


692147 


691827 


R 


CPn0601 


693033 


692736 


R 


CPB0602 


69410S 


693104 


R 


CPnO603 


694205 


695165 


p 


CPn0604 


69S94S 


695196 


R* 


CF&060S 


696707 


696150 


ft 


CPn0606 


697444 


696707 


R 


CP&0607 


698895 


697573 


R 


CPn0608 


699645 


699016 


R 


CP&0609 


699705 


699966 


P 


CPnO610 


701420 


700029 




CPn0611 


702025 


701420 


R 


C7B0612 


704631 


702022 


R 


CPn0613 


705656 


704658 


R 


CFn0614 


707402 


705783 


R 


CPD0615 


706137 


707634 


R 


CVB0616 


708791 


710137 


p 


C9B0617 


710484 


712316 


p 


C9&0616 


712306 


713010 ■ 


p 


CV&0619 


713444 


713013 


R 


CP&0820 


714139 


713519 


R 


C9&0821 


714647 


714144 


R 


CP&0622 


715752 


714793 


R 


CPn0623 


716993 


716163 


R 


CPn0424 


718015 


717011 


R 


CPn0625 


718485 


718060 


R 


CPn0626 


719616 


718495 


R 


CPn0627 


720038 


719640 


R 


CPn0628 


720428 


720063 


R 


CF&0629 


721657 


720487 


R 


CPn0630 


'.22316 


721885 


R 


CPB0631 


722806 


722312 


R 


CPU0632 


723195 


722827 


R 


CPB0633 


723757 


723209 


R 


CPB0634 


724165 


723787 


R 


CP&0635 


724745 


724206 


R 


CPn0636 


725082 


724750 


R 


CFU0C37 


725464 


723099 


R 


CPn063B 


725747 


725490 


R 



frame-ehift ) -(CT435 



cdsA-Phosphatidate cytidyly transferase- (CT4S2 ) 
pi sC -Glycerol -3 -P Acy 1 trans £ era ee- (CT453) 
argS-Arginyl tRNA Trans tinn- (CT4 54) 
murA-TOP-N-Acetylglucosamine Trtniftriit- (CT4SS) 
CT4S6 hypothetical pro tain 
yebC tamily-(CT4S7) 

YhhY -Amino Croup Acetyl Trans f«ra»e- (CT45B I 
prfB-Peptide Chain Ralaaaa Factor 2 ( natural UGA 
prfB- (natural UGA frame-ehift ) 
SW1B (YK74) complex protein- (CT460) 
yael-phosphohydrolaee- (CT461) 

ygbP/yecM- Sugar Nucleotide Phosphorylaaa- (CT462) 

truA-Pseudouridylate Synthaae I- (CT463) 

Phosphoglycolett Phosphatase- ( CT464 ) 

CT465 hypothetical protein 

CT466 hypo that leal protein 

a toS / ntrB-2 -Component Sensor- ( CT467 ) 

•similarity to Cpa XncA.2 

atoC/ntrC-2-Compontnt Regulator- (CT468) 

*yvyD_B* coneerved hypothetical protein 

CT469 hypothetical protein 

CT470 hypothetical protein 

CT471 hypothetical protein 

yagE family- (CT472) 

yldD family- (CT473) 

CT474 hypothetical protein 

pheT-phenylalanyl tRNA Synthetase Beta- (CT47S) 

CT476 hypothetical protein 

ada -methyl trans f erase* (CT477 ) 

oppC w 2 -Oligopeptide Permease.2- (CT478) 

oppfl_2 -Oligopeptide Permease^ - ( CT47 9 ) 

oppA.5 -oligopeptide Binding Lipoprotein's- (CT4 B0) 

CT483 hypothetical protein 
CT484 hypothetical protein 
henZ-Ferrocnetalase- (CT485) 
fliY-Clutamine Binding Protein- (CT48 6) 
yhnr-Methylase - (CT4B7) 
CT486 hypothetical protein 
glgC-Glucost-1-P Adenylcrans f erase- (CT489) 

•pyrF-Uridine 5 '-Monophosphate Synthase (Ump Synthaae) -truncated? 

CT490 hypothetical protein 

rho -Transcript ion Termination Factor- (CT4 91) 

yace-predicted phoaphatase/kinase- (CT492) 

polA-DNA Polymerase 2-(CT493) 

sohB- Protease- (CT494 ) 

adt_2-ADP/ATP Translocase w 2-(CT49S) 

pgsA_l -Glycerol -3 -P Phoaphati dyl trans f erased - (CT496) 

dnaB-Rcplicative dna Helicase- (CT497) 

gidA-FAX>- dependent oxidoreductaao- (CT498) 

lplA-Lipoate-Protein Ligaae A-CCT499) 

ndk- Nucleoside- 2 -P Kinase- (CTSOO) 

ruvA-Kolliday Junction Helicase- (CTS01) 

ruvC -Crossover Junction Endonuclease- (CTS02) 

CT503 hypothetical protein 

CT504 hypothetical protein 

gapA-Glyceraldehyde-3-P Oahyrogenase- (CT505) 

rll7-L17 Ribosomal Protein- (CT506) 

rpoA-RNA Polymerase Alpha- ICTS07) 

rail -si 1 Ribosomal Protein- (CT508 ) 

rsl3-S13 Ribosomal Protein- (CTS 09) 

secY-Translocase- (CTS10) 

rllS-Ll5 Ribosomal Protein- (CT511) 

rs5-s5 Ribosomal Protein- (CTS 12) 

rll8-L18 Ribosomal Protein- (CTS 13) 

rl6-L6 Ribosomal Protein- (CT514) 

r«6-S6 Ribosomal Protein- (CT515) 

rl5-L5 Ribosomal Protein- (CTS 16) 

rl24-L24 Ribosomal Protein- (CTS 17 ) 

rll4-U4 Ribosomal Pro te in- ( CTS 1 6 ) 

«17-S17 Riboaomal Protein- (CT319) 
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CPn0639 


725958 


725743 


R 


CPnO640 


728377 


725964 


R 


CPn064l 


727077 


726409 


n 


CPn0642 


727428 


727096 


R 


rpnOfi'43 

WrulluVJ 


727713 


727450 


R 


CPn0644 


728573 


727722 


R 


LlHUD< 3 


728930 




R 




72 QK 31 




R 


brnooa « 


710331 


73 Q4C7 


R 


CPn0648 


711 *n* 

7J10DJ 


71 AS n« 


R 


CPn0649 


7 9*07* 


71 i 7 i n 

7 J171Q 


R 


CFRQ030 


/ J J301 


/ J40Q9 


R 


CDwASCl 

l»rnU031 


» j jy ' 3 


711517 
' J J 31 / 


R 


CRUJ034 


-ji j one 


71*. DBA 
/ J J77O 


R 


/*DmAXC i 

LrTIUD J J 


1W i OA 


/ JtOOO 


R 




714047 


7 1 cent 

/ Jo 30 J 


R 






7171 AY 


R 


CPn06S6 


7 17 ft73 


71P rt J Q 
/ #OOOB 


F 


CPnOS57 


71A471 
/JO* f J 


71 anti 


R 


CPn06S8 


739168 


73ft45* 


R 


CPn0659 


* J 9? J J 


710A1 A 
' J -a Jo 


F 




740327 




R 




741100 


740337 


R 


CPn0662 


742923 


741172 


R 


WrUUBBJ 


714140. 


7470A1 


R 




744757 


» 4*337 


R 




745001 


74 (ice 
' * 0 JO 3 


F 


CPnOfififi 


746388 


7501 07 


F 


wriivoo / 


751058 


7 4A1 77 


R 


CPn0668 


751209 


7521 43 
' 94404 


F 


CPn0669 


752179 


75277* 
' 34 / /3 


F 




752765 


7531 04 
» 3 Jxyo 


F 


CFn0671 


753630 


753305 
1 3J4U3 


R 


CPn0672 


753741 


75504A 


F 


CPn0673 


755287 


755441 


F 


CPn0674 


756668 


755577 
» 939 / / 


R 


CPn067S 


757919 


75674R 
'30/ OO 


R 


CPn0676 


759217 


758051 


A 


CPn0677 


760401 


759256 


R 


CPn0678 


761320 


760662 


R 


CPn0679 


762930 


761725 


B 
R 


CPn06B0 


764248 


762971 


R 


CPn06BX 


764929 


764258 


R 


CPn0682 


764984 


765955 


p 


CP&0683 


765948 


766919 


p 


CPn0684 


766038 


767181 


R 


CPn068 5 


768068 


768217 




CPn0686 


768361 


768176 


R 


CPn0667 


768564 


769214 


p 


CPn0688 


769382 


770137 


p 


CPn0689 


771404 


770187 


R 


CPnO690 


772680 


771436 


R 


CPn0691 


773452 


772685 


R 


CFS10692 


774912 


773461 


R 


CPn0693 


776256 


775240 


R 


CPH0694 


779599 


776330 


R 


CPH0695 


780216 


781382 


F 


CPn0696 


781769 


782599 


F 


CPU0697 


782602 


783447 


F 


CF&069S 


783458 


784201 


F 


CPa0699 


784182 


784721 


F 


CPn0700 


785097 


785609 


F 


CPn0701 


7B5599 


786672 


F 


CPn0702 


789685 


786929 


R 


CPH0703 


791190 


789685 


R 


CP&0704 


792321 


791209 


R 


CPn0705 


793173 


792334 


R 


CPn0706 


793683 


793180 


R 


CPnO707 


793029 


793704 


R 


CPnO708 


795705 


795034 


R 


CPn0709 


796188 


795742 


R 


CPn0710 


796461 


796210 


R 



rl29-L29 Riboeomal Procein- ICTS20) 
rll6-L16 Riboeomal Procein- (CTS 21) 
rs3-S3 Riboeomal Protein* (CTS22 ) 
rl22-L22 Riboeomal procein- (CTS23 ) 
ra!9-S19 Riboaomal Procein- (C7524) 
r!2-L2 Riboaomal Procein- (CT525) 
r!23-L23 Riboeomal Procein* (CT526) 
rl4*L4 Riboaomal Procein- (CTS27 ) 
r!3*L3 Riboaomal Procein* ICT52 8 ) 
CT529 hypothetical procein 

fmt-Methionyl CRNA Formyltransf erase* (CT530) 
lpxA-Acyl -Carrier UDP-ClcNAc - (CTS31) 
fabZ-Myriscoyl-Acyl Carrier Dehydracaae- (CTS321 
lpxC-Myriscoyl GlcNac Deacetylase- (CT533) 
cuce-Xpolipoprocein N-Acotyltransf erase* (CTS34 ) 
vdlO/yciA-acyl-CoA Thioest erase* (CT535) 
dnaQ_2-DNA Pol ZZZ Epsilon Chairs 2- (CT536) 

yjeE tATPaae or Kinase) * (CT537) 
CT538 hypothetical procein 
crxA-Thioredoxin* (CTS39) 
apoU_2-rRNA Mechylaae_2-<CT540) 

mip-FKBP-typa pep t idyl -prolyl c is -trans ieomeraee- (CT541) 
aspS-Aspartyl CRNA Synthetase- (CT542 ) 
hisS-Kistidyl CRNA Synthetase* (CTS43) 

uhpC-Hexosphosphate Transport - (CT544) 
dna£-0NA Pol ZZZ Alpha- (CTS4S) 
predicted OMP (leader (17)-(CT546) 
CT547 hypothetical procein 
CTS48 hypochecical procein 

rabw-sigma regulatory factor-hiacidine kinaae- (CT549) 
CTSS0 hypochecical procein 

dacF(pbpS) -O-Ala-D-Ala Caroxypepcida.ee- (CT551I 
CT5S2 hypochecical procein 
fmu-RNA Mechylcrans f erase* (CT5 53) 
CT696 hypochecical procein 
homologous co CT695 



pgk-phoephoglycerace Kinaae- (CT693J 

ygo 4 -phosphate Permease- (CT692 ) 

CT691 hypochecical procein 

dppO-ABC ATPaae Oipepcide Transport- (CT690) 

dppF-ABC ATPaae Oipepcide Transport- <CT68 9 ) 

s poJ/parB -Chromosome Pare it ion ing Procein- (CT68 8) 



CT4B2 hypochecical procein 
CT4B1 hypochecical procein 

y«hO_l-Nif S-relaced Aminocrana f eraae.l - (CT687 J 
ABC Transporter Membrane Procein- (CT686) 
abeX-ABC Transporter ATPaae* (CT68 5) 
ABC Traneporter-<CT684) 

TPR Repeats (O-Linxed GlcNAc Transferase similarity ) - (CT683 J 

pbp2-PBP2-transglycolase/ transpeptidase- (CT682 ) 

ompA-Major Outer Membrane Protein* (CT681 ) 

ra2*S2 Riboaomal Protein- (CT680) 

tsf -Elongation Factor TS-(CT679) 

pyrH-OMP Kinase- (CT679) 

rrf-Rlbosome Releasing Factor- {CT677) 

CT676 hypochecical procein 

JcarG-Arginine Kinase- (CT67 5) 

yacC/gapD-Yop C/Gen Secrecion Procein D*(CT674) 
pkn5-S/T Procein Kinase- (CT67 3) 

fUN- Flagellar Motor Switch Oomain/YscQ f amily- (CT672 ) 

CT671 hypochecical procein 

CT670 hypothetical procein 

yscN-Yop N (Flagellar-Type ATPaae) - (CT669) 

CT668 hypochecical procein 

CT667 hypochetical procein 

CT666 hypochecical procein 
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CPa07il 


796731 


796486 


R 


CT66S hypothetical protein 


CPD0712 


799315 


796761 


R 


FHA domain; homology to adenylate cyclase >- (CT664 ) 


CPn0713 


799721 


799332 


R 


CT663 hypothetical protein 


CPn0714 


801107 


800091 


R 


htmA -Glutamyl tRNA Reductase- (CT6 62) 


CPn071S 


801657 


803462 


r 


gyrB_2-DWA Cyras e Subunit 8_2-(CT661) 


CPn0716 


603469 


804902 


F 


gyrA_2-DNA Cyrase Subunit A_2- (CT660) 


CPn0717 


80S010 


805306 


r 


CT656 hypothetical protein 


CPn0718 


805309 


805626 


F 


CT657 hypothetical protein 


CP&07X9 


805916 


806890 


F 


sfhB- ( Pseudouridine Synthase) - (C76S8 > 


CPn0720 


607003 


807236 


F 


CT659 hypothetical protein 


CP&0721 


607683 


808489 


F 


kdsA-KOO Synthetase- (CT6SS) 


CPn0722 


608489 


808974 


F 


CT654 hypothetical protein 


CPn0723 


608984 


809703 


F 


yhbO-ABC Transporter ATPase-(CT6S3) 


_CPn0724 


810527 


809706 


R 




CPn0725 


810811 


810587 


R 


CT6S2,1 hypothetical protein 


CPH0726 


813372 


610860 


R 


CT620 hypothetical protein 


CPn0727 


813577 


816192 


F 


CT619 hypothetical protein 


CPH0728 


818477 


816525 


R 


CHLPN 76kDa Komoloo.1 (CT622) 


CPn0729 


819857 


618592 


R 


CHLPN 76kOa Homology (CT623) 


CPn0730 


821603 


819963 


R 


mviN- integral Membrane Protein- (CT62 4 ) 


CPn0731 


621587 


821760 


r 




CP&0732 


822098 


822976 


F 


nfo-Endonucleaae XV-(CT625) 


CPn0733 


823727 


823101 


R 


rs4-S4 Ribosomal Protein- (CT626 ) 


CPH0734 


823944 


824915 


F 


yceA-(CT627) 


CPn073S 


825668 


825003 


R 


•pyrH/udJc-Uridine Kinase (Uridine Monophosphokinase) (Pyrimidlne 
Ribonucleoside Kinase) . 


CSH0736 


827686 


825992 


R 


ygeD-Efflux Protein- (CT641) 


CPn0737 


827685 


B30756 


F 


recC-Exodeoxyribonuclease V, Gamma- (CT640) 


CPn0738 


830746 


633895 


F 


recB-Exodeoxyribonuc lease V. Beta-(CT639) 


CPn0739 


834871 


833861 


R 


CT636 hypothetical protein 


CPnO740 


636048 


834864 


R 


tyrB- Aromatic AA Aminotransferase- (CT637) 


CPn0741 


838350 


636185 


R 


ore A- Transcript ion Elongation Factor- (CT63 6) 


CPn0742 


838463 


838888 


F 


CT63S hypothetical protein 


CPn0743 


838962 


640362 


F 


nqrA- Ubiquinone Ox i do reductase. Alpha- (CT634) 


CPn0744 


841384 


840389 


R 


hemB- Porphobilinogen Synthase- ( CT633 ) 


CPn0745 


841903 


841742 


R 




CPn0746 


841975 


843567 


F 


CT632 hypothetical protein 


CPn0747 


843675 


843740 


F* 


CT631 hypothetical protein 


CPH0747 


843725 


843910 


F 


CT631 hypothetical protein (frame -shift) 


CPn0746 


844987 


644121 


R 


ispA-Geranyl Trans transferase- (CT626) 


CPn0749 


845629 


B45006 


R 


glmU-UDP-GlcNAC Pyrophosphory lase- (CT629 ) 


CPn0750 


846411 


845707 


R 


tetO/cpxR-HTK Transcriptional Regulatory Protein ♦ Receiver Oamax 
(CT630) 


CPn07Sl 


846608 


848434 


F 


CT6S1 hypothetical protein 


CPn0752 


848604 


850082 


F 


recD_2-Exodeoxyribo nuclease V, Alpha_2- (CT652) 


CPn07S3 


651006 


850161 


R 




CPn0754 


851336 


851040 


R 


rs20-S20 Ribosomal Protein- (CT6 17) 


CPn07SS 


851597 


852799 


F 


CT616 hypothetical protein 


CPn07S6 


852961 


854676 


F 


rpoO-RNA Polymerase Sigma -6 6 -(CT615) 


CPH0757 


854733 


855134 


F 


folX-Oihydroneopterin Aldolase- (CT614) 


CPn07S6 


855110 


856459 


F 


folP/dhpS-Dihydropteroate Synthase- (CT613) 


CPn07S9 


856488 


856997 • 


F 


folA-Oihydrofolate Reductase- (CT612) 


CPn0760 


B569S7 


857694 


F 


CT611 hypothetical protein 


CPn0761 


857704 


858375 


F 


CT610 hypothetical protein 


CPn0762 


859597 


858539 


R 


recA-RccA recombination protein- (CT650) 


CPn0763 


860511 


859972 


R 


ygfA-Formyltetrahydrofolate Cycloligaee- (CT649) 


CPn0764 


661807 


860524 


R 


CT648 hypothetical protein 


V>ruU /OS 


OOZJOZ 


661601 


R 


CT647 hypothetical protein 


CPn0766 


863782 


862394 


R 


CT646 hypothetical protein 


CPn0767 


863864 


864177 


F 


CT645 hypothetical protein 


CPn0768 


864159 


865163 


F 


yohX/nir3 -predicted oxidoreductaie -(CT644) 


CPn0769 


867733 


865121 


R 


topA-DNA Topoisomerase I-Fused to SWI Domain- (CT643) 


CPnO770 


668340 


869131 


F 


CT642 hypothetical protein 


CPn077l 


870463 


869144 


R 


rpoN-RNA polymerase Sigma -54- (CT609) 


CPn0772 


672385 


870469 


R 


uvrO-DNA Helicase-(CT60B) 


CPB0773 


872488 


873195 


F 


ung-Uracil ONA Glycosy lase- (CT607 > 


CPn0774 


873195 


673425 


F 


CT606.1 hypothetical protein 


CPn0775 


874031 


873414 


R 


yggV family- (CT606) 


CPn0776 


874246 


875487 


F 


CT605 hypothetical protein 


CPn0777 


875601 


877178 


F 


groEL_2-heat shock protein-60 -(CT604) 


CPn0778 


877505 


878092 


F 


tsa/ahpC-Thio-apecific Antioxidant (TSA) Peroxidase- (CT603 ) 


CPn077 9 


878481 


878095 


R 


CT602 hypothetical protein 
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CFD0760 


87920S 


878591 


R 


papO/amiB-N-Acetylmuranoyl-L-Ala Amide««- (CT601 J 


CPn078i 


879773 


679196 


R 


pal- Peptldogly can- Associated Lipoprotein- (CT600) 


CPn07B2 


88106S 


879773 


R 


tolB •polysaccharide transporter- (CTS99) 


CPn078 3 


88188S 


681100 


R 


CT598 hypothetical protein 


CPn0784 


882296 


681892 


R 


exbD-Biopolymer Transport protein* (CTS97) 


CPn0785 


882991 


882296 


R 


exbB/tolO-poly saccharide transporter- (CTS96) 


CPn0786 


883185 


865293 


r 


dsbD/xprA-Thioi disulfide Interchange Procein- (CT595) 


CPn0787 


865619 


686401 


F 


yabD/ycf.M-PHP superfamily . ( ureas e/pyrimidinase) hydrolase- (CT5 94 ) 


CPn078B 


886542 


887432 


F 


sdhC -Succinate Dehydrogenase- (CTS 93) 


CPn0769 


867439 


889316 


F 


sdhA- Succinate Dehydrogenase- (CT592) 


CPnO790 


869330 


690103 


F 


sdhfl -Succinate Dehydrogenase- (CT591) 


CPn0791 


B93050 


890111 


R 


CTS90 hypothetical protein 


CF&0792 


894919 


893108 


R 


CTS89 hypothetical protein 


CP&0793 


896823 


894919 


R 


rbsU -sigma regulatory family protein-PP2C phosphatase (RsbW 
antagonist) - (CT5B8 ) 


CPn0794 


897174 


698004 


F 




CPn079S 


896128 


899195 


r 




CPn0796 


899301 


901340 


F 




CPO0797 


901600 


902694 


r 




CPH0798 


902846 


903856 


r 




CPn0799 


904966 


903940 


R 




CP&0800 


906532 


905249 


R 


eno-Bnolase-(CT587) 


CPnO801 


908697 


906727 


R 


uvrB - Ex inuc lease ABC Subunit B- (CTS86) 


CPn0802 


909740 


908709 


R 


crpS-Tryptophanyl tRNA Synthetase- (CTS8 5) 


CPn0803 


910303 


909752 


R 


CTS84 hypothetical protein 


CPnO804 


911059 


910310 


R 


gp6D-CKLTR Plasmid Paralog- (CTS83) 


CPn0805 


911831 


911067 


R 


minD-chromosome partitioning ATPase-CHI.TR plasmid protein CP5D-(CT582 


CPn0806 


913771 


911867 


R 


thrS-Threonyl tRNA Synthetase- (CT561) 


CP&0807 


913971 


914879 


F 


CTS80 hypothetical procein 


CP&0808 


916287 


914956 


R 


CTS79 hypothetical protein 


CPn0809 


917785 


916307 


R 


CTS76 hypothetical protein 


CFH0810 


918164 


917825 


R 


CTS77 hypothetical procein 


CF&0811 


916900 


918206 


R 


icrK^l-Lov Ca Response Procein H_1-(CTS76) 


CPn0B12 


919123 


920862 


F 


mutL-DNA Mismatch Repair- (CTS7 5) 


CPnOB13 


920870 


921934 


F 


pepP-Aminopeptidase P-(CT574) 


CPn0814 


922107 


923357 


F 


CTS73 hypothetical procein 


CP&0815 


923361 


925622 


F 


gspD/pilQ-Gen. Secretion Protein D-(CT572) 


CPB0816 


925615 


927102 


F* 


gspE-Cen. Secretion Protein E-(CT57l) 


CPn0B17 


927115 


926267 


F 


gspF-Gen. Secretion Protein F-(CT570) 


CFttOBIS 


928314 


926682 


F 


predicted GHP [leader (16) peptide) -(CT5 69) 


C7U0B19 


928669 


929132 


F 


CT568 hypothetical protein 


CPn0820 


929120 


929659 


F 


CTS67 hypothetical protein 


CPn0821 


929667 


930668 


F 


CT566 hypothetical protein 


CPn0822 


930756 


931229 


F 


CT56S hypothetical protein 


CPn0823 


932367 


931501 


R 


yscT/spaR-YopT Tranlocation T- (CT564) 


CPn0824 


932662 


932378 


R 


yscS/fliO-YopS/fliQ Translocation Protein- (CT563) 


CPn0825 


933594 


932677 


R 


yscR-Yop Translocation R-(CTS62) 


CPn0826 


934310 


933612 


R 


yseL-Yop Translocation L-(CT561> 


CPU0827 


935264 


934434 


R 


CT560 hypothetical procein 


CPn0826 


936271 


935267 


R 


yscJ-Yop Translocation J-(CTS59) 


CPnOB29 


936744 


937298 


r 




CPn0830 


937444 


937959 • 


F 




CPn0831 


938267 


938434 


F 




CFn0832 


939747 


938827 


R 


lipX-Lipoate Synthetase- (CT5 56) 


CPn0833 


941129 


939747 


R 


lpdA-Lipoamide Dehydrogenase- (CT557) 


CPn0834 


941553 


942014 


F 


CTS56 hypothetical protein 


CPn0835 


945689 


942045 


R 


motl_l«SWI/SNF family helicase_l- (CT555) 


CPnOB36 


946879 


945722 


R 


brnO-Amino Acid (Branched) Transport- (CT55 4) 


CPn0837 


947771 


947145 


R 


nth- Enodnucl ease IIZ-(CT697) 


CPn083B 


949106 


947781 


R 


thdF-Thlophene/Furan Oxidation Protein- (CT6 98) 


CPn0839 


949257 


950159 


F 


psdD-Phosphatidylserine Decarboxylase- (CT699 ) 


CPn0840 


950222 


951544 


F 


CT700 hypothetical protein 


CPR0841 


951731 


954640 


F 


secA_2-Translocase SecA_2-(CT701) 


CPn0842 


954863 


954710 


R 


CT702 hypothetical protein (frame-shift with 084 3) 


CPn0843 


955191 


954994 


R 


CT702 hypothetical protein 


CPn0844 


956730 


955270 


R 


yphc-GTPase/GTP-binding protein- (CT703) 


CPn08 4 5 


958079 


956650 


R 


pcnB_l-Poly A Polymerase.!- (CT704) 


CPnOB4 6 


959374 


956112 


R 


clpX-CLP Protease ATPase-(CT705) 


CPn0647 


959995 


959387 


R 


clpp-CLP Protease Subunit-(CT706) 


CPa084B 


961502 


960177 


R 


tig/murl -Trigger Faccor-pepcidyl -prolyl isomerase- (CT707 ) 


CPa0849 


961788 


965265 


F 


motl.2-SWI/SNF family heiicsst_2- (CT706 ) 


CPnCB50 


965293 


966390 


F 


nce8-Rod Shape Protein-Sugar Kinase- (CT709) 
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CPn0851 


966396 


96S195 


f 


CPn0852 


966316 


970613 


F 


. CPn0853 


970637 


971803 


F 


CPn0854 


972837 


971806 


R 


CPnOSSS 


973995 


972994 


R 


CPn085 6 


975377 


973995 


R 


CPn0857 


975757 


975392 


ft 


CPnOBSB 


977055 


975757 


ft 


CPn08S9 


977588 


977055 


n 


CPn0860 


978630 


977608 


ft 


CPn0861 


979722 


978925 


ft 


CPn0862 


980873 


979722 


ft 


CPn0863 


981514 


980831 


ft 


CPn0864 


981670 


982374 


p 


CPn0865 


982418 


982942 


p 


CPnOB66 


983491 


982916 


ft 


CPnOB67 


983423 


984667 


p 


CPnOB68 


986643 


984670 


ft 


CPn0869 


987401 


9B665B 


} 


CPn0870 


96B728 


987448 




CPn0871 


988772 


989899 


f- 


CPn0872 


969963 


991216 


p 


CPn0873 


991233 


991694 


p 


CPn0874 


993107 


991749 


p 


CPn087 5 


993372 


994022 


f 


CPnOB76 


994144 


995517 


1 


CPn0877 


995533 


995982 


f 


CPn0878 


996654 


995992 


p 


CPn0879 


997439 


996645 


ft 


CPn0880 


999861 


997444 


ft 


CPnOBBl 


1005667 


1006209 


p 


CPn0882 


1006266 


1007404 


p 


CPn0883 


100B86S 


1007573 


ft 


CPn0884 


1009359 


1009009 


ft 


CPn0885 


1010635 


1009433 


ft 


cpnoaee 


1011276 


1010908 


ft 


CPn0887 


1011692 


1014157 


p 


CPn0888 


1015423 


1014119 


R* 


CPnOB89 


1016835 


1015462 


ft 


CPn0890 


1017B0S 


1016819 


ft 


CPn0891 


1021073 


1017819 


ft 


CPn0892 


1023661 


1021046 


R 


CPn0893 


1023894 


1025888 


F 


CPn0894 


1026766 


1025886 


R 


CPn089S 


1026988 


1027557 


F 


CPH0896 


1027595 


1027622 


F 


CPnOB97 


1028737 


1027853 


R 


CPn0898 


1030460 


1028904 


R 


CPn0899 


103087 5 


1032215 


F 


CPn0900 


1032235 


1033261 


F 


CPU0901 


1033287 


1034537 


F 


CPn0902 


1034543 


1035241 • 


F 


CPn0903 


1035263 


1036417 


F 


CPn0904 


1036326 


1037396 


F 


CPnOBOS 


1037409 


1039835 


F 


CPn0906 


1040340 


1039915 


R 


CPn0907 


1040780 


1040445 


R 


CPn0908 


1041589 


1040780 


R 


CPn0909 


1041637 


1041966 


r 


CP&0910 


1041979 


1043004 


P 


CPH0911 


1044043 


1042985 


R 


CPn0912 


1044129 


1045760 


F 


CPn0913 


:045760 


1045945 


F 


CPn09l4 


1045999 


1046397 


r 


CPn0915 


1046461 


1046817 


F 


CPn0916 


1046837 


1048084 


F 


CPn09l7 


1048090 


1048539 


F 


CPn091B 


1049223 


1048579 


R 


CPn09l9 


1049378 


1050430 


F 


CPn0920 


1051405 


1050431 


R 


CPn092l 


1C51535 


1052293 


F 



pckA-Phosphoenolpyruvace Cartooxykinaae- (CT710) 

CT711 hypothetical protein 

CT712 hypothetical protein 

ompB-Oucer Membrane Protein B-(CT713) 

gpdA-Giycerol-3-P Dehydrogenase- (CT714) 

AgX-1 Homolog-UDP-Glucose Pyrophosphorylase- (CT715) 

CT716 hypothetical protein 

fliX-Flagellum-apecilic ATP Synthase- (CT7 17) 
CT718 hypothetical protein 

fliF-Flagellar M-Ring Protein- (CT7 19 ) 

nifU-Ni£U-related protein- (CT720) 

yfhO_2-NifS-related protein_2- (CT721J 

pgmA-Phoaphogly cerate Mutase- (CT722) 

yjbC -predicted pseudouridine synthase- (CT72 3) 

CT724 hypothetical procein 

birA-Biotin Synthetase- (CT72 5) 

rodA-Rod Shape Protein- (CT726J 

zntA/cadA-Metal Transport P-type AT Pas e- (CT727) 

CT728 hypothetical protein 

serS-Seryl tRNA Synthetase^- (CT72 9) 

ribD-Ribo flavin Deaminase- (CT730) 

ribAtribB-CTP Cyclohydrataee a DHBP Synthase -(CT731) 

ribE-Ribityllumazine Synthase - (CT732 ) 

CT733 hypothetical protein 

CT734 hypothetical protein 

dagA_2-0- Alanine/Glycine Permaase.2- (CT735) 

ybcL family- (CT73 6) 

SET Domain protein- (CT737) 

yycJ -metal dependent hydrolase- (CT73 8 1 

ftsK-Cell Blvision Protein FtsK-(CT739) 



dmpP/nqr6-Phenolhydrolase/HADH ubiquinone oxi do reductase- (CT740) 

CT741 hypothetical protein 

ygcA-rRNA Methyltransferse-(CT742) 

hctA-Kis tone- Like Developmental Protein- (CT743 > 

CHLTR possible phoiphoprotein- (CT744) 

hemG-protoporphyrinogen Oxidase- ICT745) 

hemN_2-Coproporphyrinogen III Oxidase_2- (CT746) 

hemE-Uroporphyrinogen Decarboxylase- (CT7 47) 

mfd-Transcription-Repair Coupling- (CT748) 

alaS-Alanyl tRNA Synthetase- (CT74 9 ) 

tktB -Trans ketolase- (CT750) 

awn -AMP Nucleosidase- (CT7 51) 

efp_2- Elongation Factor P_2- (CT752 ) 
CT753 hypothetical protein 

(possible phosphohydrolase) - (CT754 ) 
Mitochondrial HSP60 Chaperonin Homo log- (CT755) 
murF-Muramoyl-DAP Ligase- (CT756) 
mraT-Muramoyl-Pentapeptide Transferase- (CT757) 
murD-Muramoylalanine-Glutamate Ligase- (CT758) 
nlpD-Muramidase (invasin repeat family )- (CT759) 
ftsW-Cell Division Protein FtsW-{CT760) 
murG-Peptidoglycan Transferase- (CT761) 

murC&ddlA-Muramate-Ala Ligase a D-Ala-D-Alam Ligase- (CT762) 
CT763 hypothetical protein 

•cutA Periplasmic Divalent Cation Tolerance Protein CutA (C-Type 

Cytochrome Biogenesis Protein) 
CT764 hypothetical protein 
rsbV_2-Sigma Factor Regulator.2- (CT765) 
mi aA- tRNA Pyrophosphate Transf erase- (CT766) 
Fe-S cluster oxidoreduc taae.2 - <CT767) 
CT768 hypothetical protein 



ybeB-iojap superfamily ortholog- (CT769) 
fabF-Acyl Carrier Procein Synthase -<CT770) 
hydrolase /phosphatase homolog- (CT771) 
ppa- Inorganic Pyrophosphatase- (CT772) 
ldh-Leucine Dehydrogenase- (CT77 3) 

cysO-SuUire Synthesis /biphosphate phosphatase- (CT774) 
snGlyceroi-3-P Acyl transferase- (CT775) 
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CPn0922 


1052314 


1053927 


F 


aas-Acylglycerophosphoethanolamine Acy.'tranaf erase- (CT776) 


CPU0923 


10S39B4 


1055093 


r 


bioF_l -Oxononanoate Synthase.: - (CT777 J 


CPn0924 


1057274 


1055026 


R 


priA-primosomel Protein N* -(CT778) 


CPn0925 


1057900 


1057226 


R 


CT779 hypothetical protein 


CPn0926 


1058060 


1058557 


F 


Thioredoxin Disulfide Isomerese- (CT7B0) 


CPn0927 


1059809 


1058670 


R 


•CHLPS 43 fcDa protein homolog_2 


CPn0928 


1061008 


1059884 


R 


•CHLPS 43 kOa protein homolog_3 


CPn0929 


1062292 


1061186 


R 


•CHLPS 43 kDa protein homology 


CPn0930 


1062857 


1063330 


F 




CPn0931 


1064138 


1065718 


F 


lysS-Lyiyl tRHA synthetase- (CT781) 


CPn0932 


1067142 


1065721 


R 


cysS-Cyscainyl CRN A Synthetase- (CT782 J 


CPH093 3 


1067535 


1068576 


F 


predicted disulfide bond isome,rase-(CT783) 


CPn0934 


1068942 


1068526 


R 


rnpA-Ribonuc lease P Protein Component - < CT784 ) 


CPn0935 


1069091 


1068957 


R 


rl34-L34 Rlbosoraal Protein- (CT7 65) 


CPn0936 


1069336 


1069470 


F 


rl36-L36 Ribosomal protein- (CT786) 


CPn0937 


.1069496 


1069798 


F 


rsl4-514 Ribosomal Protein- (CT7 87) 


CPn0938 


1070322 


1069849 


R 


CT78B hypothetical protein - [leader (60) peptide-periplasmicj 


CPn0939 


1070728 


1071195 


F 


CT790 hypothetical protein 


CPn0940 


1073012 


1071204 


R 


uvrC-Excinuclease ABC* Subunit C-(CT791) 


CPn094X 


1075501 


1073016 


R 


mutS-DNA Mismatch Repair- (CT7 92) 


CPn0942 


1075985 


1077754 


F 


dnaG/priM-DNA Primase-(CT794> 


CPn0943 


1077978 


1078238 


F 


CT794.1 hypothetical protein 


CPn0944 


1078512 


1078997 


F 




CPn0945 


1079070 


1079660 


F 


CT79S hypothetical protein 


CPn0946 


1082786 


1079745 


R 


glyQ-Glycyl tRNA Synthetase- (CT7 96) 


CPn094 7 


1083442 


10B4059 


F 


pgsA_2 -Glycerol -3 -P-Phosphatydyltransf eras e_2- (CT797 ) 


CPn094B 


1085474 


1084047 


R 


glgA -Glycogen Synthase- (CT79 8) 


CPn0949 


1085929 


1086483 


F 


ctc-Ceneral Stress Protein- (CT7 99) 


CPn0950 


1086488 


1087027 


F 


pth-Peptidyl tRNA Hydrolase- (CT800) 


CPn09Sl 


1087122 


1087457 


F 


rs6-S6 Ribosomal Protein- (CT801) 


CPn0952 


1087478 


1087723 


F 


rsia-Sie Ribosomal protein- (CTB02 ) 


CPn0953 


1087742 


1066248 


F 


rl9-L9 Ribosomal Protein- (CT803 ) 


CPn09S4 


1088286 


1068708 


F 


ychB-Predicted Kinase- (CT804 ) 


CPn0955 


1086612 


1089175 


F 


(frame-shift with 0954) 


CPn0956 


1069560 


1090909 


F 


CT8 05 hypothetical protein 


CPn0957 


1093788 


1090963 


R 


ide/ptr-Insuiinase family/Protease III-(CTB06) 


CPn0958 


1094785 


1093793 


R 


plsB-Glycerol-3-P Acyltransf erase- (CT807 ) 


CPn0959 


1096343 


1094799 


R 


cafE-Axial Filament Protein- (CT80 8) 


CPnO960 


1096764 


1097102 


F 


CT809 hypothetical protein 


CPn0961 


1097118 


1097297 


F 


rl32-L32 Ribosomal Protein- (CT8 10) 


CPn0962 


1097316 


1098275 


F 


plsX-FA/ Phospholipid Synthesis Protein- (CTB11) 


CPn0963 


1098398 


1103224 


F 


pmp_21- Polymorphic Outer Membrane Protein D Family- (CT812) 


CSH0964 


1104758 


1103301 


R 




CPn0965 


1106736 


1104925 


R 


lpxB-Lipid A Oisaccharide Synthase- (CT411) 


CPn0966 


1106037 


1106748 


R 


pcnB_2-PoiyA Polymerase_2- (CT410) 


CPn0967 


1108512 


1109685 


F 


mrsA/pgm-Phosphoglucomutase- (CT815 ) 


CPn0968 


1109695 


1111721 


F 


glmS-Glucosamine-Fruccose-6-P Aminotransferase- (CT81 6) 


CPn0969 


1111812 


1112999 


F 


0969-cyrP.l -Tyrosine Transports- (CT817) tyrP_l -Tyrosine Transport_l- 
(CT817) 


CPn0970 


1113461 


1114648 


F 


0970-tyrP_2 -Tyrosine Transport^- (CT8 18) tyr P. 2 -Tyrosine Tranaport_2- 
(CT81B) 


CP&0971 


1114702 


1115415 ' 


F 


yccA-Transport Permease- (CT819 ) 


CPn0972 


1116299 


1115430 




ftsY-Cell 01 vision Protein FtsY-<CT820) 


CPn0973 


1116370 


1117527 


F 


sucC-Succinyl-CoA Synthetase. Beta-(CT821) 


CPn0974 


1117544 


1118422 


f 


sucD-Succinyl-CoA Synthetase. Alpha- (CT822) 


CPn097S 


1119104 


1119637 


F 




CPn0976 


1120082 


1121185 


F 




CPn0977 


1121371 


1122402 






CPn0978 


1122665 


1123693 






CPn0979 


1123980 


1125443 




htrA-00 Serine Protease- (CT82 3) 


CPn0980 


1126982 


1125504 




•similarity to Saccharomyces serevisiae hypothetical 52.9KD protein 


CPn0981 


1127031 


1129952 




Zinc Metalloprotease (insulinase family) - (CTB24 ) 


CPn09B2 


1131194 


1129962 




yigN family- (CTB25 ) 


CPn0983 


1132000 


1131206 




pssA-Glycerol-Serine Phosphatidyl transferase- (CT626) 


CPn0984 


1132379 


1135510 




nrdA-Ribonucleoside Reductase. Large Chain- (CT827) 


CPn0985 


1135534 


1136571 




nrdB-Ribonucleoside Reductase. Small Chain- (CT82B) 


CPn0986 


1136724 


1137395 




yggH-predicted rRKA Methylase- (CTB291 


CPn0937 


1137516 


1138115 




ytgB-like predicted rRNA methylase- (CT8 30) 


CPH0988 


1136986 


113B07S 




murB-UDP-N- Ace tylenolpyruvoylglucos amine Reductase- (CT831I 


CPn0989 


1139495 


1139016 




CT832 hypothetical protein 


CPn0990 


1139683 


1140440 




infC-initiation Factor 3-(CT833) 


CPn0991 


1140421 


1140612 




rl35-L35 Ribosomal Protein- (CT834) 
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CPn099: 


1140634 


1140996 


r 


rl20-L20 Ribosomal Protein- (CT835 ) 


CPHP993 


-1141014 


1142030 


p 


pheS- Phenyl alanyl CRN A Synthetase. Alpha- (CTB3 6) 


CPn0994 


1142398 


1144440 


F 


CT837 hypothetical protein 


CPn099 5 


1145512 


1144415 


R 


CT838 hypothetical protein 


CPO0996 


1146589 


1145519 


R 


CT839 hypothetical protein 


CPn0997 


1146706 


1147664 


P 


mesJ-pp-loop super family ATPase- (CT840) 


CPn0998 


1147855 


1150584 


P 


ftsH- ATP -dependent zinc protease- (CT841) 


CPn0999 


1152847 


1150766 


R 


pnp- Polyribonucleotide Nucleotidyl transferase- (CT642I 


CPnlOOO 


1153157 


1152891 


R 


rslS-SIS Ribosomal Protein- (CT843) 


CPnlOOl 


1153405 


1153869 


P 


yfhC-cytosine deaminase- (CT844) 


CPnl002 


1153862 


1154089 


F 


CT845 hypothetical protein 


CPnl003 


1154796 


1154092 


R 


CT846 hypothetical protein 


CPnl004 


1155397 


1154879 


R 


CTB47 hypothetical protein 


C£nlO05 


1155933 


1155415 


R 


CT848 hypothetical protein 


CPnl006 


1156472 


1155990 


R 


CT849 hypothetical protein 


CPnlOO? 


1156689 


1156907 


F 


CT849.1 hypothetical protein 


CPnlOOB 


1156928 


1158223 


F 


CT850 hypothetical protein 


CPM009 


1159058 


1158186 


R 


map-Methionine Aminopeptidase- (CTBS1) 


CPnlOlO 


1159672 


1159067 


R 


CTBS2 hypothetical protein 


CPnlOll 


1160306 


1159902 


R 


CTB53 hypothetical protein 


CPnl012 


1162193 


1160421 


R 


yxeB-ABC transporter permease- (CT8 541 


CPnl013 


1162245 


. 1163624 


F 


fumC-Fum&rate Hydratase- (CT855). 


CPnl014 


1165426 


1163732 


R 


ychM-Sulfete Transporter- CCT85 61 


CPnl015 


1165634 


1166893 


P 


CT857 hypothetical protein (possible ZM protein) 


CPnlOl6 


1167042 


1168898 


F 


CT85B hypothetical protein 


CPnl017 


1169006 


1169935 


F 


lytB-Metalloprotease- (CT8S9) 


CP&1018 


1169898 


1170629 


F 




CPnl019 


1172128 


1170638 


R 


CT860 hypothetical protein 


CPnlO20 


1173679 


1172150 


R 


CT861 hypothetical protein 


CPnl021 


1174213 


1173698 


R 


lcrH_2-Low Calcium Response_2- tCT862) 


CPnl022 


1175673 


1174216 


R 


CT863 hypothetical protein 


C?nl023 


1176035 


1176331 


F 




CPnl024 


1177236 


1176334 


R 


xerD-Xntegrasa/recombinase- (CT864 ) 


CPnl025 


1177302 


1178879 


r 


pgi-Glucose-6-F Isomerase- (CT37B) 


CPnl026 


1178997 


1179137 


F 


ltuA-(CT377) 


CPnl027 


1179175 


1180755 


F 




CPnl028 


1181016 


1181999 


F 


mdhC-Halate Dehyrogenase- (CT376) 


CPnl029 


1182008 


1182844 


P* 




CPnl030 


1183886 


1182843 


R 


predicted D- amino acid dehyrogenase- (CT375) 


CPnlOSI 


1185552 


1184098 


R 


arcD-Arginine /ornithine Antiporter- (CT374 ) 


CPH1032 


1186150 


1185566 


R 


CT373 hypothetical protein 


CPnl033 


1187500 


1186187 


R 


CT372 hypothetical protein 


CPnl034 


1188517 


1187732 


R 


Predicted 0KP_1 (CT371) [laader (IB) peptide) 


CPnl035 


1190000 


1188570 


R 


AroE-Shikimate 5 -Dehyrogenase- (CT3 70) 


CPnl036 


119113S 


1189984 


R 


AroB-Dehyroquinate Synthase- (CT369) 


CPnl037 


1192199 


1191123 


R 


AroC-Chorismate Synthase- (CT3 68) 


CPnl038 


1192726 


1192199 


R 


aroL-Shikimate Kinase IZ-(CT367) 


CPnl039 


1193999 


1192665 


R 


aroA-Phosphoshikiaate Vinyl transferase- (CT3 66 ) 


CPnl040 


1194741 


1194073 


R 




CPnl041 


1195994 


1194726 


R 


# bioA*AdenosylAethionine-B-Amino-7-Oxononanoate Aminotransferase 


CP&IQ42 


J.196590 


1195934 


R 


*bioO-dethiobiotin synthetase 


CPnl043 


1197717 


1196572 • 


R 


bioF w 2-Oxononanoate Synthase_2 


CP&1044 


1198691 


1197699 


R 


•bioB-Biotin Synthase 


CPnl045 


1199590 


1198901 


R 


'conserved hypothetical bacterial membrane protein 


CPnl046 


1200675 


1199590 


R 


•Tryptophan Hyroxylase 


CPnl047 


1200552 


1201343 


F 


dapB-Dihydrodipicolinate Reductase- (CT364) 


{.rxixus o 


1201606 


1202604 


F 


asd- Aspartate Dehydrogenase- (CT3 63 ) 


CPnl049 


1202S9S 


1203914 




lysC-Aspartokinase III- (CT3 62 ) 


CPnlOSO 


1203926 


1204796 




dapA-Dihydrodipicolinate Synthase- (CT3 61) 


CPnlOSl 


1204962 


1205270 






CPnl052 


1205417 


1206169 






CPnlOS3 


1206153 


1206701 






CPnlOS4 


1207034 


1209466 






CPalOSS 


1209694 


1210521 






CPnlOS6 


1210527 


1211228 






CPnl057 


1211497 


1213596 




CT356 hypothetical protein 


CPnlOSB 


1213748 


1214836 




CT355 hypothetical protein 


CPnlOS9 


1214848 


1215678 




kgsA -Dimethyl adenosine Trans f erase - (CT3 5 4 ) 


CPnl060 


1217658 


1215727 


R 


dxs/ tkt -Trans tee tolase- (CT33 1 ) 


CPnl06I 


1217920 


1217666 


R 


CT330 hypothetical protein 


CPnl062 


1219820 


121B1S9 


R 


xseA-Exodoxyribonuclease Vli-(CT329) 


CPnl063 


1219951 


1220712 


r 


tpiS-Triostphosphate Isomerase- (CT328) 
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CPnl064 


1220719 


1220895 


r 


CPnl06S 


1221095 


1220928 


R 


CPnl066 


1221135 


1221488 


r 


CPal067 


1221735 


1222292 


r 


CPnl066 


1223256 


1222365 


R 


CPnl069 


1223513 


1223941 


r 


CPnl070 


1225511 


1224144 


R 


CPU1071 


1227324 


1225885 


R 


CPnl072 


1227969 


1228B3S 


F 


CPnl073 


1229011 


1229832 


F 



del -Polypeptide D«formyl*»e- (CT353 ) 
rnha_2-Ribonucleaee HZI.2- (CT008) 
yfoA-HTH Transcriptional Regulator- (CT009) 



Predicted 0MP_2 -CCT371) 
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Tabic 2 (Supplemental Data) Functional Assignments of C pneumoniae Coding Sequences. C trachomatis genes are shown in 
parentheses. 



Amino Acid Biosynthesis 



Aromatic Family 



10 



15 



20 



25 



30 



35 



40 



45 



50 



1039 


(CT366) 


sroA 


Phosphoshikimate Vinyltransferase 


1036 


(CT369) 


iroB 


Dehyroquinate Synthase 


1037 


(CT368) 


aroC 


Chorismate Synthase 


I03S 


(CT370) 


iroE 


Shikimate 5-Dehyrogenase 


0484 


(CT382) 


aroG 


Deoxyheptonate Aldolase 


1038 


(CT367) 


aroL 


Shikimate Kinase II 


0740 


(CT637) 


tyrB 


Aromatic AA Aminotransferase 


Aspartate family (lysine) 




1048 


(CT363) 


asd 


Aspartate Dehydrogenase 


1050 


(CT361) 


dapA 


Dihydrodipicolinaie Synthase 


1047 


(CT364) 


dapB 


Dihyirodipieolmate Reductase 


0519 


(CT430) 


dapF 


Dian tnoptmelate Epimerase 


1049 


(CT362) 


lysC 


As pa to kinase III 


Serine Family 






0433 


(CT282) 


gcsH 


Ciyc ne Cleavage System H Protein 


0521 


(CT432) 


giyA 


Serine Hydroxymethyl transferase 



Base A Nucleotide Metabolism 



0171 




guaA 


GMP Synthase 


0172 




guaB 


Inosine 5'*Monophosphase Dehydrogenase 


0608 






Uridine ^-Monophosphate Synthase 


0735 






Uridine Kinase 


0244 


(CT128) 


adk 


Adenylate Kinase 


0894 


(CT751) 


amn 


AMP Nucleosidase 


0568 


(CT452) 


cmk 


CMP Kinase 


0392 


(CT039) 


ded 


dCTP Deaminase 


0059 


(CT292) 


dm 


dUTP Nucleoodohydrolase 


0120 


(CT030) 


gmk 


GMP Kinase 


0619 


(CT500) 


ndk 


. Nucleoside-2-P Kinase 


0984 


(CT827) 


nrdA 


Ribonucleoside Reductase. Large Chain 


0985 


(CT828) 


nrdB 


Ribonucleoside Reducone. Small Chain 


0236 


{CTI83) 


pyrG 


CTP Synthetase 


0698 


(CT678) 


pyrH 


UMP Kinase 


0273 


(CT188) 


tdk 


Thymidylate Kinase 


0659 


(CT539) 


trxA 


Thioredoxin 


0314 


(CT099) 


trxB 


Thioredoxin Reductase 


1001 


(CT844) 


yfhC 


Cy cosine Deaminase 








Biosynthesis of Cofactori 


iotin. Lipoate A Ubiquinone 




1041 




btoA 


Adcnosyimemioninc-8-Amiiu>-7-Oxononanoate Aminotransferase 


1044 




bioB 


B iotin Synthase 


1042 




bioO 


Dethiobiotin Synthetase 


0923 


(CH77) 


bioFJ 


Oxononanoate Synthase 1 


1043 


(CT777) 


btoF.2 


Oxononanoate Synthase_2 


0866 


(CT725) 


birA 


Biorin Synthetase 


0748 


(CT628) 


ispA 


Geranyl Transoansferase 


0832 


(CT558) 


lipA 


Lipoate Synthetase 
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10 



15 



20 



25 



0265 (CT219) 

0264 (CT220) 

0513 (CT428) 
Folic Acid 

0759 (CT613) 

0335 (CT078) 

0758 (CT613) 

0757 (CT6I4) 

0763 (CT649) 



0714 (CT662) 

0744 (CT633) 

0052 (CT299) 

0890 (CT747) 

0888 (CT745) 
0138 (CT210) 
0380 (CT052) 

0889 (CT746) 
0603 (CT485) 

Riboflavin 

0872 (CT73 1 ) 
0532 (CT405) 
0871 (CT730) 

0873 (CT732) 
0320 (CT093) 



ubiA 
ubiO 
ubi£ 

folA 
folO 
fo(P 
foIX 

ygfA 

hcrrvA 



hemE 
hemC 
hemL 



Benzoate Octapheny I transferase 
Phenylacrylate Decarboxylase 
Ubiquinone Methyl transferase 

Dihydrofolate Reductase 
Methylene Tetrahydrofolate Dehydrogenase 
Dihydropteroite Synthase 
Dihydroneopterin Aldolase 
FormyliePahydrofolate Cycloligase 



Glutamyl tRNA Reductase 
Porphobilinogen Synthase 
Porphobilinogen Deaminase 
Uroporphyrinogen Decarboxylase 
pTOtoporphyrinogen Oxidase 
Glutamate- 1 -Semialdehyde-2, 1 -Amino mutase 
hemNJ Coproporphyrinogen III OxidaseJ 
hemN_2 Coproporphyrinogen 1 1 1 Oxidase_2 
hemZ 



nbA&ribB GTP Cyclohydratase & DHBP Synthase 
ribC Riboflavin Synthase 
hbO Riboflavin Deaminase 
ribE Ribiryllumszine Synthase 
nbF FAD Synthase 



Cell Envelope 

Fatty Acid & Phospholipid Metabolism 





0161 


(CT206) 




(predicted acyUransferase family) 


30 


0912 


(CT776) 


aas 


Acylglycerophosphoeihanolamine Acyltransferase 




0414 


(CT265) 


accA 


AcCoA Carboxylase/Transferase Alpha 




0183 


(CT123) 


accB 


Biorin Carboxyi Carrier Protein 




0182 


(CT124) 


accC 


Biotin Carboxylase 




0058 


(CT293) 


accD 


AcCoA Carboxylase/Transferase Beta 


35 


0295 


(CT236) 


acpP 


Acyl Carrier Protein 




0313 


(CT100) 


acpS 


Acyl-carrier Protein Synthase 




0567 


(CT45I) 


cdsA 


Phosp had dale Cytidylytransferase 




0297 


(CT238) 


fabD 


Malonyl Acyl Carrier Transcyclase 




0916 


(CT770) 


fabF 


Acyl Carrier Protein Synthase 


40 


0296 


(CT237) 


fabG 


Oxoacyl (Carrier Protein) Reductase 




0298 


(CT239) 


fabH 


Oxoacyl Carrier Protein Synthase HI 




0406 


(CT104) 


fibl 


Enoyl-Acyl-Carrier Protein Reductase 




0651 


(CT532) 


fibZ 


Myris toy I- Acyl Carrier Dehydratase 




0098 


(CT010) 


htrB 


Acyltransferase 


45 


0271 


(CT136) 




Lysophospho lipase Esterase 




0615 


(CT496) 


pgsAj 


Glyccrol-3-P Phosphatidyhsansferase J 




0947 


(CT797) 


Pg»A_2 


Glycerol-3-P Phosphaty dy loans ferase_2 




0958 


(CT807) 


plsB 


Glycerol-3.P Acylcransferise 




0569 


(CT453) 


plsC 


Glycerol -3-P Acy (transferase 


50 


0962 


(CT8U) 


pIsX 


FA/Phospho lipid Synthesis Protein 




0839 


(CT699) 


psdD 


Phosphaddylserine Decarboxylase 




0983 


(CT826) 


pSSA 


Glycerol-Serine Phospharidyloransf erase 




0921 


(CT775) 




snGlyccrol-3-P Acyltransferase 




0654 


(CT535) 


yciA 


Acyl-CoA Thioesterase 


55 


0877 


(CT736) 


ybcL 


CT736 Hypothetical Protein 



LPS 
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0154 


(CT208) 


gseA 


KDO Trans feme 




0721 


(CT655) 


kdsA 


KDO Synthetase 




0235 


(CT182) 


kdsB 


Deoxyoctulomnic Acid Synthetase 




0650 


(CT531) 


IpxA 


Acyi-Cerrier UDP-GlcNAc O-Acyltransferase 


5 


0965 


(CT41 1) 


ipxfi 


Lipid A Dbaccharide Synthase 




0652 


(CTS33) 


IpxC 


Myristoyi GlcNac Deacetyltse 




0302 


(CT243) 


ipxD 


UDP Glucosamine N-Acyl transferase 




Membrane Proteins, Lipoproteins A Portia 




0310 


(CT251) 


60IM 


60kDa Inner Membrane Protein , 


10 


0556 


(CT442) 


crpA 


1 5kDa Cysteine -Rich Protein 




0653 


(CT534) 


cut£ 


Apoltpoprotein N-Acetyl transferase 




0311 


(CT252) 




Proiipoprotem Diacyl glycerol Transferase 




0558 


(CT444) 


omcA 


9kDa-Cysteme-Rich Lipoprotein 




0557 


(CT443) 


omcB 


60kDa Cysieine-Rich OMP 


15 


0695 


(CT681) 


ompA 


Major Outer Membrane Protein 




0854 


(CT713) 


ornpB 


Outer Membrane Protein B 




0781 


(CT6O0) 


pal 


Pepddoglycan-Associated Lipoprotein 




0300 


(CT241) 


yieT 


Omp85 Homolog 




Pepiidogiycan 






20 


0417 


(CT268) 


ami A 


N-Acerytmuramoyl Alanine Ami due 




0780 


(CT601) 


amiB 


N-Aceryimuramoyl-L-Ali Amidase 




0677 


(CT551) 


dacF 


D-Ali-D-Ala Caroxypepridase 




0968 


(CT816) 


glmS 


Glucosamine- Fructose-6-P Aminotransferase 




0749 


(CT629) 


glmU 


UDP-GlcNAc Pyrophosphorylase 


25 


0900 


(CT757) 


mraY 


MuramoyUPentapepdde Transferase 




0571 


(CT455) 


mux A 


UDP-N- Acetyl glucosamine Transferase 




0988 


(CT831) 


murB 


UDP-N-Acerylenolpyruvoylglucosamine Reductase 




0905 


(CT762) 


murC&ddlA Muramate-Ala Ligase & D-Ala-D-Alam Ligase 




0901 


(CT758) 


murD 


Muramoylalanine-Clutamate Ligase 


30 


0418 


(CT269) 


murE 


N-Acerylmuramoylalanylglutamyl DAP Ligase 




0899 


(CT756) 


murf 


MuramoyUDAP Ligase 




0904 


(CT761) 


muiG 


Pcptidoglycan Transferase 




0902 


(CT759) 


nlpD 


Muramidase (invasin repeat family) 




0694 


(CT682) 


Pbp2 


PB P2-Transglycolase/Transpepridas e 


35 


0419 


(CT270) 


Pbp3 


Trans glycokasc/Transpeptrdase 




0421 


(CT272) 


yabC 


PBP2B Family Methylnnsferase 










Cellular Processes 




Cell Division 






40 


0959 


(CT808) 


cifE 


Aaial Filament Protein 




0880 


(CT739) 


fttK 


Cell Division Protein FtsK 




0903 


(CT760) 


ftsW 


Cell Division Protein FtsW 




0972 


(CT820) 


niY 


Cell Division Protein FtsY 




0617 


(CT498) 


gidA 


FAD-dependent Oxidortductase 




0805 


(CT582) 


minD 


Chromosome Partitioning ATPase 




0850 


(CT709) 


mrc B 


Rod Shape Protein-Sugar Kinase 




0867 


(CT726) 


rodA 


Rod Shape Protein 




0684 


(CT688) 


parB 


Chromosome Partitioning Protein 




Detoxtifteoiion 






50 


0057 


(CT294) 


sodM 


Superoxide Dismutase (Mn) 




0778 


(CT603) 


ahpC 


Tmo-specific Antioxidant (TSA) Peroxidase 




Signal Transduction 








0148 


(CT145) 




S/T Protein Kinase 




0584 


(CT467) 


atoS 


Two-Component Sensor 


55 


0294 


(CT235) 




cAMP-Dependent Protein Kinase Regulatory Subui 




0712 


(CT664) 




(FHA domain) 
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0478 




hflX 


\j l r DtTTGJng r TO lci n 


0703 


(CT673 ) 




Of i r njLcin Mime 


0095 






I r rule in Mnijc 


mo? 

uiiy / 






PP2C Phosphatase Family 


0037 


(CTJ37) 


ptsH 


PTS Phosphocatrier Protein Hpr 


0038 


(CT336) 




PTS PPP Phncnhntramfrrate 


0060 


(CT291) 


ptsNJ 


PTS II A Protein 1 


0061 


(CT290) 


ptsN_2 


PTS II A Protein * KTH DNA-Binding Domain 


0262 


(Cni8) 


su/E 


SurE -I ike Acid Phosphatase 


0838 


(CT698) 


thdF 


Thiophenc/Furen Oxidation Protein 


0693 


(CT683) 




TPR Repcats«CT683 Hypothetical Protein 


0321 


(CT092) 


ychF 


GTP Binding Protein 


0544 


(CT418) 


yhbZ 


GTP binding protein 


0844 


(CT703) 


yphC 




Standard Protein Secretion 




0115 


(CT025) 


ffh 


Signal Recognition Particle GTPase 


0363 


(CT060) 


nhA 


Rage liar Secretion Protein 


0858 


(CT717) 


tin 


Fla gel I um -specific ATP Synthase 


0704 


(CT672) 


fliN 


Flagellar Motor Switch Domatn/YscQ family 


0815 


(CT572) 


gspD 


Gen. Secretion Protein D 


0816 


(CT571) 


gspE 


Gen. Secretion Protein E 


0817 


(CT570) 


gspF 


Gen. Secretion Protein F 


0359 


(CT064) 


IcpA 




0110 


(CT020) 


lepB 


Signal Peptidase 1 


0535 


(CT408) 


IspA 


Lipoprotein Signal Peptidase 


0260 


(CT141) 


secAJ 


Protein Translocate Sub unit t 


0841 


(CT701) 


sccA_2 


Trans Incise See A 2 


0564 


(CT448) 


secD&se< 


•F Pmlnn Exnort Proteins SeeFVSeeF ( fusion } 


0075 


(CT321) 


secE 


rib^HWUl ft l mnw mj^ 


0629 


(CT510) 


secY 




0848 


(CT707) 


t»g 


Tri Offer FsetAr«Penfidvl>nra]vI luirttntfi 


Transport-Related Proteins 




0486 






Hypothetical Proline Permease 


0289 


(CT230) 


aaaT 


Neutral Amino Acid (Glutamate) Transporter 


0691 


(CT685) 


abcX 


ABC Transporter ATPase 


1031 


(CT374) 


arcD 


Arg in ine/Omi thine An n porter 


0482 


(CT381) 


arU 


Arginine Periplasmtc Binding Protein 


0836 


(CT554) 


bmQ 


Amino Acid (Branched) Transport 


0536 


(CT409) 


dagAJ 


rv a i */r*ii u p ■ < ■ ■■■ ■! ■ i 
l>-Aia»vJiy rCi iikASC_ 1 


0876 


(CT735) 


dagA_2 


D- Alanine/Glycine Permease_2 


0682 


(CT690) 


dppD 


ABC ATPase Dipcpbde Transport 


0683 


(CT689) 


dppF 


ABC ATPase Drpeptide Transport 


0280 


(CT689) 


dppF 


D ipeptide Transporte r ATPase 


0785 


(CT596) 


exbB 


Macro molecule Transporter 


0784 


(CT597) 


exbD 


Bio polymer Transport Protein 


0604 


(CT486) 


OiY 


Gtutamine Binding Protein 


0192 


(CT129) 


glnP 


ABC Amino Acid Transporter Permease 


0191 


(CT130) 


gmQ 


ABC Amino Acid Transporter ATPase 


0528 


(CT401) 


gUT 


Glutamate Sympon 


0286 


(CT194) 


mgi£ 


Mg** Transporter (CBS Domain) 


0413 


(CT264) 


nub A 


Transport ATP Binding Protein 


0290 


(CT231) 




Na*-dependeni Transporter 


0195 


(CT198) 


oppAJ 


Oligopeptide Binding Protein_l 


0196 


(CTI98) 


oppA.2 


Oligopeptide Binding Protein_2 


0197 


(CTI39) 


0ppA_3 


Oligopeptide Binding Protein_3 


0198 


(CTI75) 


oppA.4 


Oligopeptide Binding Protein A 
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0599 


(CT480) 


oppA.5 


Oligopeptide Binding L:ooprotein_3 


0199 


(CT199) 


oppBJ 


Oligopeptide Permease_l 


0598 


(CT479) 


oppB_2 


Oligopeptide Permease_2 


0200 


(CT200) 


oppC_l 


Oligopeptide Permease J 


0597 


(CT478) 


oppC.2 


Oligopeptide Permease _2 


0201 


(CT201) 


oppD 


Oligopeptide Transport ATPase 


0202 


(CT202) 


oppF 


Oligopeptide Transpon ATPase 


0231 


(CT180) 


tauB 


ABC Transpon ATPase fNitrate/Fe) 


0782 


(CT599) 


tolB 


Macrotnolecule Transporter 


0969 


(CT817) 


tyrP.i 


Tyrosine TransporM 


0970 


(CT8.I8) 


tyrP.2 


Tyrosine Transport^ 


0665 


(CT544) 


uhpC 


Hexosphosphate Transpon 


0282 


(CT216) 


xuA 


Anrina Acid Transporter 


0207 


(CT204) 


ybhl 


dicarboxylate Tnnslocator 


0971 


(CT819) 


yecA 


Tranrport Permease 


0248 


(CT152) 


ycfV 


ABC Transporter ATPase 


1014 


(CT856) 


ychM 


Sulfa e Transporter 


0736 


(CT641) 


ygeD 


Efrtur. Protein 


0680 


(CT692) 


ygo4 


Phospiate Permease 


0723 


(CT653) 


yhbG 


ABC Transporter ATPase 


0023 


(CT348) 


yjjK 


ABC Transporter Protein ATPase 


0127 


(CT034) 


ytff 


Carici jc Amino Acid Transporter 


0349 


(CT067) 


ytgA 


Solub Protein Binding Family 


0348 


(CT068) 


ytgB 


ABC nnsporter ATPase 


0347 


(CT069) 


yt«c 


lntegril Membrane Protein 


0346 


(CT070) 


ytgD 


Integral Membrane Protein 


1012 


(CT854) 


yzeB 


ABC Transporter Permease 


0868 


(CT727) 


zntA 


Metal Transpon P-type ATPase 


0279 






Possible ABC Transporter Permease Protein 


0543 


(CT417) 




(Metal Transport Protein) 


0692 


(CT684) 




ABC Transporter 


0542 


(CT416) 




ABC Transporter ATPase 


0690 


(CT686) 




ABC Transporter Membrane Protein 


0541 


(CT415) 




solute binding protein 



Type-Ill Secretion 

0323 (CT090) 

0324 (CT089) 
0811 {CT576) 
1021 (CT862) 

0325 (CT088) 
0702 (CT674) 
0828 (CT559) 
0826 (CT561) 
0707 (CT669) 
0825 (CTS62) 
0824 (CT563) 
0823 (CT564) 
0322 (CT09I) 



IcrO 

IcrE 

IcrH.I 

lcrH_2 

sycE 

yscC 

yscl 

yscL 

yscN 

yscR 

yscS 

yscT 

yscU 



Low Calcium Response D 

Low Calcium Response E 

Low Ca Response Protein HJ 

Low Calcium Response^ 

Secretion Chaperone 

Yop C/Gen Secretion Protein D 

Yop Translocation J 

Yop Translocation L 

Yop N (FUgellar-Type ATPase) 

Yop Translocation R 

YopS Translocation Protein 

YopT Tranlocation T 

Yop Translocation Protein U 



Central Intermediary Metabolism 



Glycogen Metabolism 

0856 (CT715) 

0948 (CT798) 

0475 (CT866) 

0607 (CT489) 

0307 (CT248) 

0388 (CT042) 



8lgA 
glgB 
glgC 
glgP 
glgX 



UDP-Glucose Pyrophosphorylise 
Glycogen Synthase 
Glucan Branching Enzyme 
Glucose- 1- P Adenyltnuuferase 
Glycogen Phosphorylasc 
Glycogen Hydrolase (de branching) 
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0326 


(CT087) 


mi!0 


Glucanotransfense 


0851 


(CT710) 


pckA 


Phosphoenolpyruvate Carboxykiruse 


Phosphorous A Sulfur 






0548 


(CT435) 


cysJ 


Sulfite Rcducase 


0920 


(CT774) 


cysO 


Sulfite Synthesis/Biphosphate Phosphatase 


0025 


(CT346) 


atsA 


Sulphohydrolase 


0918 


(CT772) 


ppa 


Inorganic Pyrophosphatase 








DMA Replication. Modification, 


DNA Mismatch Repair 






0505 






3-Methyladenine DNA Glycosylase 


0812 


(CT575) 


mud. 


DNA Mismatch Repair 


094! 


(CT792) 


mutS 


DNA Mismatch Repair 


0402 


(CT107) 


mu(Y 


Adenine Glycosylase 


0732 


{CT625) 


nfo 


Endo nuclease (V 


0837 


(CT697) 


nth 


Enodnuc tease 111 


DNA Modification 






0596 


(CT477) 


ada 


Methyltransferase 


0114 


(CT024) 


hemK 


A/G-speciftc Methylase 


0891 


(CT748) 


mfd 


Transcription- Repair Coupling 


0620 


(CT501) 


ruvA 


Holliday Junction Helicase 


0390 


(CT040) 


ruvB 


Holliday i unction Helicase 


062! 


(CTS02) 


mvC 


Crossover Junction Endonuclease 


0053 


(CT298) 


snru 


Sms Protein 


0773 


(CT607) 


ung 


Uracil DNA Glycosylase 


1062 


(CT329) 


xseA 


Exodoxyribonuclease VI! 


DNA Recombination 






0762 


(CT650) 


rccA 


RecA Recombination Protein 


0738 


(CT639) 


recB 


Exodcoxyribonuclease V. Beta 


0737 


(CT640) 


recC 


Exodeoxyribonucleasc V. Gamma 


0123 


(CT033) 


recDJ 


Exodeoxyribonuclease V (Alpha Subunit) J 


0752 


(CT652) 


recD_2 


Exodeoxyribonuclease V. Alpha_2 


0339 


(CT074) 


recF 


ABC Superfamily ATPasc 


0340 


(CT074) 




(frame-shift with 0339) 


0563 


(CT447) 


reel 


ssDNA E*onuclease 


0299 


(CT240) 


rrcR 


Recombination Protein 


DNA Replication 






0309 


(CT250) 


dnaAJ 


Replication Initiation Protein. 1 


0424 


(CT275) 


dnaA_2 


Replication Initiation Factor J 


0616 


(CT497) 


dnaB 


Replkative DNA Helicase 


0666 


(CT545) 


dn*E 


DNA Pol III Alpha 


0942 


(CT794) 


dniG 


DNA Primaac 


0338 


(CT075) 


dnaN 


DNA Pol III (Beta) 


0410 


(CT261) 


draQJ 


DNA Pol III Epstlon Chain, 1 


0655 


(CT5J6) 




DNA Pol III Epsiton Chain_2 


0040 


(CT334) 


dnaXj 


DNA Pol III Gamma and TauJ 


0272 


(CT187) 


dnaX_2 


DNA Pol 11 1 Gamma and TauJ 


0149 


(CT146) 


dnU 


DNA Ugase 


0274 


(CT189) 


gyrA_l 


DNA Gyrase Subunit A J 


0716 


(CT660) 


gyrA.2 


DNA Gyrase Subunit A_2 


0275 


(CT190) 


gyrB.l 


DNA Gyrase Subunit B 1 


0715 


(CT661) 


gyrB_2 


DNA Gyrase Subunit B_2 


0416 


(CT267) 


himD 


Integration Host Factor Alpha 


0612 


(CT493) 


polA 


DNA Polymerase 1 


0924 


(CT778) 


pnA 


Primosomal Protein N* 


0386 


(CT044) 


ssb 


SS DNA Binding Protein 



58 



WO0027994 ffile:/AVdcwas03\firmdata\lp\FolevPat\PatentDocuments\WO0027994.c pcl 



Page 61 of 330 



WO 00/27994 



PCTAJS99/26923 





0835 (CT555) 


SWl/SWF family helicasej 




0849 (CT708) 


SWI/SNF family helicasej 




0769 (CT643) BpA 


DNA Topoisomcnse l-Fused to SWI Domain 




0024 (CT347) xerC 


Integiase/recombinase 


5 


1024 (CT864) xerD 


Imegrase/recombinase 




Eukaryoiic~Typ* Chromatin Factor* 




0886 (CT743) hctA 


Histone-Like Developmental Protein 




0384 (CT046) hciB 


Histone-like Protein 2 




0878 (CT737) 


SET Donuin protein 


10 


0577 (CT460) 


SWIB ( YM74) Complex Protein 




UVR Exinvclcose Repair System 






0096 (CT333) uvrA 


Excinuclease ABC Sub unit A 




0801 (CT586) uvtB 


Exinuclease ABC Subunit B 




0940 (CT791) uvtC 


Excinuclease ABC. Subunit C 


15 


0772 (CT608) uvtD 


DNA Heltcase 






Energy MeUbolhm 




Aerobic 






0855 (CT7I4) gpdA 


Glycerol-3-P Dehydrogenise 


20 


0743 (CT634) nqrA 


Ubiquinone Oxidoreductase. Alpha 




0427 (CT278) nqr2 


NADH (Ubiquinone) Dehydrogenase 




0428 (CT279) nqrt 


NADH (Ubiquinone) Oxidoreductase. Gamma 




0429 (CT280) nqr4 


NADH (Ubiquinone) Reductase 4 




0430 (CT281) nqr5 


NADH (Ubiquinone) Reductase 5 


25 


0883 (CT740) nqr6 


Phenolhydrolase/NADH (Ubiquinone) Oxidoreductase 6 




ATP Biogenesis and metabolism 






0351 (CT065) adtj 


ADP/ATP Translocase J 




0614 (CT495) xdt_2 


A DP/ATP Translocase.2 




0088 (CT308) «pA 


ATP Synthase Subunit A 


30 


0089 (CT307) itpB 


AT? Synthase Subunit B 




0090 (CT306) ttpD 


ATP Synthase Subunit D 




0086 (CT3I0) atpE 


ATP Synthase Subunit E 




0091 (CT305) «pl 


ATP Synthase Subunit I 




0092 (CT304) aq>K 


ATP Synthase Subunit K 


35 


0860 (CT719) flif 


Flagellar M-Ring Protein 




Electron Transport Chain 






0102 (CT013) cydA 


Cytochrome Oxidase Subunit 1 




0103 (CT014) cydB 


Cytochrome Oxidase Subunit 11 




0364 (CT059) 


Ferredoxin 


40 


0084 (CT3 12) 


predicted Ferredoxin 




Glycolysis <fi Cluconeogenesis 






0281 (CT215) dhnA 


Predicted 1.6-Fructose Biphosphate Aldolase 




0800 (CT587) cno 


Enotase 




0624 (CT505) gapA 


Glyceraldehyde-3-P Dehyrogenase 


45 


0056 (CT295) mnA 


Phosphomannomutase 




0967 (CT815) pgm 


Phosphoglucomutase 




0160 (CT207) pfkAJ 


Fruetose-6-P Phosphotransferase. 1 




0208 (CT205) pflcA_2 Fructose-6-P Phosphotransferase 2 




1025 (CT378) pgi 


Glucose -6 -P Isomerase 


50 


0679 (CT693) pgk 


Phosphoglycerate Kinase 




0863 (CT722) pgmA 


Phosphoglycerate Mutase 




0097 (CT332) pyk 


pyruvate Kinase 




1063 (CT328) tpiS 


Triosephosphate Isomerase 




Pentose Phosphate Pathway 




55 


0239 (CT186) devB 


Glucose-6-P Dehyrogenase (DevB family) 




1060 (CT331) dxs 


Transketolasc 
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0360 


(CT063) 


gnd 


6-Pnosphogluconate uenyarogen»*c 




0185 


(CTI21) 


rpe 


ruoulOSC-r cpirnerasc 




0141 


(CT213) 


rpiA 


nJOOSCO'r iwropMS n 




0083 


(CT313) 


tai 


i fiminwiiw 


c 
J 


0893 


(CT750) 


Ocifl 


Trsnsketolasc 




0238 


(CTI85) 


zwf 


niiKou«6»P Denvrosenase 




Pyruvate Dehydrogenase 






0833 


(CT557) 


IpdA 


uipoimioc ucnyuivg»n»»t 




0436 


(CT285) 


IplAj 


Lipoate Protein Ligiie-Like Protein 


i n 

1U 


0618 


(CT499) 


lplA_2 


LipOlLc -rTuLCin LljfJJC A 




0033 


(CT340) 


pdhA&B 


Oxoiso valerate Dehydrogenase a4J Fusion 




0304 


(CT245) 


pdhA 


Pyruvate Dehydrogenase Alpha 




0305 


(CT246) 


pdhB 


Pyruvate Dehydrogenase Ben 




0306 


(CT247) 


pdhC 


Dthydrolipoamide Aceryltransferase 


15 


TCA Cycle 








0495 


(CT390) 


aspC 


Aspartate Aminotransferase 




1013 


(CT855) 


fumC 


Fumarate Hydritase 




1028 


(CT376) 


mdhC 


Malite Dehyrogenase 




0789 


(CT592) 


sdhA 


Succinate Dehydrogenase 


20 


0790 


(CT591) 


sdhB 


Succinate Dehydrogenase 




0788 


(CT593) 


sdhC 


Succinate Dehydrogenase 




0378 


(CT054) 


SUCA 


Oxoglutarate Dehydrogenase 




0377 


(CT055) 


sucB_l 


DDiydrolipoamide Succinylrjaniferase J 




0527 


(CT400) 


sucB_2 


Dthydrolipoamide Succinyiowsferase.2 


25 


0973 


(CT82I) 


sucC 


Succinyl-CoA Synthetase. Beta 




0974 


(CT822) 


sucD 


SuccinyUCoA Synthetase, Alpha 



Protein Folding, Aitembly & Modification 





Chaperones 






30 


0949 


(CT799) 


etc 


General Stress Protein 




0534 


(CT407) 


dksA 


DnaK Suppressor 




0032 


(CT341) 


dnaJ 


Heat Shock Protein J 




0503 


(CT396) 


dnaK 


Hsp-70 




0134 


(CTU0) 


groELJ 


Hsp-60J 


35 


0777 


(CT604) 


groEL.,2 


Hsp-60.2 




0898 


(CT755) 


groEL_3 


Hsp-60_3 




0135 


(CTMD 


groES 


lOKDa Chaperonin 




0502 


(CT395) 


grpE 


HSP-70 Cofactor 




0661 


(CT541) 


rnrp 


FKBP-type Pepudyl-prolyl Cis-Trans U 


40 


proteases 










0144 


(CT113) 


clpB 


Clp Protease ATPase 




0437 


(CT286) 


clpC 


ClpC Protease 




0520 


(CT431) 


clpPJ 


CLP Protease 




0847 


(CT706) 


clpP.2 


CLP Protease Sub unit 


45 


0846 


(CT705) 


crpX 


CLP Protease ATPase 




0269 


(CT138) 




Dipeptidase 




0998 


(CT841) 


ftsH 


A TP -dependent Zinc Protease 




0030 


(CT343) 


gcp.l 


O-Sialoglycoprotem EndopeptidaseJ 




0194 


(CT197) 


vvJ 


OSialoglycoprotein Endopeptidase_2 


50 


0979 


(CT823) 


htrA 


DO Serine Protease 




0957 


(CT806) 


ide 


Insulinase family/Protease III 




0027 


(CT344) 


ion 


Lon ATP-dependent Protease 




1017 


(CT859) 


lytB 


Metalloprotease 




1009 


(CT851) 


map 


Methionine Aminopeptidaac 


55 


0385 


(CT045) 


pepA 


Leucyl Aminopepndase A 




0136 


(CT1 12) 


pepF 


Oligo peptidase 
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0813 (CT574) pepP Aminopeptidase P 

0613 (CT494) sohB Protease 

0555 (CT44I) tsp Tail-Specific Protease 

0344 (CT072) yicL Metalloprotease 

0981 (CT824) Zinc Metalloprotease (insulinase family) 

Protein liomerost* 

0227 (CT176) dsbB DUul fide bond Oxidoreductase 
0786 (CT595) dsbD Thio:disulfide Interchange Protein 

0228 (CTI77) dsbG Disulfide Bond Chaperone 

0933 (CT783) Predicted Disulfide Bond Isomerase 

0926 (CT780) Thioredoxin Disulfide Isomerase 
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Transcription 



RNA 





0999 


(CT842) 


pnp 


Polyribonucleotide Nucleotidyltransferase 


5 


0054 


(CT297) 


mc 


Ribonucleue III 




0119 


(CT029) 


mhB_l 


Ribonuclease HI1_1 




1068 


(CT008) 


mhB_2 


Ribonucleue HIM 




0934 


(CT784) 


mpA 


Ribonucleue P Protein Component 




0504 


(CT397) 


vicB 


Ribonucleue Family 


If) 


RNA Elongation <fc Termination 






0741 


(CT636) 


greA 


Tnnirrirrrion ElonBition Factor 




0316 


(CT097) 


nusA 


N tlttlinmn Pmtrm A 




0076 


(CT320) 


nusC 


TnntrnnMWil A tittf »HI MWJtiftll 
1 luiKnpuwuu fMiiiiciiiiiiMuui) 




0845 


(CT704) 


pcnBJ 


r viy r\ roiyiuciuE i 


i s 
1 J 


0966 


(CT410) 


pcnB_2 


Poly A Polymerase^ 




0610 


(CT491) 


rho 


Transcription Termination factor 




RNA MethyUu€S 








0674 


(CT553) 


fmu 


RNA Meihyltransferase 




1059 


(CT354) 


kgsA 


Dimethyl adenosine Transferase 


20 


0187 


(CT133) 




Predicted Methylue 




0530 


(CT403) 


spoUJ 


rRNA Methylue J 




0660 


(CT540) 


spoU_2 


rRNA Methylasc.2 




0117 


(CT027) 


trmD 


tRNA (Guanine N-1)-M«hyltransferase 


25 


0885 


(CT742) 


ygcA 


rRNA Methyl transferee 


0986 


(CT829) 


yg»H 


Predicted rRNA Methylue 




0987 


(CT830) 


ytgB 


Predicted rRNA Methylase 



RNA Modification 





0649 


(CT530) 


fmt 


Methionyl tRNA Formy I transferase 




0910 


(CT766) 


miaA 


tRNA Pyrophosphate Transferase 


30 


0719 


(CT658) 


sfhB 


Predicted Pseudo uridine Synthase 




0219 


(CT193) 




Queuine tRNA Ribosyt Transferase 




0580 


(CT463) 


tmA 


Pseudouridyiate Synthase I 




0319 


(CT094) 


truB 


tRNA Pseudo undine Synthase 




0403 


(CT106) 


yceC 


Predicted Pseudouridine Synthetase Family 


35 


0864 


(CT723) 


yjbc 


Predicted Pseudouridine Synthase 




RNA Polymerase <S Transcription Regulators 




0586 


(CT468) 


atoC 


Two-Component Regulator 




0362 


(CT061) 


rpsO 


Sigma-28/WhiG Family 


40 


0501 


(CT394) 


hrcA 


HTH Transcriptional Repressor 


0793 


(CT588) 


rbsU 


Sigma Regulatory Family Protein— PP2C Pi 




0626 


(CT507) 


rpoA 


RNA Polymerase Alpha 




0081 


(CT315) 


rpoB 


RNA Polymerase Beta 




0082 


(CT314) 


rpoC 


RNA Polymerase Beta' 


45 


0756 


(CT615) 


rpoD 


RNA Polymerase Stgma-66 


0771 


(CT609) 


rpoN 


RNA Polymerase Sigma-54 




0511 


(CT424) 


rabV_i 


Sigma Regulatory Factor J 




0909 


(CT765) 


nbV_2 


Sigma Factor Regulator_2 




0670 


(CT549) 


rabW 


Sigma Regulatory Factor-Hisridme Kinase 


50 


0750 


(CT630) 


tctO 


HTH Transcriptional Regulatory Protein ♦ 1 


1069 


(CT009) 


yfgA 


HTH Transcripnonal Regulator 



Amino Acyi tRNA Synthesis 
0892 (CT749) alaS 
55 0570 (CT454) argS 

0662 (CT542) upS 



Alanyl tRNA Synthetase 
Arginyl tRNA Transferase 
Aspartyl tRNA Synthetase 



Translation 
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0932 


(CT782) 


cysS 


Cysteinyl iRNA Synthetase 


0003 


(CT003) 


gatA 


Glu (RNA Gin Amidotransferase (A subunit) 


0004 


(CT004) 


g«B 


Glu iRNA Gin Amidotransferase (B Subunit) 


0002 


(CT002) 


g»tC 


Giu (RNA Gin Arnidonansferase (C subunit) 


0560 


(CT445) 


gltf 


GlutamyMRNA Synthetase 


0946 


(CT796) 


8>y<3 


Glycyl tRNA Synthetase 


0663 


(CT543) 


htsS 


HUtidy! (RNA Syntheuse 


0109 


(CT019) 


ileS 


IsoleucyUtRNA Synthetase 


01S3 


(CT709) 


leuS 


Leucyl iRNA Synthetase 


0931 


(CT781) 


lysS 


Lysyl tRNA Synthetase 


0122 


(CT032) 


metG 


Methionyl-iRNA Syntheuse 


0993 


(CT836) 


pheS 


Pnenylalanyl uv»A ^yntneuse, Aipna 


0594 


(CT475) 


pheT 


Pnenylalanyl tKJNA synthetase oeu 


0500 


(CT393) 


proS 


Prolyl tRNA Syntheuse 


0870 


(Cn29) 


serS 


Scry! tRNA Synthetase^ 


0806 


(CT581) 


thrS 


Threonyl tRNA Syntheuse 


O802 


(CT585) 


trpS 


Tryptophany! (RNA Syntheuse 


0361 


(CT062) 


tyrS 


Tyrosyl (RNA Syntheuse 


0094 


(CT302) 


valS 


Valyl tRNA Syntheuse 


eptide Chain Initiation. Eiongat, 


ion & Termination 


1067 


(CT353) 


oef 


Polypeptide Deformylase 


0184 


(CTI22) 


efp.l 


Elongation Factor P_l 


0895 


(CT752) 


efp.2 


Elongation Factor P_2 


0550 


(CT437) 


fu»A 


Elongation Factor G 


0073 


(CT323) 


infA 


Initiation Factor IF* 1 


0317 


(CT096) 


inffi 


Initiation Factor-2 


0990 


(CT833) 


infC 


Irddation Factor 3 


0113 


(CT023) 


pfrA 


Peptide Chain Releasing Factor 1 


0576 


(CT459) 


prfB 


Peptide Chain Release Factor 2 


0950 


(CT800) 


plfa 


Pepcidyt tRNA Hydrolase 


0318 


(CT095) 


rbfA 


Ribosome Binding Factor A 


0699 


(CT677) 


ITf 


Ribosome Releasing Factor 


0697 


(CT679) 


tsf 


Elongation Factor TS 


0074 


(CT322) 


tufA 


Elongation Factor Tu 


Ubosomai Pro twins 






0078 


(CT318) 


rll 


LI Ribosomal Protein 


0644 


(CT525) 


rl2 


L2 Ribosomal Protein 


0647 


(CT528) 


rl3 


L3 Ribosomal Protein 


0646 


(CT527) 


rl4 


L4 Ribosomal Protein 


0635 


(CT516) 


ri5 


L5 Ribosomal Protein 


0633 


(CT5M) 


rl6 


L6 Ribosomal Protein 


0080 


(CT316) 


rl7 


L7/LI2 Ribosomal Protein 


0953 


(CT803) 


r!9 


L9 Ribosomal Protein 


0079 


(CT317) 


rllO 


L10 Ribosomal Protein 


0077 


(CT319) 


rill 


LI 1 Ribosomal Protein 


0247 


(CTI25) 


rll 3 


LI 3 Ribosomal Protein 


0637 


(CT5I8) 


rtl4 


LI4 Ribosomal Protein 


0630 


(CT5II) 


rll5 


LI 5 Ribosomal Protein 


0640 


(CT52I) 


rll 6 


LI 6 Ribosomal Protein 


0625 


(CT506) 


rt!7 


LI 7 Ribosomal Protein 


0632 


(CT513) 


rl18 


LI 8 Ribosomal Protein 


0118 


(CT028) 


rll9 


LI 9 Ribosomal Protein 


0992 


(CT835) 


H20 


L20 Ribosomal Protein 


0546 


(CT420) 


r!2l 


L2I Ribosomal Protein 


0642 


(CTS2J) 


ri22 


122 Ribosomal Protein 


0645 


(CT526) 


rt23 


L2J Ribosomal Protein 
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0636 


(L T 517) 


• IT 4 


L~i* rUDOSOmai rTUiCin 




tCTAlQ\ 


rl27 


1 *J "J nkntAmtl llHll^in 




luooi 


rl28 


1 "7 B B'iKmaimI Pibmih 
L-*o inIDOMJITmI nvwin 


UOJV 


IV. 1 5*U) 


ri29 


f ?0 Rit-wxnrrwl PtntPin 
Li" IslDUSOnwJ n«win 


Ann 
Ul 1 i 




ri3l 




0961 


(CT810) 


(132 






\\. 1 1 5U ) 


nj j 




0935 


(V-T7B5) 




L~5*» KlDOSOflUl rroiCin 


0991 


(CT834) 


r!35 


L3S Ribosonul Protein 


0936 


(1,1780) 


ruo 


LJO rUOOSOfTUi rrOlCin 


0315 


(CT098) 


rsl 


51 nJDOSomil rrgion 


0696 


(CT680) 


r$2 


S2 Ribosonul Protein 


0641 


(CT522) 


rx3 


S3 Ribosonul Protein 


0733 


(CT626) 


rs4 


$4 Ribosonul Protein 


0631 


(CT512) 


rs5 


S5 Ribosonul Protein 


09S1 


(CT801) 


rs6 


S6 Ribosonul Protein 


0551 


(CT438) 


13? 


S7 Ribosonul Protein 


0634 


(CT515) 


rj8 


PA P% '1 1 gfc _• . 

S8 Ribosonul Protein 


0246 


(CT126) 


rs9 


S9 Ribosonul Protein 


0549 


(CT436) 


ralO 


SI0 Ribosonul Protein 


0627 


(CT508) 


rsl 1 


S 1 1 Ribosonul Protein 


0552 


(CT439) 


rs!2 


SI 2 Ribosonul Protein 


0628 


(CT509) 


nl3 


SI 3 Ribosonul Protein 


0937 


(CH87) 


R14 


SI4 Ribosonul Protein 


1000 


(CT843) 


nlS 


SI5 Ribosonul Protein 


0116 


(CT026) 


rsl6 


SI6 Ribosonul Protein 


0638 


(CT519) 


rsl7 


SI 7 Ribosonul Protein 


0952 


(CT802) 


B18 


SI8 Ribosonul Protein 


0643 


(CT524) 


rs!9 


SI9 Ribosonul Protein 


0754 


(CT617) 


ra20 


S20 Ribosonul Protein 


0031 


(CT342) 


rs21 


ail luoosomai rromn 








Other Citrgories 


Chiamydia-Speeific Prottins 




0561 


(CT446) 


Euo 


CU\ DC Ciin p_ M< — 

Lnira cuo rrotein 


0804 


(CT583) 


Gp6D 


nlil TP PlacmiH Panlnii 

\_rvLi k riismio ran tog 


0186 


(CTH9) 




Aimiiartiy to inc a_ i 


0291 


(CT232) 


incB 


Inclusion Membrane Protein B 


0292 


(CT233) 


incC 


inclusion Ptcimmini rnjicin v» 


1026 


(CT377) 




1 hi A PmiaiH 


0333 


(CT080) 






0005 


(CT87I) 


pmp_l 


Pmlumnftihi^ Oiit^r Mnihwiw Pfftfpin CI Pnmilv 
ruiyiiKjrpniu \s utt t mcrooiiRt riwisiii \j r »uuiy 


0013 


(CT871) 


pmp_2 


Pnlvmnmhie Outer Mrmhnnc Protein Ci Fimilv 


0014 


(CT871) 


pmp_3 


Plotx n mmuJii 4* Ontj»T M*mhnrp Pmffin fi Pamilv 
4 viyinvrpjitw wiitfi iviwtjiuidic r iwwhi vj r*uiny 


0015 


(CT871) 


pmp_3 


PMP 3 f fr»me-ihift with 001 4^ 


0016 


(CT874) 


pmp_4 




0017 


(CT87I) 


pmp_4 


PMP 4rftinu.thift unthfl016) 


0018 


(CT874) 


pmp_5 


Polymorphic Outer Membrane Protein G Family 


0019 


(CT871) 


pmp_5 


PMP_5 (frame-shift with 0018) 


0444 


(CT871) 


pmp_6 


Polymorphic Outer Membrane Protein G/I Family 


0445 


(CT871) 


pmp_7 


Polymorphic Outer Membrane Protein G Family 


0446 


(CT871) 


pmp_8 


Polymorphic Outer Membrane Protein G Family 


0447 


(CT871) 


pmp_9 


Polymorphic Outer Membrane Protein G/l Family 


0450 


(CT871) 


pmp_10 


Polymorphic Outer Membrane Protein G Family 


0449 


(CT871) 


pmpJO 


PMP. 10 (Frame-shift with 0450) 
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20 



0451 fCT87l) pmpjl 

0452 (CT874) pmp_l2 

0453 <CT87t) pmp_U 

0454 (CT872) pmp_l4 

0466 (CT869) pmpj5 

0467 (CT869) pmpj« 

0468 fCT869) pmpj7 

0469 (CT869) pmpj7 

0470 (CT869) pmpJ7 

0471 fCT870) pmpJ8 

0539 (CT412) pmpJ9 

0540 (CT413) pmp_20 
0963 (CT812) pmp_21 
0562 
0927 
0928 
0929 

0728 (CT622) 

0729 (CT623) 
0133 (CTI09) 
0332 (CT08I) 

Miscellaneous En^mexKonserved Pratt ins 



Polymorphic Outer Membrane Protein G Family 
PorymoTphic Outer Mem brine Protein (crunched) A/1 Family 
Polymorphic Outer Membrane Protein G Family 
Polymorphic Outer Membrane Protein H Family 
Polymorphic Outer Membrane Protein E Family 
Polymorphic Outer Membrane Protein E Family 
Polymorphic Outer Membrane Protein E Family 
PMPJ7 (Frame-shift with 0468) 
PMPJ 7 (Frame-shift with 0469) 
Polymorphic Outer Membrane Protein E/F Family 
Polymorphic Membrane Protein A Family 
Polymorphic Membrane Protein B Family 
Polymorphic Membrane Protein O Family 
CHIPS 43 kDa Protein Homolog J 
CHLPS 43 kDa Protein Homolog_2 
CHL?S 43 kDa Protein Homolog J 
CHU'S 43 kDa Protein Homology 
CHU'N 76kDa Homolog J (CT622) 
CHLPN 76kDa Homolog_2 (CT623) 
CHU'S Hypothetical Protein 
CHL' R T2 Protein 





0193 




argR 


Possi le Arginine Repressor 




1046 






Aron abc Amino Acid Hydroxylase 


25 


0232 






Similarity to S'-Methylthioadenosine Nucleosidase 




0128 


(CT035) 




Btotin Protein Ligasc 




0513 


(CT426) 




Fe-S Oxidoreductase_l 




0911 


(CT767) 




Fe-S Oxidoreductase.2 




0373 


(CT057) 


gcpE 


GcpE Protein 


30 


0407 


(CTI03) 




HAD Superfamily Hydrolase/Phosphatase 




0917 


(CT77I) 




Hydrolase/Phosphatase Homolog 




0488 


(CT385) 


ycfF 


HIT Family Hydrolase 




0701 


(CT675) 


karG 


Arginine Kinase 




0526 


(CT399) 


kpsF 


GutQ/KpsF Family Sugar- P Isomerase 


35 


0919 


(CT773) 


Idh 


Leucine Dehydrogenase 




0022 


(CT349) 


maf 


Maf protein 




0997 


(CT840) 


mesJ 


PP-loop superfamily ATPase 




0151 


(CTI48) 


mhpA 


Monooaygenase 




0730 


(CT624) 


mviN 


Integral Membrane Protein 


40 


0861 


(CT720) 




NifU-Related Protein 




0479 


(CT380) 


phnP 


Metal Dependent Hydrolase 




0106 


(CT015) 


phoH 


ATPase 




0329 


(CT084) 




Phopholtpase D Superfamily 




0435 


(CT284) 




Phospholipase D Superfamily 


45 


0581 


(CT464) 




Phosphoglycolate Phosphatase 




0897 


(CT754) 




Predicted PhosphohydroUse 




0509 


(CT422) 




Predicted Metalloenzyme 




1030 


(CT375) 




Predicted D- Amino Acid Dchyrogenase 




0531 


(CT404) 




SAM Dependent Methyl transferase 


50 


0337 


(CT076) 


smpB 


Small Protein B 




0394 


(CT256) 


UyC.l 


CBS Domain Protein (Hemolysin Homolog)_l 




0510 


(CT423) 


UyC.2 


CBS Domains (Hemolysin Homolog)_2 




0382 


(CT048) 


yabC 


SAM -Dependent Meihytransferase 




0787 


(CT594) 


yabD 


PHP Superfamily (Urease/Pyrimidinase) Hydroli 


55 


0611 


(CT492) 


yacE 


Predicted Phosphatase/Kinase 




0579 


(CT462) 


yacM 


Sugar Nucleotide Phosphorylasc 




0578 


(CT46I) 


yae( 


Phosphohydrolase 
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0345 


(CT07I) 


yaeM 


CT07I Hypothetical Protein 


0S66 


(CT450) 


yaeS 


YaeS family Hypothetical Protein 


0591 


(CT472) 


yagE 


YagE family 


0039 


(CT335) 


ybxB 


YbaB family Hypothetical Protein 


0101 


(CT012) 


ybbP 


YbbP family Hypothetical Protein 


091S 


(CT769) 


ybeB 


tojap Superfamily Onholog 


0U7 


(CTI08) 


ybgl 


ACR family 


0529 


(CT402) 


ycaH 


ATPase 


0438 


(CT287) 


ycbF 


PP-loop Superftmily ATPase 


0734 


(CT627) 


yc«A 


YceA Hypothetical Protein 


0954 


(CT804) 


ychB 


Predicted Kinase 


0261 


(CT217) 


ydaO 


PP-Loop Superfamily ATPase 


0245 


(CT127) 


ydhO 


Polysaccharide Hydrolase -In vasin Repeat Family 


0573 


(CT457) 


yebC 


YebC Family Hypothetical Protein 


0689 


(CT687) 


yfhOj 


NifS* related Amino transferase J 


0862 


(CT721) 


yfhO_2 


Ni(S*related Aminotransferase_2 


0547 


(CT434) 


ygbB 


YgbB Family Hypothetical Protein 


0237 


(CT184) 


yggF 


YggF Family Hypothetical Protein 


0775 


(CT606) 


yggv 


YggV Family Hypothetical Protein 


0396 


(CT258) 


yhfOJ 


NifS-iclatcd Aminotransferase^ 


0605 


(CT487) 


yhhf .. 


Predicted Methylase 


0575 


(CT458) 


yhhY 


Amino Croup Acetyl Transferase 


0592 


(CT473) 


yidO 


YidD Family 


0982 


(CT825) 


yigN 


YigN Family Hypothetical Protein 


0657 


(CT537) 


yjeE 


YjeE Hypothetical Protein 


0768 


(CT644) 


yohj 


Yohl Predicted Oxidoreductase 


0336 


(CT077) 


yojL 


YojL Hypothetical Protein 


0217 


(CTUO) 


ypdP 


YpdP Hypothetical Protein 


0140 


(CT212) 


yqd E 


YqdE Hypothetical Protein 


0263 


(CT221) 


yqfu 


YqfU Hypothetical Protein 


0139 


(CT211) 


yqgE 


YqgE Hypothetical Protein 


0270 


(CT137) 


ywlC 


SuA5 Superfamily- related Protein 


0879 


(CT738) 


yycJ 


Metal Dependent Hydrolase 








Homologs to CHLTR Hypothetii 


0001 


(CT001) 


CT001 Hypothetical Protein 


0020 


(CT351) 


CT35I Hypothetical Protein 


0021 


(CT350) 


CT350 Hypothebcat Protein 


0026 


(CT345) 


CT345 Hypothetical Protein 


0035 


(CT339) 


CT339* 


lypothetical Protein 


0036 


(CT338) 


CT338 Hypothetical Protein 


0055 


(CT296) 


CT296I- 


lypothetical Protein 


0062 


(CT289) 


CT289 Hypothetical Protein 


0065 


(CT288) 


CT288 Hypothetical Protein 


0068 


(CT360) 


CT360 Hypothetical Protein 


0071 


(CT325) 


CT325 Hypothetical Protein 


0072 


(CT324) 


CT324 Hypothetical Protein 


0085 


(CT31I) 


CT31 1 Hypothetical Protein 


0087 


(CT309) 


CT309 Hypothetical Protein 


0093 


(CT303) 


CT303 Hypothetical Protein 


0100 


(CT011) 


CT01 1 Hypothetical Protein 


0104 


(CT017) 


CT0I7 Hypothetical Protein 


0105 


(CT0I6) 


CT016 Hypothetical Protein 


0107 


(CT058) 


CT058 Hypothetical Protein 1 


0108 


(CT018) 


CT018 Similarity 


out 


(CT021) 


CT02I Hypothetical Protein 


0121 


(CT031) 


CT03I Hypothetical Protein 
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0129 


(CT036i 


0M5 


(CT1M) 


0150 


(CTI47) 


0152 


(CT149) 


0176 


(CTI53) 


0188 


(CTI32) 


0189 


(CT131) 


0206 


(CT203) 


0229 


(CT178) 


0230 


(CT179) 


0234 


(CT181) 


0249 


(CT151) 


0253 


(CTI44) 


0254 


(CTI43) 


0255 


(CT142) 


0256 


(CT144) 


0257 


(CTI43) 


0259 


(CT142) 


0276 


(CT19I) 


0288 


(CT195) 


0293 


(CT234) 


0301 


(CT242) 


0303 


(CT244) 


0308 


(CT249) 


0312 


(CT101) 


0328 


(CT085) 


0330 


(CT083) 


0331 


(CT082) 


0334 


(CT079) 


0342 


(CT073) 


0343 


(CT073) 


0350 


(CT066) 


0369 


(CT058) 


0370 


(CT058) 


0374 


(CT056) 


0379 


(CT053) 


0381 


(CT326) 


0383 


(CT047) 


0387 


(CT043) 


0389 


(CT041) 


0393 


(CT03B) 


0395 


(CT257) 


0399 


(CT253) 


0400 


(CT254) 


0401 


(CT255) 


0405 


(CT105) 




(CTI02) 


0409 


\\~ 1 iO*J) 


0411 


(CT262) 


0412 


(CT263) 


0415 


(CT266) 


0420 


(CT271) 


0422 


(CT273) 


0423 


(CT274) 


0425 


(CT276) 


0426 


(CT277) 


0434 


(CT283) 



CT036 SimiUricy 
CTI14 Hypothetical Prolan 
CT147 Hypoibetical Protein 
CT149 Hypothetical Protein 
CT153 Hypothetical Protein 
CT132 Hypothetical Protein 
CT131 Hypothetical Protein 
CT203 Hypothetical Protein 
CT178 Hypoibetical Protein 
CT179 Hypothetical Protein 
CT 1 8 1 Hypothetical Protein 
CT151 Hypothetical Protein 
CT144 Hypoihedcal Protein J 
CT143 Hypothetical Protein J 
CT 1 42 Hypothetical Protein J 
CT144 Hypothetical Protein_2 
CT143 Hypothetical Protein_2 
CT142 Hypothetical Protem_2 
CT191 Hypothetical Protein 
CT19S Hypothetical Protein 
CT234 Hypothetical Protein 
CT368 Hypothetical Protein 
CT244 Hypothetical Protein 
CT249 Similarity 
CT10I Hypothetical Protein 
CT085 Hypothetical Protein 
CT083 Hypothetical Protein 
CT082 Hypothetical Protein 
CT079 Similarity 
CT073 Hypothetical Protein 
(frame-shin with 03427) 
CT066 Hypothetical Protein 
CT058 Hypothetical Prolan J 
CT058 Hypothetical Protein.3 
CT056 Hypothetical Protein 
CT053 Hypoihetical Protein 
CT326 Similarity 
CT047 Hypoihetical Protein 
CT043 Hypothetical Protein 
CT041 Hypothetical Protein 
CT038 Hypothetical Protein 
CT257 Hypothetical Protein 
CT253 Hypothetical Protein 
CT254 Hypothetical Protein 
CT255 Hypothetical Protein 
CT105 Hypoihetical Protein 
CT102 Hypothetical Protein 
CT260 Hypothetical Protein 
CT762 Hypothetical Protein 
CT263 Hypothetical Protein 
CT266 Hypothetical Protein 
CT27I Hypothetical Protein 
CT273 Hypothetical Protein 
CT274 Hypothetical Protein 
CT276 Hypothetical Proteins 
CT277 Similarity 
CT283 Hypoihetical Protein 
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0441 


(CT007) 


CT007 Hypothetical Protein 


0442 


(CT006) 


CTD06 Hypothetical Protein 


0443 


(CT005) 


CT00S Hypothetic al Protein 


0474 


(CT365) 


CT36S Hypothetical Protein 


0476 


(CT865) 


CT86S Hypothetical Protein 


0480 


(CT383) 


CT383 Hypothetical Protein 


04S5 


(CT382) 


CD 82.1 Hypothetical Protein 


0487 


(CT384) 


CT384 Hypothetical Protein 


0489 


(CT386) 


CD 86 Hypothetical Protein 


0490 


(CT387) 


CT387 Hypothetical Protein 


0491 


(CT389) 


CT389 Hypothetical Protein 


0496 


(CT391) 


CD 91 Hypothetical Protein 


0497 


(CT388) 


CT388 Hypothetical Protein 


0506 


(CT421) 


CT421 Hypothetical Protein 


0507 


(CT421) 


CT421.I Hypothetical Protein 


0508 


(CT421) 


CT421.2 Hypothetical Protein 


OS 12 


(CT425) 


CT425 Hypothetical Protein 


0514 


(CT427) 


CT427 Hypothetical Protein 


0518 


(CT429) 


CT429 Hypothetical Protein 


0522 


(CT433) 


CT433 Hypothetical Protein 


0525 


(CT398) 


CD 98 Hypothetical Protein 


0533 


(CT406) 


CT406 Hypothetical Protein 


0537 


(CT814) 


CT814.I Hypothetical Protein 


0538 


(CT814) 


CT814 Hypothetical Protein 


0554 


(CT440) 


CT440 Hypothetical Protein 


0559 


(CT44I) 


CT441.1 Hypothetical Protein 


0565 


(CT449) 


CT449 Hypothetical Protein 


0572 


(CT456) 


CT456 Hypothetical Protein 


0582 


(CT465) 


CT465 Hypothetical Protein 


0583 


(CT466) 


CT466 Hypothetical Protein 


0588 


(CT469) 


CT469 Hypothetical Protein 


0589 


(CT470) 


CT470 Hypothetical Protein 


0590 


(CT471) 


CT471 Hypothetical Protein 


0593 


(CT474) 


CT474 Hypothetical Protein 


0595 


(CT476) 


CT476 Hypothetical Protein 


0601 


(CT483) 


CT483 Hypothetical Protein 


0602 


(CT484) 


CT484 Hypothetical Protein 


0606 


(CT488) 


CT488 Hypothetical Protein 


0609 


(CT490) 


CT490 Hypothetical Protein 


0622 


(CT503) 


CT503 Hypothetical Protein 


0623 


(CT504) 


CT504 Hypothetical Prorin 


0648 


(CT529) 


CT529 Hypothetical Protein 


0658 


(CT538) 


CT538 Hypotheacal Protein 


0667 


(CT546) 


CT 540 Hypothetical Protein 


0668 


(CT547) 


/^Ttil tt n.i,..;. 

CT 34/ Hypothetical Protein 


0669 


(CT548) 


CT548 Hypothetical Protein 


(JO /I 




\~ 1 3 rtypocncncai rroietn 


UO/J 




v. i jjz. rtypoincocai rrotcin 


0675 


(CT696) 


CT696 Hypothetical Protein 


0676 


(CT695) 


CT695 Similarity 


0681 


(CT691) 


CT691 Hypothetical Protein 


0687 


(CT482) 


CT482 Hypothetical Protein 


0688 


(CT48I) 


CT481 Hypothetical Protein 


0700 


(CT676) 


CT676 Hypotheacal Protein 


0705 


(CT671) 


CT671 Hypothetical Protein 


0706 


(CT670) 


CT670 Hypothetical Protein 


0708 


(CT668) 


CT668 Hypothetical Protein 
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0709 


tCT667) 


07|0 


• CT666) 


071 1 


<CT665) 


07|3 


(CT663) 


0717 


JCT656) 


0718 


(CT657) 


0720 


(CT659) 


0722 


(CT654) 


0725 


(CT652) 


0726 


»CT620) 


A771 
U 111 


(CT619) 


A710 


fCT638) 






0746 




0747 


I V, 10J 1 ) 


0751 


(CT65 1 ) 


075 S 


(CT616) 


0760 


ir-Tf, 1 1 \ 
iv- 1 o 1 1 ; 


0761 


(CT610) 


htAA 


IV* IO*Oj 




(CT647) 


U'Oo 


IV, 1 CHOJ 


0767 




0770 


IV.IO**/ 


0774 


(CT606) 


0776 


(CT605) 


0779 


IV. IOUZJ 


0783 








0792 


IV. 1 307) 


nam 






^v» ijouj 


0808 


\V» • v It ) 


0809 


(CT578) 


0810 


fCT5T71 

\V» %3 1 f f 


0814 


(CT573) 


0818 


(CT569) 


0819 


(CT568) 


0820 


(CT567) 


0821 




0822 


(CT565) 


0827 


(CT560) 


0834 


(CT556) 


0840 


(CT700) 


0842 


(CT702) 


0843 


(CT702) 


0852 


(CT711) 


0853 


(CT712) 


0857 


(CT7I6) 


0859 


(CT718) 


0865 


(CT724) 


0869 


(CT728) 


0874 


(CT733) 


0875 


(CT734) 


0884 


(CT74I) 


0887 


(CT744) 


0896 


(CT753) 



CT667 Hypothetical Protein 
CT666 Hypothetical Protein 
CT665 Hypothetical Protein 
CT663 Hypothetical Protein 
CT656 Hypotheocal Protein 
CT657 Hypotheocal Protein 
CT659 Hypotheocal Protein 
CT654 Hypothetical Protein 
CT652.I Hypotheocal Protein 
CT620 Hypothetical Protein 
CT61 9 Hypothetical Protein 
CT368 Hypothetical Protein 
CT635 Hypothetical Protein 
CT632 Hypothetical Protein 
CT631 Hypothetical Protein 
CT651 Hypothetical Protein 
CT6I6 Hypotheti:al Protein 
CT61 1 Hypothetical Protein 
CT610 Hypothetical Protein 
CT648 Hypothetical Protein 
CT647 Hypodieti.al Protein 
CT646 Hypothec al Protein 
CT645 Hypothec al Protein 
CT642 Hypotheocal Protein 
CT606.1 Hypothetical Protein 
CT 605 Hypotheocal Protein 
CT602 Hypothetical Protein 
CT598 Hypothetical Protein 
CT590 Hypotheocal Protein 
CT589 Hypothetical Protein 
CT584 Hypothetical Protein 
CTS80 Hypothetical Protein 
CT579 Hypotheocal Protein 
CT578 Hypothetical Protein 
CT577 Hypothetical Protein 
CT573 Hypothetical Protein 
CT569 Hypothetical Protein 
CT568 Hypothetical Protein 
CT567 Hypothetical Protein 
CT566 Hypothetical Protein 
CT565 Hypothetical Protein 
CT560 Hypothetical Protein 
CTS56 Hypothetical Protein 
CT700 Hypothetical Protein 
CT702 Hypotheocal Protein 
CT702 Hypotheocal Protein 
CT71 1 Hypothetical Protein 
CT7I2 Hypothetical Protein 
CT7I6 Hypothetical Protein 
CT7I8 Hypothetical Protein 
CT724 Hypothetical Protein 
CT728 Hypothetical Protein 
CT733 Hypothetical Protein 
CT734 Hypothetical Protein 
CT741 Hypothetical Protein 
CHLTR Possible Phosphoprotein 
CT7S3 Hypothetical Protein 
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0906 (CT763) CT763 Hypothetical Protetn 
0908 <CT764) CT764 Hypothetical Protein 
0912 (CT768) CT768 Hypothetical Protein 
0925 (CT779) CT779 Hypothetical Protein 

5 0938 (CT788) CT788 Hypothetic H Protein 

0939 (CT790) CT790 Hypothetical Protein 

0943 (CT794) CT794. 1 Hypothetical Prolan 

0945 (CT795) CT795 Hypothetical Protein 

0956 (CT805) CTS05 Hypothetical Protein 

10 0960 (CT809) CT809 Hypothetical Protein 

0989 (CT832) CT832 Hypothetical Protein 

0994 (CT837) CT837 Hypothetical Protein 

0995 (CT838) CT838 Hypothetical Protein 

0996 (CT839) CT839 Hypothetical Protein 
15 1002 (CT845) CT845 Hypothetical Protein 

1003 (CT846) CT846 Hypothetical Protein 

1004 (CT847) CT847 Hypothetical Protein 

1005 (CT848) CT848 Hypothetical Protein 

1006 (CT849) CT849 Hypothetical Protein 
20 1007 (CT849) CT849. 1 Hypothetical Protein 

1008 (CT850) CT8 50 Hypothetical Protein 

1010 (CT852) CT852 Hypotheucal Protein 

101 1 (CT853) CT853 Hypotheucal Protein 
1015 (CT857) CT857 Hypothetical Protein 

25 1016 (CT858) CT858 Hypothetical Protein 

1019 (CT860) CT860 Hypothetical Protein 

1020 (CT861) CT861 Hypothetical Protein 
1022 (CT863) CT8 63 Hypothetical Protein 
1032 (CT373) CT373 Hypothetical Protein 

30 1033 (CT372) CT372 Hypothetical Protein 

1034 (CT37!) CT371 Hypothetical Protein 

1057 (CT356) CT356 Hypothetical Protein 

1058 (CT355) CT3 5 5 Hypothetical Protein 
1061 (CT330) CT330 Hypothetical Protein 

35 1073 (CT37I) CT37I Hypothetical Protein 

Coding Genes Mot in C trachomatis 
0486 Hypothetical Proline Permease 

0279 Possible ABC Transporter Permease Protein 

40 0505 3-Methyladenine DNA Glycosylase 

0193 argR Similarity to Argintne Repressor 

1041 bio A Aomosylmet}rionim«8-Amino-7»Oxo»onanoate Aminotransferase 
1044 bioB Biotin Synthase 

1042 bioD Dethiobiotin synthetase 
45 0585 Similarity to Cos lncA_2 

0562 CHLPS 43 kDa Protein Homolog J 

0927 CHLPS 43 kDa Protein Homology 

0928 CHLPS 43 kDa Protein Homology 

0929 CHLPS 43 kDa Protein Homology 

50 1 045 Conserved Hypothetical M embrane Protein 

025 1 Conserved Hypothetical Protein 

0278 Conserved Outer Membrane Lipoprotein Protein 

0907 CutA-like Periplasmic Divalent Cation Tolerance Protein 
0171 guaA CMP Synthase 

55 0172 guaB tnosine S'*Monophosphase Dehydrogenase 

0608 Uridine 5'-Monophosphate Synthase 

0735 Uridine Kinase 
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0980 Similar to Soccknromyces eenvisia* 52.9KDi Protein 

0232 Similarity to 5^Methylthioadenosine Nucleosidase 

1046 Tryptophan Hydroxy lase 

0477 yq«V_Bs Conserved Hypothetical Protein 

0048 yqfF-Bs Conserved Hypothetical !M Protein 

0587 yvyD_Bs Conserved Hypothetical Protein 

0143 yxjG.Bs.l Conserved Hypothetical Protein 

0448 y*jG_Bs_2 Conserved Hypothetical Protein 



0006 


0180 


0440 


0977 


0007 


0181 


0455 


0978 


0008 


0190 


0456 


1018 


0009 


0203 


0457 


1023 


0010 


0204 


0458 


1027 


0011 


0205 


0459 


1029 


0012 


0209 


0460 


1040 


0028 


0210 


0461 


1051 


0029 


021 1 


0462 


1052 


0034 


0212 


0463 


1053 


0041 


0213 


0464 


1054 


0042 


0214 


0465 


1055 


0043 


0215 


0472 


1056 


0044 


0216 


0473 


1064 


0045 


0218 


0481 


1065 


0046 


0220 


0483 


1066 


0047 


0221 


0492 


1070 


0049 


0222 


0493 


1071 


0050 


0223 


0494 


1072 


0051 


0224 


0498 




0063 


0225 


0499 




0064 


0226 


0516 




0066 


0233 


0517 




0067 


0240 


0523 




0069 


0241 


0524 




0070 


0242 


0553 




0099 


vi*i 


TO/* 






0266 


0600 




Uu) 


0267 


0656 




0126 


0268 


0664 




0130 


0277 


0677 




0131 


0283 


0678 




0132 


0284 


0685 




0142 


0285 


0686 




0146 


0287 


0724 




0147 


0352 


0731 




0155 


0353 


0745 




0156 


0354 


0753 




0157 


0355 


0794 




0158 


0356 


0795 




0159 


0357 


0796 




0162 


0358 


0797 




0163 


0365 


0798 




0164 


0366 


0799 




0165 


0367 


0829 




0166 


0368 


0830 




0167 


0371 


0831 




0168 


0372 


0881 




0169 


0375 


0882 
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0170 


0376 


0913 


0173 


0391 


0914 


0174 


0398 


0930 


0175 


0404 


0944 


0177 


0431 


0964 


0178 


0432 


0975 


0179 


0439 


0976 
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CPn.OOOl no « 

CTOOl hyponnnt ic«l protein 

KRLTOExrrrsuuucAMifiKiiRGLSSLivtLa^^ 

ato rrr 1 1 i^XAC'/iCLLsrcprcsitittRHSHCDSCsscccHSMHSDjai 

~pn_onn^ 570 «7s 

• •*»! :-.+"' ; j lV flit' i -* j'ti'i**.' Vu /. il " • . \i\T:iu'.x.' r t: "mf k#\s I *! r » r AC 71 Sffif- F 
HMVVN\mXREDSVTSOrMlEEfUWPE3ta2.VKVPAVIK 

CPn_0001 889 2370 

gat A -Glu CRNA Gin Aaidotransf eru ». 

K IKYR YS A1£LAJCAVT LGELT ATGVTOHFFHR X EEAfl30VGAFISl£KEQAX*B}AELIDK 

KRSRCEPLCKIACVPVCIItrKIHTOUaTGAS 

KUOC^AMCSTTLYSAFWPTHNTTOLSRVTO 

OPAAFCC\ATCFKPSYGAVSRYCLVAFASSLDO IGPLANTVEIMA LMHPVFS GRPPXIttTS 
RETFRDSFTtSXLSTEVTlCVIOVPRTFtX^ 

3HAVS IYY ILASA£AAT>OAJtrOGVRYGYRSPQAHT X SOLYDLSRGEGFGKEVMRR Z LLG 
WVt^AERONVYYlOCATAVPJuXIVltAmAFEXra 
ODXYTVAMNXAYUAIAVPSGFSKEGLPLGWXIGCXGODC^^ 
SKRYAJC5WL00QS 

CPn.0004 2334 3B33 

gatB-(P«tll2) Glu CRN* Gin Amidotranif eraie <B Subunit) 
EIC^RCCSRPJIMSAVYAJ3WESVIGLEVHVEWASKIJSSA1^ 

RrKAJVOCEERYFELAQraiEXJDAGMLKHFCTFAGTO^ 

VAYATSLVSLX^XGISXOJKEEGSIRFWNVSVRttCTC^ 

LEAEJCORQIDEYLNOPNTOPlOAraPAATYlTOFEXJ^^ 

WLTESYIEMRKTIJ>n.PYDKYHRYIQ£YCLOTIASIL 

RSLSNWVTVtTGGRClCrur^PSSGIITEZJ/AOLVK^ 

GXNPEEILXEKPELLPMSOSGtLOXX XAEVVLMTPCSXVtTYTCKTICAlJGlXVGOXMXRr 

AGKAppCTvyn.riT n.nKO 

CPIX.000S 4097 6892 

pmp_l- Polymorphic Outer Membrane Protein 

SDIKnJLGTKNRFStXCFPLVTSFTLLSVFTTrSLSA^ 

0 AGDVYSLTGDVSX SNVt*iSALNXACFTArrSGSVTrAGNKHGt*YT f TOTSSGTTT3XSAVlJC 
C30POATARFSGFSTLSFIQSPCaiXECGCLYS70*AI*^^ 

3ANVT IVGNYDSVSrYONAATrGGAIHSSGPLOI AVNOAEXRFAONTAXNGSGGALYSDG 
DIDIDONAYVLFREKEAIXTAI01COGAVCXLPTSGSCTW 
GGGOTARJO^XSSG&fTLFlWIXSYANSQNLGGAXMO 
SLPFLWIHUjONAXnja-OAJWGYSIETOPITSCArCS^ 

SGEJCSLAKDPRDFKST I PONVNLSAGYLVIKEGAEVTVSKfTQS PCSHLVLPLGTXLIA5 
KEDIAITGLAIDIOSLSSSffTAAVIICANrANXOISVTO XELXSPTGNAYQ3UWRRSQT 
FPIXSt^PGAGGSVTVTACOFUVSPHYSFCGNWKLAW 
KECNLViOTLMGNAVOVRStiCVOETHASSI^^ 

YAV HHNEYWM YLGSYtYQYTrSLSWiraY 

GGatPLLVFENGRLFQCAI PFMia£LWAY0GOFK£ITAIX3RftF5NGSLTCX$VFXJGXRF 
□UALSOOVLYDFSFSYI PD XFRJCDPSCEAALVI SG DSWLVPAAKVSRKAFVGSOTGKYH 
FTJZTrTELIXRGSXECRFHARNYNlNCGSKFRF 

CPiU>006 W9 7X41 

No robust hcmolog present in Genebenk/EMBL at of 11/7/98 
KQLOSPXJISAUXRI^SMLVLLCVPS PETTRSTPDCOAN0LPKQ5RNRTLCSL 

CPn_0007 7486 10496 

No robust homolog present in Oenebenk/EMBL as of 11/7/98 
KSF)ttNIAXFSFLWXFLTDSTTSSLSTSU£B5N^ 
GtolATVMPGI^SVISSPAGMGACAlJQCVMIJuXIO 
PaSGISISGADSTIRSLPTYlJJBCWPQSMRXLRIlAIVLIW^ 

DSTXRSLPTYPLDEGHPOSMRKLR IIAXVLXVFSX X LIASGWLLTVAIPGLSSI XSSPA 

EM^CALCCWLAISIOVUJCKREWIW^ 

PEQlSAFSGYXlC\n^SHU»MX5LPYDGHGLEacrKH0XRWR^^ 

EEEEFFFLS ARKRLI DIATTLV7JUCI LTEOLERNNLRXAFSYLYODSIFXX I IDNTDCLA 

WKFHIUKSICRITIIFENHEHGVAXSUJaW^ 

LHGNPFFSLEDNKXTIHXEHAEMLESLSSYRKWLALSDENVV^ 

iAIJEtERDB>^KKA«UCHOCSLYTOARimLTOSSK£^ 

F*IERV0ERIPAIOKLYpNILE»EOTTCOtTVTPTy^0GTTASSXTO 

NONOESCWVLRSM£VD4SWEVKOETCPKKKE^^ 

HLSYFKXVWDCKE\A?YAKrTlUrVXESOL£GILAOTESAKLLT^ 

nCGSlXCAIASKAJCPYFCESPRFOOSDTOUtALTUlLOCAXASLEEEXKRFSNUMDXAE 

ERRUJCE3K0TFERAGLGVUlEIAVESTYDLRSL77m*n7rPESEJCW 

RRAJCTRLVEKT0RYROFKHAUAHQFNEEALL0EELSI0AP5E 

CPn_0008 10780 1U8S 

no robust homo log present in Genebank/EMBL ai ot 11/7/98 
:KYSYIXNYPPPPRRSLGVSCSKLR5LS XTLLVLCV LLLT LC I PC LTAG I5FCACLGFSA 
^GVLV15CLLJLI<VRREVPT\^SEEtPRGVSVTTSE£PAlXKA0KEPETKKXL0RLPKC 
LOOLCTY ICEWAClXRUCOPKYEORGIXTEAICEiaJ^DVVEXDMMSEFLOXORVLNEE 
AYYVEMOTPLENXAYEIFSS0EIJU>YYCAGVCGYU»SGO 

TWkCWEASVMLDHS YGVAR ELF KXAVGVL EESVYK X LFXSYRDAFY ECEKAK tQROGR FX 

WL 

CPn.OOUO 11689 IJH9 

no rohurr ho*w>lon pr«s«nc in (fenetMnk/EMBL as ot ll/7/ifl 
I/T'JAIIAFWFRD INm.VEOLKOT [ FWCEIIDCTO I ETVRK^CMWLORYAOKF I UEKEEK 
MF^HELn^TMVnKA:«lAYA)^AArEKER^LWRKVKOVE3<WLS^^EFRNOESRR 
ARERL* O^LYPEVJWBEPAO^RORTKICVNLENLY AD I OCKYWICVRWEHYWREVENK 

ka ky rem ; v :a eevs n: lquledcletht.k k lt kaeejvfekx ftwteklcnkvle d 

VTNRLE t UTESAKW I PR I EE I EMTLRMVELTU.FM KNTFEKA3U?YNjX'KEMLAJCVEP0 
• K ESPTYR.T; V^I^t^VHOTAYTNC0ER(^*r:»LLE5KVn7Y!1trilUlE0MK1 IF EVOC 
' "FCNRKU *A* ^lJT0ARLOLVATVPWFTU^I»llKRWCW:k 1 NHAJCTERYRE 

akv svmkfji jakot (i JCF.PHVW1J J»onwujioenK«wjRni. tn« iamwhvkt.? 



ntYTfLR3Yppppc«sv^s . jklrvla :tflvfcklll ; jcaifltlc : rcLSAA i $ 
fglc iqHauxvlh i sgllcllvkre : nvn pe e i petv siju> sezpau >aac*tlacl. 

AOEtaWOUfFLXNGGRStWTASStSIJUinrSKRLCYLPSC^^ 

SLHCn KKVAVAFDRNSYAMAEXAF AXALGAIXESVYRSLTOSYRDKFLEjEXAJCI P*C 

H nWjUjDfcKXCAEXWJCNPRVWWILCXOSFC 

CTnJWlO.l l««B l"4« 

FUCRUIRJCCALAJCmaUtfJKKNI^AVX^ 

YPEVSVSXRDIXXCETRSNimmXEZNirPXCVWCXTVWX 

SpnLeS5U9rCNGUQ^SEXXJItXIX»tLICL0XCATA 

RTOCR XEIIErnJUC^ElJ^PTlOWFEXACSOYNSCAXHIJD^ 

RLVSUJEDUlWYTEt^CRFOCOSG^ 

SR1Jort TO)CVSGVICXlAPPCKXFY^ 

AM lB»aJtEXEYWLYREEJUWKETOU.VG^ 

DMmPlfflEDHS 

CPn^OOll 15877 16614 

oacB-(Pecll2) Glu cRNA Gin Amidotranif erase (B SubuaicJ 

FWSDITAAPAXl^PTPPErriG^PItOSItSRAlGraLWIliV^^ 

GLSALXA/CGLGISTISLGVVl^VLCLILlJJUCRXLTtXOIEAJCOIAr^ 

05TOtfU3CIECSRYSDOCFl^TOKrU3LESr^SrrSEF1U7UlO^ 

ar r rr TAA»ff-FTtf1GflDVYIJtt^NIADIRAY>CPW?YXVAKVI EKAXAWKEFXVLTTtlAR 



CPIU0012 16596 182U 

oatB-(Pecll2) Glu CRNA Gin Aaiootransferaie (B Subunici 

GXRVrnJO^KYGUJCCKYOEMUuUXRIXYNS^ 

OXilOCAZAEXATLEQOKILtlJYGXSIFWLIIENM 

E3D#IKJCVUOKLBJI7YI3CUXESSXEST^ 

VXSUUJCYGGTVDPXOOTEAlOaC^X^ASLrT^^ 

ARTOEXXSOOXEAXREZAAESUlSX/nSUJaCSKmURAJCmXE 

OOCARIJaUCEOrrSVLPEIDEIETCt^LE£lJ>LI.T^^ 

VFEWrrVOEYEVO10^FlO4X:iSORJ^ 

CPtuOOii 18S09 21106 

anEL_2-Polynorpnic Outer Membrane Protein 
UOlLAFFrrliYVnOESPUtnaCVVMXIPUVLL 

fUUli S U SFSSrreSATDCTNYVFTOSVVTEWP K ' rcC TOCTSCrKN ^^ 

GFSFTraiOAT^ASGAAXGSEAAIOrrVTLKFSAX^FLKSPASTV^ 

UAOKVLX0XSIFSTG0GGAirCAC5XAXA>a^UFXCNSSST^^ 

CTXZGGOTAPTajCIX3GAXAlAIJ9GTLSISGOSGOXIFECNrXG^ 

KCTALRAAOGHriYrYOPXTVTGSTSVAX>AlinN5P UlUUWAt » 101 rVFSGOCLTtAXA 

KD|TORTSiaJWA7104CTVVlJQGtW^^ 

KUINIDSIJlNOTIKIJAATAOXDIRIDRPVVlAISDESFYCfCFlJODK^ 

AGXSmSADSRSIOAVQSPYGYGGKWIlMST^DXXATVSW 

Ui43SFIWRSFCtffIEUrrEI3AJ^DOWWAGI 

S T HMPU.U1 LStjGFAOLFARDKPYFKNTNF AJCTYAGSLRL0HDASLYSW8 1 LLGEOCLR 
CXilWKTUCSFYGGl^YGHTtWRMXmU'PPPPTL^ 

AEOtYNV^UIYSPC^RSNinorrTTlXSNGGSWI^ 
LFGNFGFOfRGSSRSYNVDAGSXtKF 

CPn.0014 21365 21922 

pmpu3 -Polymorphic Outer Membrane Protein 

IONOSIYFTMXSSFPKFVFSTFAIFPLSMIATETVLDSSASFDGNXICNFSVRESOEDA^ 
TTYLFKOiVrLENI PGTGTAIT7CSCFNNTKCCI.TFTGNGNS1XFCTVBAGWAGAAWSS 
VVDKSTTFXGFSSLSnASPGSSXTTGKGAVSC^SULTXXS^ 
POttFW 

CPn.0015 21835 24174 

pmp.)-PMP w 3 (frame-shift with 00141 

LEFDKNVSLLFSKNFSTDNGGA ITAXTLS LTGTTMSALFS ENTSSKKCCAIOTSDALTIT 

C^EVSI^ONTSSDSGAAXFTEASVTXSNNJuXVSFXDNKVTGASiST^ 

KT STDT KVT LTGNOKLLFSNNTSTTAGCA X YVKXIXLASGG LTLFSRNSVNGGTAPKCGA 

lAIEDSGELSLSADSGDrVFLGNTVrS I t PC I N RSS IDLCTS AXMTALRSAAGRAIYFYD 

P ITTGS S fl V I U VLXVNETPADSALOYTCNI I FTGEXU ETEAAWJOVLTSKLLOPVTLS 

GCTI3UO<GVTWOAFTCX)AOSRIOT3V^ 

TlCATSIQ^TLSGTrrU^ITCTFYENHSLRNPOSYD ILEiatASGTVTSTAVTPOPIMOEK 
FHYGYOCT>CPIVV<rrC^STrATFWITTCYIPNre 

CTANEELOCORAFWCACI^FFHKDSTKTRTCFRHIJCGYV I OCNLHTCSDX ILSAAFCQ 
LFGRORDYFVAKNC<nVYGGTLT/0HNET/X5LPCKLRPCSL£YVTTEI PVLFSGNLSYT 
KTI»©UaKYTTIfP7^CGSWGND5FALEFGGRAPICI^ESALF 

GFKEOCTEAREFGSSRLVNLALPXG I RFDXESTODATYNLTL^TVDLVmSNPOCTTTL 
RI SGOSW)CTFCTNLAi»0AL\XitW>WFCFKSNFEArS0FSrEUCS 

CPn.0016 24J8) 241B8 

pmp.4 -Polymorphic Outer Membrane Protein 

RSDFAlJtRGC>0<RSSFSLLLI5-oLAFPLLMSVSADAAIM.TU;SROS"/NCDTinTEFTPK 
AATSDASCTTY I LOCOVS t^OAGKOTSLTTSCFSNTACNLTFLGNCFS LMFDN X XSSTVA 
OVVVSNTAASGITKFSCFSTLRKLAAPRTTnKGAIK tTDCLVFE^ ICNLDLNENASSEIC 
GA INTirrLSLTCSTRrVAFUSNSSSOOOGAI YASG03V ISENAC ILSFCNNS ATTSOGA I 
SAECNLVTSNNQN IFFOGCKATTT PTGA IDCNKAfiAMPOP t LTLJON ES LMFLNNTAGNSG 
«TAI YTK KLVLSSCRGCVLF: ^NKAANAT PFOGA IAILDSCEIC XSAOUJN 1 1 FECMTTST 
TGSPAHVTHNA I OtASNAK FtNLPATRQir/ 1 FYOP t TZ ?i ^TDX Ul Lii XADAG3GNTYE 

' rf rvrm ekls ee elk f. ponlk jTrn^v kla/^ ^lvlk u rsmwtrr rTwu^RvvKD 

« FTTTFKA:TAHTVrrLNCUA t N X D'LOrTD 1FA ( I KATAAiJKDVALiV.P IMI.VPACrjNYYISKH 
riL.'X^WK f L [ t X^IAOr nKTTTL 1 1 ATI • J l/ITTNllY'.TV5NWr»a:UU«H« TftTKtlKKCYLN 

Hi 
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vrax,vrrpr rrr. l-sroallvr/V -mhhafaz nf e. ^fevelpcssrsyaidxogrfcf 

CPnJJOlB 27? I j 2')0Qi 

p»p..5* Polymorphic Outer ««norane Prot ein 

Wi^svs^\i^»3sxvij<aattplnpeot 

L7CEVLY rDPrpCGCStTCTCr/ETACOLTFtiOTnT^ 

rroFULviTESPxsAvrrGXGSLvsu^vo^ 

-. r wr** i Fnoffi-^x kttca L7rP7^TiEi«u?run*o«AvrswL0UCAASTrrAN 

:- , . , .«rr?:» : u.:.-ft: :r ".:j<. ! i7c.TMCf*ViM;::^Tr:.::T^^C 
::r/j -t.tjw -.'.alma;::.:":'. -..'*;a:.f sifAniATK/*av:P!fJr«n^:JTr 

'XDIWIDIOVnTAPNA'nTCRWVrHLCSTAXWrCLAASOCNAtYTYOP CTTNtflGASDN 
LR IKEVSANQKLSCS XVFSGERLSTAEA XAENLTSR rNQPVT LVgGSLVLKOGVTL ITQG 
PSQEPCSTZXLCL5TS L 

CPn.0019 29007 30356 

prnp_5-PKp_5 (frame-shift with 0018) 

/kSTEDIVrn^rNAgrrYCKNPINTVASAAKKNrTLTCr^ 

DYSFVKLSPGAGGTX XTOtWKWXVAPSRPKYCYOGMWNVOWWr&I^PSOAMlJW 

jrrCYLI>NPEROKLVPHSLWCSrWWaOCXMVNSSOXLOOEIOACAC 

NEICY1WSGVSYLVGVGTHAFSDATINAATO 

EFRSPOCPTTOSSSEACCHOWriDKOLSYSMRNNlJHKTIlV PI rTEAGGSKANDVFGLEF 

cATrmpNSTTtJorYSPFiJttfiCTrAHOErrKrTroBv^^ 

RFSbOOtCSYEiTIJOTPWIIUCDPKSTATlASG^^ 
PCIEVFSHCAIELRCSSRNYNIHLCCICYW 

CPn_0020 32717 30603 

Predicted OMP (leader (141 peptide i oucer menbranel 

KLMSNPMJOIOtRCrLfrarVl^ 

UJIKNNLRIOANXVYVENTVCOSlJOiVAKGNVMVNYRA^ 



RVCTIPILrLPPf^IMPMEIPKPPINFWSCTOGrLSSYI/SMSYSPISlU M 
FTOCNOCFNIJiCSQKQVPDIVFNMXSYYAKRIAI OMA£AHDRYRliHCDrCrTTOCHVKrS 
GEYHWDSWETVADIrT*lrOTC»rTGFTRVlXn*^^ 
YLTOWXSIYNTCVYtDilVBOCTWArSTOIVGENrS 
r.^STl£SSLIYYSWP£ISSRJiSQLSAXI£lXn^^ 

PLAXKEDKYirS IQDAFHSmLLKAG XOTSVLSIOMPRPPIUKMCLtffTKILSfreSXPT 
PmACEUUTGKIWTVSLDAaawXXKCWDKMIXIWEn 
XKCDREZffXLWSRFXOQLUSPX^OHn^LCICLF^ 
YLZYOMIIjjrKirEHWQLYCVYtWl£AOSRFrFTlJQ<DKPKXPPr 

CPrv.0021 34470 32707 

Predicted OMP (leader (19) peptide) 

CSRSPY PNX EJLARGVEHRSMSLFHLTLFGI .XJrS LPI SLVAKFPESVGHXI LYI STQST 

00AtATTfl£Al^YOC«OFTVlJ«ICIXmJWSIHSSOPOT1UCSTI 

l£OA*OTrADPLOOXAVLSAVSGHXiSKT5DOLLr^ 

OKLHSrXHKLPmQCLSAMIlAXXrCESDAYXllDXaJUUa^ 

FIJTLPJtoT5ASPOX>0EMLYAliSraiaXQSYYN^ 

EZE*LPVXKKC*IJIBPRALYALPJ^SEXOIPIALPXFlJa^ 

OTPiaiXYITOaVOPMYNmALSFSJCOim^NWiaiVNX XVPOOPOERCRLLSTrRSLE 

BQXLTFlJ«I^KEMrLK:XYXZJJ^(»ROXATTAXSrUKn 

IXRAYAIXAIYHLTiaJPEraaSLKDYAKXLIOErtirWraW 

RTKU1LDX LETIATSXSSCOXIUXXOIiCrSSDMtNFPVXASLLXXXVC 

CPA.0022 35042 3439S 

mat 

TIUT^SNCaWSMrRSFrSHSLPLVL^ 

CDPIAYTQEIiAAOXAYAVSELHS PCDCX X LTGX7TIVSYT)GWFTKPC<SXADAIQMIJCTLR 
NQTOD^S I AVLHKCKLLTCSCTSO I SLTM1 POHR I ESYI tTIVSTUWOykYCWCHGCl. 
XIJQCVHOCVYKVQGLPX0TLICYXiI£EXJIIDLWDY5X 

CPn_0023 36657 35014 

yj jK/elr-ASC Transporter protein ATPaae 

ENRAKLLYSKQHFVKI^AMSIVT^IGKSI>G™^ 

TIXKIIMSMIEPTRCSISU>KX\^ILP^IOSFKDTTVUX^IMC^^ 

LOEFTDAIG KELCEI EE 1 Iu EDCYRADS EAEELLTGIC I PNEKFDKXHAM I P I DLOFH V 

LLCOALFCHPE*lAIX£PTNWLDLYSXNWLG*irXJa3^^ 

IDYSTX XXYPGNYD0KVO<XTASR£0EKADIKSKEKXIS0UQirVAKFGAC^RA90V0SR 
UlEI 100/3 POEUQCSNIQRPYIRFPLSTJXSSBCW^^ 

GDKLG I IG^l^KTTUOOLAGVEAPSSGS I KLGHOA ICSYFPQNHSEVIJUrGC-ETLF 

EWUOmKTGINDOEIRSVLC10C.FrXDDAFXO^ 

EAWWt^ESVSAl^AIWYKCTAirVSHDRGLIOOCATKLLIFOra 

TACHXQLL 

CPn.0024 37605 36661 

xerC- Integrate / recomhinate 

REVMIAS IYSFlXYLJ(MVXSASPKTLRWClJL^LJCirX.EERGNLAWSPLOLATEXRK 

VSEU>F3LFTKEHVRMYIAKL: QCKAKRTIKRCLSS XKSFAKYCVIOK IUXKPAET IH 

GPIU^KELPSPMr^ACVEVUUTPOISKYHCLJlDRCUffiLFYSSCLRISEIVAVNKOOFD 

LSTHLXRI RCKCKXERI I PVTSNA 10WXQI YUWPDRKRLEKDPOA I FLNRFGRRI STR3 

tORSFOEYLPJSGl^ITPHTIRHTIAlTWlXSGHDl^ 

VKLKX0THOEAHPHA 

CPn.0025 386X0 37684 

elac/ataA-jfcilphohydrolase/Glycosiilfatase 

tLHSSRELIIUXTSSQOPTRTPJIQC^YLFRVNC 

IFVSHFHGDHCLGLGSMLMRUMKVSHPIHCYYPASBKKYFDRLRYCTIYHCTIOWEH 
Prr>EECrVEDFCSFRIEAORI^>CVI}TU»ffiITEPDTIKFLPKELESRCtRCLIIODLIR 
DQEISt'T^rTWL^DVSYVRKGnSIAIIAmXPCOAAIDU^ 

AEJUFMm-AKOAATLAKRAATOKLILTUFSARY LNLDDFYK EA3AVFPNVCVA0EYRSYP 
FPKNPLUfK 

«T»4'> l<ypnr|»>r ical piotttH 

.mvmwu P::u<fc.VT:?rniKfVPEKrMAp!UC^ i m, h>ii ayl [ i urvi.vwvr.WAML 
• TV tl-r.v : 1 1 -w;u wi.Aii.vtL; : iknivl [ nw i ktkftk k i apkoa:;e:;u iwwwukcw 

I VI-:HUUjHKJ KMFf RTVI .RW,W.VTNKFK:> ifElltH t ZDhJU&tW-KT'r/.'.Z I ES5DE 

Jir'KAKi a:AJHrTATTKKr:irr:rrTF::;;KKKKicTKiL';i jirtttj:: i i ikr.ta pk phvpt; k 
•'injioav «aanj 11771 *«yt 

I. in I«hi ATI" -lifffrftthHir PriViMtUt 



PSIRTrVDSTTNSOSPlLO. .rVOCLLMEXXSEWSTDUiPStLFILPLWCRPPT 
PCHAAP ILIESG PYYrVTJCVLA*-"^ * 7 IGL VLTKKIXAD I L KVSrMDU OCT QVAARILR 

iKPtia^Acvu^mniffTxr^ 

t j ^jjp(_ry rrr^ irTgRSPy I^fCKCWP5VALTTAT<tC^P^tW74WO>>rB«iAl.X 

rvSw3HYCLOE i kor r lxu. : j vcx^sxcuccs : iclvc ppcvacs i=*s 1 ax\xi«xf 
frfIjvgck^ 

LKXVClNOeKPKSKXXTFKI^aKNU/rYLJKi'lF^URn'W 

Y t ESVO^SLKTDMHLTCOACEVMXESSC lAiffYLKSALKRYAiCt rfTPKSO VWIHIPC 

GATPiaxPSAcrrKVTSLLiuxrrpvwNuafrCT 

LNILIFPEDNRROYEELPAYXJTTCIJCIKFVSHYIXMJO/AFPK1JC 
CPn.0028 43328 42543 

no robust nonol09 present in CencDank/OOL at of 11/7/98 
RHTLOFTKP rVTSDOSLSFLPYLCXSSG I X OCCSNrVEHYLKlCCOTSVI ITGVSOATTL 

svuKALPisiGUtiiKiisrtLitSLiudsiKvnjazisTrri^ 

TPMENl^XJ>LPSPTTIjaXHAXJ<ILVRSCX^ 
rSYNSLXo^<rrFlfSLVSVPNXSGEERAX>nrKXiaCEI^VKlJC^^ 
grrKDXXACFGLLTFFPWK I YPL 

CPnJ0O29 43839 43390 

NO robust horaoioa present in Genebank/CKBL at of 11/7/98 
SNTCyiRNiyiYCFTVIJTtYIPJTAAIJJIRHNDCLRFCYSY IIXRPKLXDSSLlWrTnOrf iT. 
WCPYJIKXJtTTSIICSSLXSUWLGKREATOSOXLYGTSRrOYl^m 
L0EXTVSRSVMELKXKFYVYLNSERNKT1CP 

CPn.0030 43840 44529 

5cp-o-sialoglycoprotein Endopeptldaae 
UOCVCWYSlJryilCNRWfYFYXYVIIDTSGYYPFLACV^^ 
FlJICSKNLSn^AVAIilPCNFSATRICISTACGIJUHAXW^^ 

LFASSFSDKITVEEV^PSVEOIRRHVlSOFMFVEYlJKOl-SPDYRSYSCir 

CPn.0031 44708 44884 

rs21*S21 Ribosoaal Protein 
CMPgWCVBWOOTVDWtfJttLIOyJCICKg 

CPn.0032 44881 46098 

dnaJ-Heat Shock Protein J 

SLIC«!\An?VCXSVSGKDYYSILGISCTASAm 

VSIAYEVLSOPOKRDSYORraOXPFAGAGGFCGACrJCNMEBAX^ 

FTXXSXJXSGXITEAFGKR^OPAGAROCASiarVHINLTfTEAAHGVE 

QGAWPQGXICSC^BCKGSGqVVOSRGFFSMASTCPE^ 

llSVHVHIPAGVDStMRXJatECYCDAJCCTCAPSGOLW 

GFVouuuaaacsxpTLXJcrsGsa^TVPKiosonLKVRm 

VRTONU EXQKEXiUtTTASTIXAEKFPKXRS F LOK X KCFFSOFTV 
CPIU0033 46129 48171 

pdhAAfl/odbAAodbB- (pyruvate) Oxoisovalerate Oehydrogena] 
k Beta Fusion 
ERSHCVVTJW^ SS XiWJaVWEXJlFAElWl^ 
KSLXJCKPWSFFYYRWreiGt^DLS^^ 

XCM8«<AJSV7FEI3C£GAOIASX^CHOGLAVYZV^^ 
AIJUDVVIU£SHSI«C»«EKYRSAiIUJCLSMDia3Pl,^ 



SEAt.VEE>nTlD^7VIVFGEEVAGIXOCVFGVTR^TTXTC PQRCFNSPLAXATX XGTAXC 

HAXJGIHKPVVEXOFAXJYXWPCTOU^EASSIYYIISAGEWEVPLVIIU^ 

HS0SXECFXJUICPGXRVAYPSNAAX>AXAlXIUUlXWPNPVVr7^^ 

SHUYVLPFGKAAIVHPG)a^TIVSW3KPL\^LEVA0ElJkSRGIS 

TVLXSLDCTGRXXVX HEASEFCGTGSELVATMSEOGYAYXJAP XPJtLGCLHAPVPYSKVL 

ENEVLPHKE5XL0AAKSLAEF 

CPn.0034 49496 48210 

. CT34S hypothetical protein 
VNT1iPTTCRCIEitA£ISTPSLPtSSIVS0>erPP\™^ 

ETPSSIVNAIAFAILAFLSCLGCVFA ICX^SLEIT«PIJILTAVFIArrU.YFIKYLEX 

PKXPEPLPTPPPSPTLRAPTLTPEI PAPAPGIPLPPTLPRVDRTKLTOIPDI KYPmfOP 

KACFSLLXQLFSLOPETRP EORXYSNXLAS I LLRSKEK3G FR PHCFKCHFSHDKXLMOCS 

GAWISS«SSHDFSTTLGRAFAVTTCLORSCWEXIKNNI PTPEXHLPIGSCVSOPWCVEE 

GAOLYTSMtWXNPPT^ETLIKEKKWUITtJCDFSMCEAFTOLVWYXj^^ 

SVQLEVFCLJWLSAiDOEErrTWESCOUJUXESVRIUJkSKEEYA 

TACRALFLN 

CPn.0035 51146 49569 

CT33> hypothetical protein 

ARTTLEEOAGSSLJCPLPKTFPCATALYITHRRERXSEHOHWNRCOVFSSFFFRYPISSWL 
[RLRASCECrOORHPIFL^LYWl>CITSRG«PECSALILIFU><FLPRNPROWLPlASA 
W 1 1 SLMLTPAPFLMXX3PISCTI^IMHAGGCGTYYGEAl£ IOTPCCKRAHKLSCOILSESR 
l^LKW/ELECTLHWT^IVFICSNACTrXEIPRSRm 
PFASSLU^PLPCWJIDLFROICCLSHLFAXSCWHFSLCATTLNKLC^ 
IVLTSU^ I FPMSL3VWRSWIS VH'UrFSVCFSCSCSrjUJRU^GFlLCSXFFf PFSPTF 
VLJFLATLC ILLFFPK t FSFLYTPWTOFLS PFWLYP IR YLAKTLAISLSAOLFXVLPXm 
Y FC5UPL EGLLYNL IVPFTrLP I IVFLXATX XU^CrPXTEALIOCfTJWPWLKNPIIILK 
TLSFAPVPPWMLTLA;;LILFF IC tLRTNVSPYAS I&\TT/RFIETL 

i:Pn_UU1». SO«»»i'» SI79»» 

AKi ^Xn." ERKKMFK ITJNTlt^r HT//RSPPPFOVLC I EOLPKEKSWEW.-JUC [ PP LPROWYEL 
WJUiKFJ* inFT.'LOI^VC: A/LllEHKEnP.I fCRFFSLUTlEVY I YRLEKEr/OLKMFWF 

KDnrni ;i v sktpu JM'iiTiinRU'f'LnohirYEKFFf; r wr;FnkweoB ) ( pitff'j;i,\KvwK 

I J^a« ATMNKW/AM :v.;i i; t Fl'FYT/EEPFAYOSFFFDPF. I RRUt.KM+f/IXNKE.'XE 
Illt;:| JCT I EX.UIU';K:rvYP::F'UMf.»W/U!.ieeVYNE 

,^••.11 it: I4irir:|4*ii'.ii i i. if t't'itttin m»r 

ki .1 j > i : i • :i-ma ( i uu:: a» *wr t y i vph: tKWKrTRr/LESEKDPOou i PiEi/^-n.' i */kn 
aa. » t nvwvw w f vmi #\ i jf:n 7//11 rmnKT i rwuc iMn tr^u iApytitU u-/r t r:»ea 
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HR I LCK WDAFSSCF^EL 

•:piijioi« wuv smn 

o^VAX^EVTraWEVSSIL^ 

SJUaNT-AY I RRFV^Vr-AATSKTA rVoRAKSIPYIiANIStELWNXAKRYHCKLVwItXJY 

y ; .-— . ■ — rrirr.w: '" :i? i v.*. :.*irr/?Ji.:.:rrr?T 

•T-vwiavim ' *. ,j ij.vio!I^k«-= ■.T:MJ{iJWSCJ'i 

MIHWLaAAjqWTVPVSXCG^^ 
rCCL£XTCAIX0MCTC5EVCELLNWMXrrS 

CPnJ>039 54256 53963 

CT339 hypothetical protein tn^rr 

XSMSSWKIOOCEAKINEOOFLElttASLIXlCRY^ 
Cl^PEDPEVII25lJRAArKLA«X)KXX3eMSl^RSTOPr 

CPn_0040 55673 54316 

dnax-DNA Pol XXI Geaeia and Tau 

AFYTKStXJYTWrXXJPYOASSRKYRPOI ^^^'^^'^y^fJJ'^Y^S^Ji^^^S^ 
TraCTTWILAKAI-NCVKLSEra 
IflETVLFTPVKAItmYIIDEVHKLTKEAJHAIACT^ 
L3TO!0WLORIPEmi£KI£LKAOI»KIEAS0EAJ^ 

LTPKswprrrvxoMi:rAsoosiJtnJ»AiLOiW 

LTYBNLIATfKnTSKFSSOmra^ 

ORPVLSIilSSIKSWjraCIJWIXEPTLTCXTVSAPQPOPTYI^^ 
SVEVXSSAS IKSAAVOTLLOTAWEPSCILRO 

CPnJ>041 55888 5734: 

No robust homolog present in Genebenk/EHBL as of 11/7/98 

CKYLYHHSYFPPWSVCSISSRYXLRVLACT^ 

MlttSAI^WWSCaijCLLVJCRWKVCPEErPATO 

OXEOKTWWLPOELWlimiOW^^ 

DMIAETVELOOILCOtXaUiDVINO™ 

LWCSAREVVDRFWTTCKIRXIAKrftJWWYSV 

RYKD<RrrRAJWinCVAEWI<n*AT>WVOTSlJlOO | I£WtVA> tAWS«n0ROVOKXKDCR 

UUJIJCEIJlDCELPRAQERLRELaALYPEIAVSVVEAR}^^ 

EQELY 

CPn_0042 57346 58182 

No robust homo log present in Geasbank/EMBL as ot 11/7/90 
EEZDCQEAEFRENGTX XRSMEEVSSYXXaQVTOLESCSia^TPCrPALCVI Trf MfTTT E 
SI ILSOVVrwrBVLCRDIEI»fl^VEEIERMLW«ELPU.PIKE^ 
TRVEJTFKESPAYLTSEERLOSXJOrLORAYKESOXVSG 
Otf/SLXJOXILIVrSTPRTirjSYKSPTUJI^^ 
DAF0AF0EMXJC£CLVE£AOAIJirtTyWLYREZlUCSXXlQI 

CPn_0043 58432 60372 

No robust homolog present in Genebenk/EHBL as of 11/7/98 
HHRriaqVPLSPOI^PPPPIWSVGASFCLSKrRVlAITrLVl OT !! f t stcm. f ltxgisg 
VSLCVGI£I^ALCSVLVISCITIiLERRrV^ 
AKQXXXQLPOELreLXTTDXOtnn^CX/aajeDXJCCiayRCLL^ 

FVELQQVHDQESRYLEGLI HEVQSIAHKI^TDVNIRSKLCESCCYIJSD3VK CnJjy 

AKiVVAftflOCVTRDIKKIAHAJTiW 

HUKII^OEWISARAEORTREVlTOra 

LI LEIOUaJKVMSHOLVEATMRVKEAE\nYSVAIWArETOCS00WKKr0 EKTKERLRCUC 
DIJUJOECHRAOERXXKLTALYPEVSVSWETEREIKF^ 

VTEOKKREAEFRAJCCTTCV? SMEEVAEHLQ ILENLLEDCYKRLSKAETFAI£VEREATEXI 
EYTXLSOAANRLRVLCEDI EDTLPRVEEIEHMIJlMAERPXiiPIKOAFTXArWOY^ 
LAJCVCPYYXESPAYVNSCERWSLOOASOCIORVFItCFKrRNGSMYI 

CPn_0044 60278 60778 

No robust homo log present in Genebank/EKBL as o£ 11/7/98 

(AKSKRVWXRliiSAYKES0XVSSLrrEACrYREYlJ(£QVQQFET0GVSLXKEEIXF15S 

TLKSKl^YDPLIANIPafKfYYCYYDDXDKARAOSRWLEKSERY^ 



CPn_0045 60961 62790 

CT345 hypothec teal protein 

CKYTYHPPOLPPOHSVCATSWQPKLRX LTITR.VLC7/lXLISGALrLTLCVPCLAACLSr 

GI^IGLSAWX^WSGLI^FLIRRGVSKVRPEEIPVTPOT 

IQEVVSCLGKIJaJLKYEDOCLLTEVOEiaJW^ 

I NQVQS I SH KLPVPDVN XCAKLAELCCYL PSGDVR VERLKRS ARQWDR FMR VTCDTRKV 
AMAFDENACCVAXNAFDXAFGALEECVYKSLTESYP. EAF YEY EKAX I LRNEDVEWLOOKN 
KSARA£0RPREVKDRW£(^KETVFVnnCENQCIOL£VLTAVQCWPt)RCPEKLX PEKRRNKV 
MSHKX*EATi4R10CC*EtnYSVARVAFETO 

RERXJ^EXXALYPEVSVSWETERETKFKl^AYGNLEERYOSVVT^ 
FREKCTKVRSPEEWEYLO IL£Nl^XrSKOLTIAT//VLC^LEATAErEYTILSDAAN 
RLXVLC ED I ED I LPRVEE IE XMLR I A ELPFLP IKQAFTKAFLQYNSCKDXLAKVEPYCOE 
SVDYKSrJFRV 

i*Pn_004ii H2775 63263 

Ni> reduce homo Log precenr in Cenebdnfc/EMBL as ot L 1/7/96 
ERFOr.LNODL0NW0ECOKATCL£SEVSAYRDHLREO ITEFETOGLDVtKEELLFVSSTL 
KS KLTY DTL I AD ( PCMXFYEEYYEC I DKARVQSRWLEXS ERYRKAKKCFOEMLKECLFKE 
rOALKKAEYRLLREKRWIXFKLLtCNXX CAAOCR VOE P"» P.1DC 

i'l9k.iNM7 MJ7 oJ6W 

Mi lotmisr rwMhili<u pn*u.tnr tn Gan«tb»mK/KMBL >*» oc 1 1/7 /OH 

KJiniunn- 1 v :m<Kvi.a: ilthyiifok i Rm-LTtv:r/LriKr.LKKpYELWFVYcccrE5 
KVKi/ymainwi. 

I'kTi.'HMM ..>**7 nSflOl 

•V'HK M •xeiSNMViM tiyivtthut iotl [H f.rv.nift 

NKEI J>HKJ,*VNRAUIKUaMWHYKr.YTrV.T::KIVA t FTFAWI JtVLWPEYKAtlEISRtS 
l<TA(4fl>F:?lj:a£:AliKKYKMTAI1I!;EAFt*KVYI(LTU~^ 

n rniFvu:::rt\>Ki *i .holt i vppllukekktle ut l.'irjwcr aw i AtVFtt ilk tFLlQEN 



to Tobust nomolog present in Genebenk/EHBL at of 11/7/98 

tgw^MrvYL^IYQEIFWLTWLHQPYYDX I LTPfTXY I PGHTKXDSNKLF QIOtSHA lW 

JmSwFS^FSWrLIFVSLVPrAGLVTUTO 

n£cYGLIC«Xl/2^ 

AT 

CPn_0050 66849 66499 

no robust homolog present in Cenebank/EMBL as of 11/7/98 

VSOTIUXFUWRYAKHTT^ 

OOTILLPIFLLLLAJLLISVtJWXLPFRLCCFAL^ 

CPtL.0051 66797 67111 

no robust homo log present in Geneoank/EHBL as of 11/7/9B 
CTAYLXAWUPWOIHETYIHPCVLPSSHA^ 

ksseol><dohrxpliffckhhptisxuwnrf > swlsxfyncercf 

CPA.0052 68008 67304 

henC-Porphoblllnogen Deaminase 

KXieVCYSDPCX^DrCOCXRPXAXASRNSN^^ 

ajjaxxiPLHLVENSYrrm^ALVHKr^^ 

LLVYADKYVKEPLPI^PIttjGSSSLRRSAVXJWlJJ^ 
IVlAW^SIJUJlLHHAYSIEX^PPYHALOCSLArrAKDH^ 



69350 



67986 



CPIU0053 
SuTlSKrcSrcNScC^TAPKWLOO^ 

StZLBtZS* XTIDKACWDRIL0CCVVI^L7XLOS0PG ICJtSTLlXOTAXSLASOJCYXVL 
YVOCEESVrr^SlJUUaailXSSPLIYLPTETKLIWTO 

SHANYRKlRSVKNRrCPTNELL ILSMHADCLKEVSNPSCLFLOLAIUPI'IUSMX XPXXIS 
SOU^ILOALVSSSPrANPVIUCrACfOPNRFSLlXA^ 

AXLPEOOXSSLPKEI REWTUflGVXTIICIJAXRLli* 

CPn.0054 70089 69313 

rnc-ftibonuc lease III 

Tl^mPIXIPNSKnCIXyu^LSI«PPIDITMEAJaiffTT7^PK^^ 

DYLLICJ03EKIOSERGRLSAYANLFES ILCAVYLDOCl^PARW.TVPLLPPREtILPO<S 

GNPIWlM»rroK0nWI4VYOSTAVTDAOCNVSYOI 

AAOOMirrYaiKNQNTMDV 

CPfU0055 T0096 70590 

CT296 hypothetical protein 
CmCYLXRxWsAX40jOKIJWFKNHGSXL 
TVIOIRENIFRSMPEIYTVVRJaiRUJrTAAJX™ 

FLU I'ffrP 11 ArtCf ^ T y^cpyp^p^vp-PccTTCLLLArssvcr PVI 

CPn.0056 70917 72746 

mrsA-Phospnomannoiiuease 

£7IjriiLKRISL«KrVEORIRSLYDAVTAINlCTWLSN^^ 

DLFCATLTrCrroCLRSLNCICTNRlNLrriRRTTgGLVOV^^ 

RKN5 X EFAQ ETAKVLACNCCEVTLFOYPE PLALVSFTVRY ERA IOCVN XTASHNPPHVNC 

YKVYKASOGOVLPPLWEXVAACSAVT4EILSVPSIDHPNIHLICXZYEALYRDTLKOL0L 

YPEANRXSCR5UXSYSPLHGTCXSLVPHVLKXSCFL5VKLVEX0A XCDCOFPTVQLPNP 

EDPEALTt£XEOMLAflXX»tJIATOPXUDR\«?A«XEDCOPYRFNG^^ 

WSKT^LGEHDlCLVXSLVTTEfllJAIAXHYHVDLIN 

CAEESYOCLYGTMVEDXDA X I A£AL I AEAALO0KL0CXTLCOALLS LYEtYCYFANKTES 
VVFSAXTDEOEIRJOCX^HLEEISSANrFSCKYOVEXFDJYKOCICFNUJKDS^ 
TSKLCYYFSOC3CPVI I RPSGTEPK X KFYFEKSTKYPERVTDKEIOKOREAESFOHLDDF I 
FDPKEXPSNL 

CPn.0057 72913 73554 

sod* -Superoxide Dtsmutase (Mnl 

ILKRYWKSFVPYSLPELPYDYDALEPVt SSEINI LMHOKMKO I Y IWILNAALKRLDAAE 
TOONl^ELXALEPALRFtG^XNHSLFWETLAPIDOCCCOPPK«ELLS 
FUCKLI EVAACVOCSCWAWLCFCPAKOELVLOATAWDPLEPLTCKLPLt^ 
LOYKNVRNXA'LXAFPQX XNWCH IENRFSE X X SSK 

CPn_0058 73627 74562 

«ccD-AcCoA Carboxylase/Transferase Beta 
IRWLVRLfSYWPKtlCVOKIKAICFSGWlJCCN^ 

AIERVKLLADKDSWRPLYTDUCSODPLEF XDTI7TYANRLEXARKNTTESECVXVCICTIC 
LMPVALAVMOFNFMACSKCAWCEXLTPi. IEEAX ETRLPVl IVS ASOGARKJESVFgLMO 
HVrrSAALAXLHEACLPY ISVLTNPTSOCVTA5FAALCD I X X AEPKAL ICFACPRWAQV 
tCEDLPEOAOKSEFLLEHCMIDK XVERKELKTTLCTLLDY FLAOEYTOCKSKAPRDLSXR 
LKEIFLLTODSE 

CPh-OO'^ 74^2 7W50 

fm-ilirrp Nu».le»>r lUohvlroidsr, 

IFJOfTA^CNDN t ICNA 1 UfTVFCELD^CCELPEYTTPnAACADLRAN CEEP XALLJtXTRA 
Lr PT; I KAE I PEnYEWVRPRSGLALKMO ITVU WPOTX DSOYRl^ IRV I LINniDJTFX 
I EPKKP t AQWL:;r\\\»ATFVVKOE.':LAI!TARC::ry:FQ imA3 

CPn_00'.0 7^004 fS t 2 h 

ptsN-PTS It A Puv.t»in 

P.KLPET/EVLV I LEOAKHP^Yf rvOQLF.'JLFULLCPRLVNFLCKl WRDEI LOW.TTO .VDA 
ACLLEDK0AFFOALVR KEN ( KTFZ f f»T r /A t PI (OKLESC-'illF F t A t(2 1 1 rpQi • I V\ I ft *• 
ALVRLV FL t lC P ENAOAEY [ JCLL.TTLTL.': 1^ KEC R RQOf -LC/VNT I E EVHNVFV? M 
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HUM 7620R 

PC3N-PT: 1 1 A Protein . KTtl DNA-BinoirtQ OOMld , 

OUHCC tUXn^DWCLDr^WU^OWjOWJCSCAX PSYSItW/RFSREEJ EWLL 

t*E5Vt^EMl^lWn/LICTGXGeGXAL«lA^^ 

VG r LITLT ACQOlCKLNLVNX rVHLOCUWRSrrXNYPNTOLL^YVXWtSOTW 
.:pn_00*3 76251 77690 

y.'-i,! AVm* :k!*t vxavu;:: : ri^tttf'.TAiiA::::.."/ -.rrvr :?. ? 

tVTOK^ERl^KEPRAJUCGYLVPSSRTlJAR^^ 
PRSVKIJWNWU^1UUUW(^SAFS^^ 

55 ARIWYCTPSSAAPStFLET£tVRAPVWT1crL00NlIH I PVVqvqTWPKg MnCT tO 

LASOASIQQSErrrroSLRCJ^QGASLPV^VRSyPEVSVQRQWTrJ^ 

VRQAJXUtfLTWCVAWX^VTSTt^ 

NCKQDKTTPSEDASQ CEQQTGACL VRKT P KSQVAgKAQNrYRNSlW TNI P ^TffffTf ^ 
SSEr™PCSSCVSKRRTHHSlSVCTMV\nVIAMIVG^ 

CPn_0063 78109 78267 

No robust homoloa prMtnc In Gen«ft*nk/1HBL as of 11/7/98 
pMYANCiooJCt^YDFSfuwsppa^TTTPPYsrrLCiF^ 

CPn.0064 78340 78576 • , M 

No roouit homoiog present in to* 3 ^"^",?*}}! 1 '™ 

LVHTKIQCSAQYYRSRPAERAQTPPOPFTJUUJRA^ 

LPLPVKLLPLAACSYLJUr 

CPn_0065 78882 80651 

CT2S8 hypothetical protsin 

YPYYKYNKF P KWWW TCT P T HF lU SPKLHPDCVNPHFrglWPXVAI^mVVIj g IAL L 

SCIVUICTPLC^ISMIU3XUASGGAI^^ 

ERPELKMi?YSIJUgVWmJO<SVVKHL^ 

KXSQMUCIJOIIAYREXXLKEOTQYOTl^^ 

SUCFSTWSRMSRIHTmVIIJU^VVSVMW^ 

LSYLVROIl^NTIWNRODrnCPVIWVOIEXiN^^ 

QDWYTQYITWIBCRIUminrrYKEII Wl ' l^ 

SETPITOCXEPAIOJ^SQNlSTrrSPnTOOPgSUWP^ 

CPC WfXlPTr^^TTT^FtrMnifLr^OT <^VWWIWP» CDroW IYSPrgLDMO 

KLECa^fYNHUtfAQEErTOSLCDLVD^ 

CPn.0066 80916 82655 

No roousc noaolog present in Genebank/EXBL as of 11/7/98 

CVTfMANPTORPPSPCISIEElXLOElWSCSSKTTnSKrpPPSCAATAfiEVSlJ'I 

NSmEZCPLCSCEVYDVVCXTWOGDPEVRIIKEVFVW^ 

EPVRTIHNSGYGLGSCFIfiIRjmiPPRlW\aSOAI0AR^^ 

LIYADSAESLn.PSVRCPSYQWAIi»SCOCLXraMNUUVUP JtPCPSSSXZALWNUQOH 

STWI1uUE*aDRa>SQAVt£XJ<POPSHCTmiALT^ 

IVTYMVTSYSIIfrXJYFILIXTTWPqCRIWnWIJ 

VNWlPPUSVIPCTASFATGSFryVDLTWFrrSIJSRXXSJV^ 

KLDSXJtPSONALXTFHQCLFMPLI ICJTO0LV10VJ»RVVIHPWnAVrDWrS0EA*IDS 

COVXAIGQTIWrUOCLLVlKWTrVRSVIWN^ 

CPn.0067 82920 840S3 

No robusc honolog present in Oenebenk/EMBL as of 11/ 7/98 

KS5CYSYRCPPMAVBCRVl«S0AUCOO0EVI>NX0SKCLU^ 

LIALTI^ILTSVPYlJkLCVriXXVTLKIZFALC5EICIiaC\^^ 

ZCKEXEJCSPEMFCRTATOZ PHR£AX£QFNHSCHH IH ES PALTETVTISHQOVLLFKEW 
CPVTLPCT/TStEEVLIRSVVCSYIJJCACVPKVSHLIMLHKlCIJCS 
0RKASPLFTOJG)IATPTIAYTRVNC3HIAPFRAGAieW 
CYYAIUArNQTQftX.YNOX^tMJaJUXYAIMDXDPl^ 
OQEX2REYP5RAAQDQFVG 

CPn_0068 84909 84331 

CT360 hypothetical protein 

SrMIKKfTIYSLIP^STSAPIJCGICNCJVSSOSRiroPCVt^TQU^irrPICEOT 
rRNELOAISDCOKSS£XIEESCCTSOSB2l^nnWESSNEYVU3 

cosoqvaoz iarn<R£Ft)iromnjiJcimi2xiJUXDL^ 

EODZKQTLKLLKK 

CPn.0069 85191 87086 

No robusc homoiog present in Genebenk / EM8L as of 11/7/98 
LNFLYVYLLX FNU3IMTTPP psrssspppydwielodlgntonnssratppppevocelp 

PYF^ASNFmERG*PSU>SPOQLLSLPeYSRQPPPg^ 
UTCPANSERIVEIHEVrCI Y IASTSOTQLEAVOOTW 
ENTCHSRIXVSHTVRAMTYITORrFSRKWNWCTOF^ 
Y\XCt^PTmRCNYHVOHYRVRGF*PSCLDSlJU^^ 

T FDNA I RYC ERCIXVCSEGMCMLPETQOQTS PLTSLEDGHEVALVLNPQQNP EALS IAS R 
U4H£ERQCRLESNYMPGR5SNPFKrSMYVLVRI/frLAOIYIMSPYYSFOSNOIVCLIP X S 
CAAVETV5Y I FLTVTDSTCCRRYLRVPRtVCTClitNlJtfJTrrXElXI LSYPRSVECVPF 
WRFILCT/MCrrRVVFFAWNLIIJWniC^^ 
YAIVFPSrV23CLLTALAHANTNIUU^PYPXIESGDI^ 
CLV XNTC IYMLI LFANLIFMVYSVRRYHR5RR 

CPnJ)07U R7399 87208 

ttrr robunt homoiog present in Cenebenk/EKBL as ot 11/7/98 
YXVnLFIILKNONFFSNOSRTYEQRFPKVS PHFES X LPiQSVC F SSOGTLL I SFRDTEX-KR 

ftLYI 

'.'Pnjl07 1 «80*»1 87599 

Tl25 liypnrtMC teal protein 

I K'AJCl I LSF rCPLOHARCLKKOHK 1 lEELPPETFOFOHLYLKLHENSJSRDAFDXICRKL 
KKtn.Wta.WDLYLYEVYQDC I UTFTYTKALICSGIA5LFTEVY5CETPST I LTCKP I F 
KWLTI'YU.-pait-NtJ^LYMRHKOIAWOYUPPCT 

•Injur/;! H'»I5I MR057 

'.T:i4 \m*#i*+t ioii protein 
lYW.TKT: VYKEKVI. I LI YCIXOTP1IYRH5TPL-S0G rSPOTOYVPOELFCORLCSSR 

::N::Mx«A:y:ua' i vrtpp i :?ALVALTDUcLVPYN0f n privrrrR lknavek wlflornwk 

ft IJLt 1 1 <ANAL t I.WIIJ nVALTLT t WUTVG t/I ICV/FH IFTATCLDKENKMRHVNSUNNL 
UBBjl UjUiftf nHO I LLATK tAS^JALlYAVPQAVU.V IOF5 ICHQUZ INTVYGARUSO 



CATYArDIWAMKKPIOlIE lirir,W:>C*gLSAL :EIN»»CTgPA TANUA S 

SSln^pyct^hp^^ j3YUJL»«;;rco: i araooc :kt-5CT-30 r wcpqri 

tESNH 

CPn.0073 89)53 99574 

^E^^^^riviLm^^ 

AYOLTKARWYRHR 

cuCA-tlunuaCiun Factor Tv 

StoisotfWwiwcticwvi^ 

KA^rNASMVEYCTPNRWAHVXrPOTAIT^^ 
KMJBaANYXlXVREXJOAVIXJNXWPERn 

vKvsDrwvazancETivpcr^^ 

MSVCTHTiyXSAVYVLOKIIPCRHKPFFSCYR H ) ^ M R II W lUWTLPrCTOfVWPaW 
VfOiWJ^ALEZXaOlFAXRfiCQrrXCACTXSKXNA 

rCFAIYTVDLVLRXS ITCLDC ITTTLFC 

CPn.0076 91334 91903 

nusG-Transcriot ional Antiterninetion 
OPfCSVNOIYWYVVOVrrA<3EiaCVRKAI^FK^ 
i\^iryiVKrrtiV»0{LTDESWLYV7CS^ 

EEVAPG0E5E 

CPn.0077 91956 92435 

r 111*1*11 Ribosonal Protein 

FTVSYPLFVEVSOCKVRFSN^VIOCNaKXXiafllPOCKANPAPPIGPAI^ 
EFNAATOOKPCDLlJVVITVYAXMCTmitlCOPlV^ PWOOCVOCL 
T0WEA XAIBJMCI»«rVlAESAJ^^ 

CPfl.0078 92453 93160 

rll-Ll Ribosonal Protein 
SCRIKTlOS^IRCXlJNWSlCSYSIJlEMDIt^ 

OQTCAVFLPNCTCKTLRI LVT ASGNKVXEAVEACAflFMCSOOLWIXSaNlXFXJVAVA 
TPOeOUMagflXVLSPRN LH^lPKlUl ^ 1 llW Ug^XSELRKCKIEf^AWACVOIWC 
IO^rE550IITOIEAI^SAl,:iOOCPPAAXOOYlJVSFTXSSTtCPCISin^ 

CPA.0079 93170 93688 

rll0-U0 RibosasMl Protein 

ISIXKODCXXLLOEVZXXXSAAgcrXXJJrYXJ^ 

fXAZEAACXiEVDCSXnTX3IXX7(/VFSCQOPVSAAK0VUFNXOK^ 

GABVlAVAJCLPSlJCEiJlOOVWtJAAPIISOWXWS^ 

CFQ.0080 93720 94121 

rl7-L7/L12 Ribosaaal Protein 

EPWAVTlXDVPADS3aGVXXVVREVTClAUCEAK£KrEGL 
KL00AGAKASFKSL 

CPA-0081 94219 98016 

rpoB-RNA polymerasa Beta 

rREXLSHONSRJmWtJCCPEftVSVKlWEDXro^ 

CXitVTREXFPXKSYNZATVLEYI^YNU^CYSPEECIRJ^m 

XXIZEVYmXPUfnXSTPXXNCA£RVVVSGVHR5RINrE0E1« 

ROSWLEAireXNK^XHIDRXlOWUCllAXTFIRALCYSSt^ 

X0FAU.VCRXtA£inXDEA5SLVYOXACEXI^AIflJat)a^ X 

KMLAKDPTP5Y EAALKDFYRRLR PCX PATIANAR5T IMR U F D PAR Y N lJCRVCRYXliWUC 

LCFSXDOEAUOVTUKEDVXGAlJCmiUJQ^ElCACVXX>X(^^ 

CRSaUWMEKrVRZRMAJ'DFSSCrn.TPOKWSAJQGLASV^ 

AELT^OCRRl-SAl^RXJLNRERACITVRDVHAS KYCR XCP X ETPECPNXCLXTSLSSFAXX 

NEF(7XET?YRXVRIX3XVTDEXeYMrAI}VEEECVIA0A5ASL£EYNHrrtPV^^ 

AFEADT8TVnaaV5PK0LV5XVIGLXPFLI»OnANRA£>43SN^ 

TClXCRAAKDSCAIWAEnaCVVOFVDCYKWVAAKKNPTI KRTYHlJCKFliRSIISCTCIN 

OOPt^VCXJNaTNSI^XAISPATORGElAtZSKNV^VAFMPtfl^^ 

AmiYIEEFELTARXJTKLCKEEITROIPNVSOEVlJWriaDCX IRIGAEVKPaDXIAICK 

rr1>KSETEXJ^PEERIXAAXFCEIUVAI3VXDA5tTV7P<r^^ 

LVEEAVTCJOJLOKCYKWVATLrrTYTlEKl^L^ 

QETIERIEQ£OLVI)UJ<PNCEMYtVl«KCLfc5DYETALQRlXItlYKra 

C^IR0VKVWASKRXLO^0XH*CR>«>«CVVSKrVPEA 

ERFI»OCWICYIYMlJCLSHLrADKIKARSICPYSLVTOOPLCCKAOMOOORrciyE\ArfAL 
EAYCVAMMLOEILnT<SIX3V5CPTRIYESIVXOE^LRSCTPE^rNVLIKD«OCUa^ 
PNWOA 

CPh.0082 97992 102221 

rpoC-RNA polymarase Beta* 

CSSYCRJUUJCNOVlXXXMFCEK5ROXCV1^KECLFtmUXCXA5Dr^ 
ETINYRTFKPEXOCLFCEK XFC PTKDWBCCCC r/KX IKHKCI V^JRCCVEVTLSICVRRER 
HAHXELAVPIVH I WFFKTTPSR ICNVLGHT AS DLERV XY YEEYW t OPCKTDLTKXOl<LN 
DAOYRIVVEJ(raDAPVAKMCCEAtYI3UJ<r:EDWSIXKDUtE^ 
LKX iECPVSSSWrWMVUOJIPWPPOIJPLVPUXXRFATS 

A I LRUCTPEV I VRNDCRMLOEAVDALFCNC RH^H PVMOACWR PUCSLSEMLKCWCRFRO 
N2XCKRVDYSCRS\*l IVCPCLKFNCOCLPKEK/VLELFEPFI IKRUCDOQiJVYTIRSAKJCM 
tOROAPEWfOVLEEt lKni1PVt4>{PAPTlilRl/it0AFTPVLIE(3(AlRIHPLVCAAFNA0 
roW0MA\aaTl^XAOLFAKVLH>IArW I FLP.^JOKrVA ( PSKDKTIitLN'YLNAOPTYF 
PEFiWniCTK I FKOE I EVLRALNNTFiF I tit WF^WROCTnROI II IMEXIKVRtnnOUCTT 
PGRVI.FMRIVPItELJFWYSNrrrFP I f JKL 1 1 f/Ti RKVni JMTVNP1 JVXJCCH/SF XOATKA 

AiJMnijrovRrrDifc.uiucnAYDryAivKrv/ooTt itdierjukt in i wttvjeolgd 

ALYVE I :nCOTRCKHNTLFLH I ftyw JNK-.'-W rVU^l-RCLMAK m l\ 1 1 ETtPlTYWHE 
' XTVLEY5 1 S5HUARKCLAOTAtTTArJ:> '.Yt.Tf P.LVOVAOOV tl T« I A XTVUH IIEICAI 
OVKEELLTUtDR I YORTVAWJVWIDK: >UMJi UDVt MKVOAEA IDCVVl lETtRtRS 

TLn:fc;HWT\^APCLTw;inLAH »ii,irM s^v ; ; t AAiXi I' iKrt rrw.THirmii / 1 ; i aats 

:.TI'Er IT*fc:DntLVYMDLftWl4 r^niNI.VIilFF'lAI J»VVia5»-Y:irrLNKYKKU.TTlC3lE 

:;LKVKPVEUTVKn.vAC«7Trv:v^i';i-vf:iJaiiPi ir:nxn:piitYKni.vKfii:rrEJ{w 



76 



WO0027994 [ file:/A\dcwas03\firmdataMpVFoleyPat\PatentDocuments\W O0027994.cpc] 



Page 79 of 330 



WO 00/27994 



PCT/US99/26923 



UCVDINDKHIEI IVROHLOKWTOrarTTLLreED^ 

VPVLLC IT* A.TUGTESF t3AAjF0X7nWLTt>AACCSXTOYLX£FREWrKnO< I POCTG 
rETHJCRIKOYUKECCOtVrorVSETtCVC 

•:pn_no<n 



102206 103312 



jeu/jeavvwgiwxioto^ 

VQRAVFl^LFEAHOGBKlWLLVlUPG^ 
■CAIttTLlSPFVCTIYt^IAAYGOXraXDADOTAS^ 
TKEXJVLAl^CCOLLTISPIttiDEIJaSOHPVlOM 
EOAMATEKLAECIRirAdTTOXLETXITtrXKOIAAECA 

CPn.0084 103356 103751 

!£|QUKDA!«:iGNATVSVSv*ItO^ 
7SXWQS50LZS 

TPTV.00B5 104512 103766 

CT311 hypothetical protein ,__ 

FSKKFnLMLIVAQFPAFSAOPimjVS^ 
AARJCXIGXFEICKPSISIMJWWYSG^ 

LTPTINIAQEXTSKSSKEY XCCX ^^^***^*'^**^^^S^?^SiS* ««^^»Ti«r52irr^S! 
GRVFCLQATTVI DHT AY X FTSTATIJOYAELSTTFIJCVVSS FQ I RGGJCEATSGI>AI LEXA 

LEALQNOOC 

CPnJ>086 104898 105527 

atpE-ATP Synthase Subunit E 

NIKAWLMADGXIJCQICDAtRIJm^AXfflEAAAliJ^^ 

EX3WWKIK0GEVAI£0AOiaAIXAIJC0AV^ 

VOALEAOGVSGNLTAYIGIWVSPRAVNEX^KAVT^^ 

WVX^SSAXXEimYLOXDPIUMXrOGS 

CPn.0087 105540 106376 

CT309 hypothetical pro tain 

SHEXirSrrKVVVWOYYrLSSn.PTQLPESVPLr$ tSOLXm LYlJggXam i^ftCtjLK 
RmrZNFAmiAWIPrsrcEVTOENVEIWUSOCJWSXJDNX)* Ulf FKDFLMNHKSSOD 
RUMFSOLFREFX^YKOTOSSXTODYFRFraLRV^^ 
DPVVrJttI^105SPH*EXJ>EEFSDLOGVUOT^ 
EX^LARCATYMFAIIWSIASVEXCREIINHIEICAIKW 

CPh.0088 106352 108345 

CT288 hypothetical protein 

SYlWGNO*vTVSE©TACGWVIEAYGNX4*VRXTO 
OEVlWQvTXOTOGAaU^vYTSGKI^ 

KHV>uxsi»cnjwinrrpvAsvcxnxRRCOX^ 

GTYHWfWAKARnAQGKECArrVVQRWPXKOA^ 

KCCTITCTTCPFGAGKTVLQKHLSXYAAVDIV^^ 

SXJOWrcilCNTSSMPVAAilKSIYLCVriAX^nfPOHBLDILL 

RiITIX«EEAFPAYI^SRIAAFYERQQAITTiaJGSX«SLTICG^^ 

LAWGAFOHJKARAXIARRYPSIDPLIWSXYI^CVOT 

CSaaCRMEVWEOTSMEI^IYIJU^YIJrcrUXJNATO 

RIFOAKFVTUSPOUARSFTmOSXim^ 

CPn.0089 108111 109466 

CT289 hypothetical protein 

UX3*KK0WYKWRKI*CriYTKITDIK^ITVEAE 

DUOVTLOVFOOTSGI^TCDHVTFX^RPMXCV^ 

EXATPTFNPVCRIVPRS*VRTNII«DvTTOLV!^ 

ADIVVlGGHttTFVDYSFFVEESIOXLOFADXCVKtt 

AVEEXJCNVLVU-nKTAFAIttUCEISI^ 

SITLITVTr^DOITKPVTONTGrcTECOFYIJU^ 

3DLANALI W.YADSRXATERKA#Cna^NWDI0a4AFSELrETRU<SLEW PLEEALD I 
r^ILAOSPTSEEVCIKAQLINKYWPKACLSK 

CPh.0090 109439 110080 

stpD-ATP Synthase Sutaunit D 
\^JOCSMSVOVia.TKNSnU.acOKlAlU-OTYlJ^ 

OAYERIYArAEXJ^IPlXTDCVEKSrEIOSIONDrEHIACVF^tVREVrLFPAS^ 

TPIVLDYMLSASKELVVKKVMAEVSKEIUJCILEE 

IAVrLSDRSITOVCOVKMAKKKXEtJtXASCDECV 

CPn.0091 110074 112053 

dcpI-ATP Synthase Subunit X 

VRUJIHICYI^IOWKAOFFSASRELIW^XSICKC^^ 

l^AiXFVTOESVSVEDIVSEVLTtJJKEIKGIA 

RCT0X3l*FFYITO00»IEXtt.EEKPNWYI^AYOTD^ 

VNEI/JVDLAMiJREIRKRSDROTLYAYRREVLieLCNYDNE^ 

FAVACWVIVt«IKXajOSlXlffiYOIYMERVPVDPDETXPTYLENICCM>t« 

AYSOKOPSTWVrrArVLFTSMIWOACYCIXrWSSXXrSWKrRRKW 

AtLGl/XTICVCTTTTSFFCMSFSKTSVTT*EYSKT>^ 

PSLKA X RDPKAFLLATE XCSAC I ES RYWYDK F I DN I LMELALF I CWML3tCMLR YLil Y 
RYS^ICWI LFMVSAYLWPIYXjCTIVSLIHYLrKVPYEXCCOICrVYCMFCCICLAWLAHI 
QK.™WEEtICVICVroDVt^LRIYAUXJU^^ 

5W 1 1 L3 XMOT II tCLRLNF I EWYH Y SFDOCCR PLR PLR K I VC3E0AEASC I HLDNNS IV 

'.IfiJIO'O 1 12121 1125?) 

•r.rJ(.ATt' :iyin:lusp ::«»b«inU K 

K/IJKAH W::MIOK,W n»ALVLCLAMin^ICC(»«CV,v;HAVM;^ I DEOinKMGMSA 
MM ::«r: IWF I C -KIXMOAA I KNCTL3PVCG IA tCUTVCAALLV^WKC'.VlV: I QAYA 

ksk i vikcyaa u ; ( vks f:Jt .f awfallll 

•:i«JW'M 1U440 11 JO 15 

'.T:<ii tiyi*tf | n *r. •<:.•! protein 

\ijju y.T:Kp::ui-JTcyv:peY:wiAA0L5 tEO:^iDF/Yi^jONArvTwr.LPr.RMRKCr.PVT 

I.YIMVY* WWVEKr/rYlO^lAC^BVYCLWILEYFELO' « t:^«VALC3TMQF.IV3RRH 



HLMMEVLUXZP 

H^vtcvlkn^^ i rem iao iatoawctw^ases 

T '-' V. (»•:-• »• : * v , • ;;..*VPKrrv**Lr 

r t ™E*CHR I P\^«iCD£RVLCYICS3 I PEEVACDFXJSWDDPtTVUTTVrSSCtWP 
cSSY>£?CEltSYXSCKEIU>YX»CEAL 

eLC^FYIuSnliLIKOIJ^ I KPTLfCKO 

SjSwEKEAWPKAiwtPOi*^ 

CPtv.0095 U595« U8790 

pknP-SFT Protein Kinase 

ACIVCLDRIBORSLERYDIVRI ICK0O43EVY^YDP'.T:SRIC\'AXJa IREXHAENPUJCR 

oTtJttARIAADLINKTWPVYTIYSEKI)^^ 

CKTSVCAn^XrKiaCCTIEYWSTCIWRIJLKPtWXUC 

rPYWU0KWIVIIX»RIPSPO£VAinrreiIW 

ESKXJCGSTKVTLTTALPPICKSSSWia^ 

SgvTLgjcgaillX^XIJJ^ 

OtaXSDKinTUAX-EOKNKSl^irrtCTTWL 1 
FE3SG5 IJTVSCl^VPOAFLAEXI.YtlRALVX.YRR XAXS FP<3UCIX*YIARFRAG XTVLEXAS 
Tff>Mgr>rr M A T riT «nOJfflCVAAPIJrYl^ALVYQRI^XrfflEEI TSI U A X . m S0WPE 

inuj®MvvYiajasrnomLAiATOiLvi^xAiwxTPOOEE« 

IXDITVLEXJlSSiaiEX^LSYWSCrXI*M12«Sl^KRAWX)S^ 

sscxoxncx«iJix)KATnivEysTK>LcywAxosir^^ 

YirPLT/VNIMrX FyX? f * t w^r jy.Te-CKVPPi^MPVTJlWMEXRAMIAOttlEKAtJrX F 
EKYTIXOXJCCEXJKEXJ'VLTQCYIAI^OCAEAAIWFTJVCREXJR I FPASUARHYKUfiU 
r ^,,^rppiJJJU3Kri,YrHCIi2WDERIXCCmrrHIXTEEFQL 

CPn 0096 124347 118837 

^i2SS!fl&P^ 

AfiWCRYISTIJWATTrraPOTICVttlHC^PTIAlKONHFSW 

OCFWWCNffreOTIYSFtTSGXPEXXOT^ 
SvT^DXaiiOTSTWJO l fl l VrYTP L TO O tfSPHAL^^ 
. NI^KENCCSFAGNCSSYLYKTXYOAlAnAl^FKIXTPW^ 

SVAWEfflCITTEraKSIJWHHW 

QJOJatJOCEOERTAX AJOaJOCELTCXTYXLOEPS IGLHPCC^tJaiXWTEKLttOOWtV 
T f ufwtWMTSUMI IDICPCACIFOCEVX^lCKPEDFLMgSSLTAKYIJlOltTXPI P 
ESXtfAX^SMIi.TEAJrXtOIXJ0^IRX^IJkRX,XCVTC\^ 

OEKPIO^XJOrEVCXlCaXJKrXllDLKRSORSI PLTYXKAFOOXRXOJASOPMLROC&TX 
AHFSFWX^C^XOCOaJCTKriSQDDTPIPCSECOQ«YKSEVLEXX.YIE 

TATiwiorTXSHPiaKiaaHAxxsxRXjnfx^wPi^Ttswpq 
TLYVLr>EyrrajmmiOALirvxxsLTYXi»nvLvxEie«^^ 

mYIJASCTPKPUQLWlTAIUUAPYIESSLDXPWKSEPPSSPXmiLI W 
!MIDlJJJ^UCLXAIACPGASaOiSLVrDILYASCWIAYAEWPYX3< OGl 
GEAnCCLSWXSVRKCS5SNR5YHTXASALSLSNC3LE)aJAX LCEPFSPLTtXXX-SKTTPO 
TX IOSXaJ(SYlg>t)YVTXTSPXPLGgDIJIFli3TO 

EPAXVXQHTKVSPKNSSSLLSAISVAFSLSSE WXY XSQKXQRJCUYSXiaMCXXNBRLYP 

EITHQ IJ^ SDHPECRCLTCGCRCEXIjaSLEEHKXgUAHYTPt^FFS^ 

UJQJIIUSOPXJCXXTTIOEFX^FCRCSSI^ICMNAIXK^ 

CXCSC&I^AKrniXIWrSlXOXYOEMTFlZSrLrn-XCTXOTRSX 

eLSYTTLOQRQim^XXa»nttJ<IWkKKXSINLTWI\nfX^m 

LVAWWTVIATDRSCSLIPKAIJKAI FliSPCSGPOOC^DSXTrEVCPSVXJLKANVWTEV 

CPKAPLSXSXM^RCSX}RTXXWUIHHXONXJCVSAPLHAL 

nCKQAEIXXAKCTTTFSOLWIDSHPIASSQRSDISTYFD^ 

SSimTIIIICOCXSCSOOTIiSYMDRAinrALEXRPCPra 

ElXHTPIETVALRFPriParOKPUCAIXOICU7ri^rC»)a^Sl^SXICTAXJff 

TPCT PTLFLI DELFSSUP XKKOKI^EKLRSLINSCMSV I Y I CWDVKUJCfiADYLX CXGP 

GSCKOCaCtXFSCSPKDIYASKDSLLKKYICNEELOS 

CPh.0097 124549 126006 

pyk -Pyruvate Kinase 

DSMITRTK I ICTIC PATN3 PEK1>KIXDACMWARLNFS>CSK ETKCOAICFUCELREOK 
RVPL^IMUmCCPEIPXCNir^PlSVSQCOKLRLVSSDIiy^AEOC^SLYPXGlFPFVPE 
CAOVL X EfflGY I HAVWSS EADSUELEFMNSGLLK5HK SLS I RCVOV ALPFKTEXDIADLK 
FCVEOWHCWAASFWYCEDIETHRKCLADLCNPKMP IIAKI ENRLCVENFSKIAKLAX)C 
mX AIWOW X El^VVEVPNL0W«AKVSRETCHFCVTATt>4l41^ 
NAI YDC3SA VMLSCETASG AHPVAAVK IMR5V I LETEKKLSHOS FUCLOOSNSAUQVf PY 
LSAICLAS tO I AERAOAKALIVYTESraPKFLSKYRPKFP It AVTPSTSVYYRLALSC 
VYPW.TQESDRAVWRHOAC IYC XEQGIL5NY0R I LVLSRCIACMEETNNLTLT IVNOXLTC 



CPn.0098 127494 Uftiftl 

No ror^isc hnoolno present tt, <>n*(rkvik/EKBL uu nt U/7/9H 
r.Vr.KKFH0IKRTlLEAPLYYLVr3f;i lAtV.'RIlTI'R.'IFl.Tra/tKRFRPtAFYX I.'3>YRKTAL 
TNUOAFPEKTrOERYK I ARQC LOIIL I ITLLUU AI EOL'A M I OKU tT t VT::.'HWPRCF5 
:;KfW rr,!JEDLEETFKNt£EKOnL ILP'.* :iH.WJWKI.|»KI.V ITKWYH : IAPAKA irNQRUlK 
KtFAU'EVFKUK lVPrKH*.IOUf.tEy\lir^ !KJ.'/ ;f\rt UjWVUK^IYTYPLFISJPAFTrTa 
r-AU^YKTRFTV I AVWJ UU,V\KGFF?/ 1 i ■'. JVK I .YANK SI ,1 -MK K VA r LMI IFlJSOC X A 
:;UI'FV#<WtHKRWPKt:WtKKKYPY::im/;™ 

i WALULEELOEOKPT^.IL 101 J<NhOt/ 1 i JVM TK "f I A f Fl il.TtWt H .YK 1 1 FkKT f 2>"AVY 
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ncncmxLYKXCW 

CPnJJIOl 129986 123141 

ybbP easily hypothetical protein . 
PSTLC WTSOt rrOC PSKTM PF P I T Y TTTPtXEI iLiwwiKnxKrrw nwcwroL L 

AnJtXVTJUBCLKLPr l9JaJ€Jr^lMVArTllTQ9ZZ9UdJSRWi9M Xrri^ Z 
OFVECUU«IYOWEROIGAtWLENXI*FDEYl£FSSVKW 
HDGAVILRGDliAYARWLPIJUtlTITOWRSiejX^ 
SLSRICIXTICVKIORfXAVUlSILSPKEWaUCPLrSVa^ 

CPnJ>102 "0099 131466 

cydA-Cytochrotne Oxidase Subunlc I m . , L _ ^ I1J 

f^rorwmDALiLSRiOFCiJ-iTnrifLFVPi^Ha^M^ 
FWGiFAtTFVLCNVTciHOiFSFGSNWA^ 

U.FtWKVSKI(MHrrSTCMVALBWO!SArWI ICAKSWK^PSCYtMW^^P^TSF 

wcwvpsPTTiDRFiHAVijeyrasta/^^ 

\iaLWSAOTAWWKNQPAIOA^ 

FLVHRN1 KT PVTGLDO I PRIJEWPWOAVTOLYH LMI KLWCVKVALTL ^SVSAYWC^rfKWXt* 

KPrn.VILTrSVU.PEICNBOTCAAJH3ROPWVVOCUJCn^ 

FSLVFI ALLTLFITVLCKKI KKGPEEENOLTEFEVK 

CPn_0103 131465 132511 

cydfl -Cytochrome Oxidase Subunit II 

fraGIFKEX^TSIAPLAWmL^WAYSrcOC^^ 
PVWKWEVIfcVIIVCWAGFPACYAT^ 

wxiPwoiiricsCTMsrrt^ivcNLiunj'tJPorsYASi^^ 

3AFAIHGSCFAIJ<RTSDSUttRIA0QFPYILSS FLVTYVLTLCASLISI PKRFOATPTyP 

^LLIALTSCCCVAAJCTSVSKKRYCYAFIYSTLI^^ 

SYTIYNSAVETKTIJCSIAIIVLIC^FIITYTCYIY 

CPa_0104 133884 132676 

CTQ17 hypothetical protein 
EKSMMOaiSKWLCTAINSPAIYAADSOSOT 

OCTIIR^KQDLVAVKESKDYYVISAPPCITCTVrRSrVXJWVra 

APVLVRXJflGTOIOPASOEPNCTCWLEVVIi>SOC^^ 

AMDLINSAUIFAHin<£XSLNCn}lXAIYiaCINLVQSEEFXDVPGZOC^ 

YLSKS1XS0OTSIASS0CSTPKVSSSEVTTSLLSW4IIUCOT 

RIWASMQQGNDHSEJ^TQEAFYRAEQIOOC^^ 

FLYGTS XljLfiQWIAKRVTVGCtJRPNNHFAPPAYYWSIXEAS 

CPnJUOS 134863 134029 

CT016 hypothetical protein 

YVPFTIXTSNQNPKIXIYCXXXEIHL^ 
SOIAFIESQEGENKWTUUJiLII^ 

OVLE»tAC^RVUGNraTU^RPEP1{CNCMHCOXC1^^ 
0SCDlO.YXV™pUiPS00FSVYI£PPI0CTCCEPtCCHIKA^^ 

CPn.0106 135073 136374 

phoH-ATPase 

EKVRTQHMCrMNaDTSVriYOPEALFSFOrrRI I X PPPVIEEtXAFCKFRDCSAIOiASRA 

ISNIRIALEHAJCTTWTrOTAPSMEUlIEW 

VTWLCRRVRAEAfcQIESRDYESKRrSniSLYra 

^SF*JEYFFHSAGENHFAU»YYVSECKIIAIJCAKW^^ 

VKLVTLIOJACSCmLALAAAIWKVITJI^^ 

HWHOPXYCtIMC^FSX}CHCNS5CALOAIJ<DAJaa^4EALTyiRGRSLPKAFXXXD£AQN 
LTPHEIKTI I SRACKCTKIVLTCDPTO IDSI.YFDENSNGLTYLVGKFHHIAL7UHMFKTR 
7ERSELAAAAATXL 

CPIU0107 137321 136392 

CT05B hypothetical protein. 1 

KKSPPPVTPKE I PT0PKPP I PORPEVSPTPTDH IVPGS I EASPILGKKPSPDSMVSPLSL 
FHKMLLENWTPVEEP rPWP PAEKNQK X FAWALNOSK L X FVSTSSN IAQPRLVTDSHSMM X 
VNAANRT»GROCACTWOVLSAAVSVDSVCLSORPLN PEAOCTPLNBCCCRACMWRNADCS 
NHTCKOGKPHYLAQLl^PKAVDHHNKSOAAFORCKNAyLJ^FSLAOTTjCVrFLiOrPLISS 
Z I YAPPENRKKPNSEENKVRMRW I HAVKC ALVAAHO EFCNEFO/TDRRKL I7LT0LKTPA 
rT0PKKK3HL 

CPn_0108 U7B87 137303 

CT018 

KNLFHYIO^ILMSIFKEEVFIISHRKTPlCOTSTAUWrPLVNPU<RTNL0RIASYIPXFS 
TF[CXKTUCCtS2t^SMVUfTCKFSS^KTLPCPCXyCCLPKVRXCAWl£XFCIKALYY 
LVLCVl K X X KL I VRYLCPCCRPPE PR EPQNPLTPTPLDMGQQ X DA X FSTPT3PTG FKDP F 
LODLLOEOKXKAPHL 

OPn.0109 138646 141783 

i leS-tsoleucyl-cRNA Syntnetas* 

RQiOfTAQEVQVlSFAXKEEGVLKFWKONO I F EKS LONROGKTLYS FYOC P PF ATGLPKYC 
I ILLA3T IKOVVCRYATOOCYYVPRAFCWDCHGV^ 

EECRK IVFRYVHEWEYY tNRICTn^FSSTWKTKOWFMESVWWVFOSLYW^LWB^ 
WrFTTAl/TTPl^FEAJONYKEVDOPSLVVRMPLO^^ 

• II-rTL'/YVR lODKH.OnEOW I LSODCVSRWF :WPEEFVttXSFSCKOLVGRTYE PPFTFFOS 
KREETAFkV lAASFVEESECTCVVHMAPAFCBf^FLVCKENHVPLVCFVOAJI'i^FTEEIP 
VYtfX/Y I Kl IAOKEI I XFLKKECR I FTICTVKIIRYPFrWRTCTPLlYKAVNSV/PMVEK IK 

i 'KMUtAN.*::: ihwvpeh toEiWFrjcwLHWRDWAi^RNRYwrrrPi r [wxjAt/;ciLwnsi 

KF4XELT* !TU ITU C HftllP I DCU4 [ VKDJKTFH R I PYVFDCWFT>>V.AKPY Afcl rHYPFENQK 
KTEEAPPADT lAEUI«^nWFVTLTVI:UII.FORPAFRNAtVNCI I LAEC/;NKMSWa.N 
NY R;t^VVUW^UAI,RIALUU^VVKAE0LHF3DKf;i BTJVtJW [LLPLTtFA^FFNTY 
AKI.Y*SFni-K:^n I LTAYTE I R>* I UWLY: Wl :KVl<F:»<{ajYHLNKAVEPrnT I0OI.TN 
WY llckt 'lilUihVEAFjyrWlHAAFjTtLYEVLTVFlTKV t APFVPFt^ED I Y0K LKLEK EPES 
VIH/rWI WKMWC K JT»t «KRMJ IDI RR I VCUill;:LflKOU({J(VRyPlAN^rr/CSKDflL^ 



KiiviKRCMLjM rviuK lv K"i ' . u--.v;r:r.;*Fr :r.i:.^.*.-iHY; v; jr-»-. .jjim; 

MnDWFNOSYCRL: FPCTSJfTCCFrOHICEWOOEPWUCDPKI^PVSYAmJOC 

TrLPLRttHLiS-IRNNLTTSPFIVAOCCAYTr^ 
g^^ICFCTIRYRUa^PLTQLH ^ 

FW«^YIKOSPWXIO«PTWKrVTSrmOESSSrP0PYIAFVOID^ 

JSb^WiCCHVLVl^rfPI^RffC^ 

PTTLSCn.VSCIAiATCl^LICYVYYOKRRJUJ'Pia^^ 

CPIVOIU »«" 4 

CT021 hypothetical protein 

olo^pimpWstyfe?.:lokyi^kxo^ 

TOQWriAITOOrrVI^RCSORTFVLVKSTTPfa^ 
VI^LESPSOVTVVSI^SECSOFFSLhrrRTKSLEPVC^ 
IKTTCKENXPWSPKVSFrcykPLTSISVNAW 
W^XETPXCTSIVRAVDICHCATSPYVYSLPOSKTO 

CPll-0112 144743 145093 

aatB-<Petll2> Clu tRNA Gin Aaldotranst erase (B Subunit) 

DSDfXWiOOaVWI^YROVIXVTCSTCYKFV^^ 

SHPFFTCSIOCrVOAttRVDlCFLKRYS^ 

CPn.0113 145329 146405 

o£rA-Pcptide Chain Releasing Factor IRF-I) 
CilOaQCVAEYtMRiJkEVEIlUSNPEXFS^ 
A£SKOAIJaaa3PD4VVWJEEGI)(ENKVE^ 

GTlBrVQRVPirETOaWKrSAITIAVIJEPSEEDrrtXLXNDCD^ 
VTDSAVRri1<LPTSVVVTCQI}ER50HK)flCX)KAMR lUCARIRCAEMOXRHNEASANXlAQV 
CSGDRS ER I RTYNTSONRVTTJHR ICLTLYNLDKVKECDLDP ITTAMVS KAYKOLLDOJ 

CPn.0114 146371 147261 

hemK-A/C specific methylaie 

VMPTTSYSNKE IKXAlO EX77 AYLOrYGVPLSIXrEAL Y IUOLLEVS SKAKLFtX.VO ISTT 

»!IEyiUCRIJaJU»RCPTAYXNSAVSFXaJUJ(VK 

EXOrraiCCGSGCUXAXXXSCFWEWX^^^ 

APYTRPADAJVCKPPYT«STNEIXHIDPEVRCYEWKAX»VOCSTCtXF 

SlYWSWtXICSS<>SE5Il«tFS10CIYCRIi«I^RDRlFFXJ^^ 

CPn.0115 147279 148622 

ffh-Signal Reeognition Particle GTPase 

KtllSWW^IFSFLVSSRMNEINXSESIREVRIALLD^^ 

CZXIWXHVSI<»0FCTCU<Eg^Ari^OCREgFTIOrrPSIIUTO 

WIKNKXAJOCVLVWOLWAAVDOX^ 

NSHSr/It^ACRIMIXMEUfESLTAIOXVSOANDUJVmVAf^^ 

LT^VZX^tfTt]CDARACAVFSII04VUKPIRntXCERXODLRSF^ 

NFVK1XREY X S EEEDA£IuCJaa*VTAAriYEX?YYKOK?CAFRRMG PLRXLLOeCPOFNHARP 

SOKSIESSEQOKKRTEAX Il^MTTEERJCELVEIJJKSRMICRIASGCCLTLOWNOFRJtOKS 

0SKxrnamsnsiQ4E0VRnQ<5Qooffi 

CPtv.0116 148592 148972 

nl6-S16 Riboional Protein 

EKNVRRKSVALKIRLJWRRNHWYTa.VUkD^ ELLCWYDPKS5XNY0UCS 

ERIFYWIJ3^CAOUSSXA£ALVX0CAPC?VYSAUJKOEARXLV^ 

AAXDATK 

CPn.0117 148983 150071 

trnD-tRNA (guanine N-li -Me thy 1 transferase 
TCaOUDII^LFPCTFtCTI^SILSRAIKCRIXD^ 

KLLMAEPVTSAX RSVPJCEMSKVIYXSPOCAIXTAEKSREl^AASHLItJjOCHTTCIDEI^ 

XESCVDEEXSXC0YVLTNSGXAALVLIDAV5RFIPCVljCNQE5A£RI>SLENSlXECP0YT 

RPREFECKEVPEVLLOCDMXA I SOWRLEOSERRTYERRPXYLNYLYKRS IDHKFOCETT 

TNRWFKCDXXSVVUVNXlJCRAKTfrtrCKVFCLi^SCE^ 

KKJ4XVTUULIXACEECFCYLXJlRWEtJX23XIXEKOA^ 

RMK 

CPn^0ll8 150075 150464 

rU9-L19 Rlbosomal Protein 

KKE^RWYIKWLLXELECEOCRNI^PEFH^tTrlRLATKISEOCKERVOVFOCTV><ARR 
QCCSCETVSLH RVAYC BOI EKS FUMS PR IVS [EIVFJCKVARARLYYLRCKTCXAAJCVX 
EFVCPR5SKX 

CPn.0U9 150520. 151164 

rnh8*Ribonucleass mix 

LKNTS IS E 1 0RFLSM X AFEK ELVS EOFSWAC I DEAC RC PLACPWAS AC X LPKCKVFPC 
WOa<KWPR0RA0VnOAL>CDPEVCFC ICVXSVEP. IDQVN I LEATKEAMLOAXSSLPIS 
POILLVDCL'nJ^HDlPCKK! lOCOAKSASIAAASILAXEHRDDLMLOLHRLYPEYOFPRM 
KCYHTSUIVEAIRRYCP.^-HRICSFSP IKOHCAXV 

Cf*nJH20 i-U25 1«>I77R 

qrak-rjiP KwuiDf* 

EfXFT^IKAIT/rrYCHWILXT^JPFrtPWiyKrCl'KLFTISAPAfJWKTTLVRill^OEFSS^ 

AtTl.'^rTTnrPRECEVPiavi^HFVilHEEFXiiRlJ^ROALUWFLR 

:U/;KIIAVAVI0 ICt-NLF IR.JRMPW:: I F f ArPWEELERRUVSRtWEEnSORKERLEMSL 

( El AAANOKUYV 1 1 NDDlA^AYn VI XV. I Ft AKLliPMlI. 

r:iti.ui;u i*' I7*«*i i*.:wm 

t.TO»l llVIAf rut mm I viiin. ii, 

W I £ M I KK U< FTN KK UOCl.FI ■: If ■ F: : I .Vt U A I K'JAX IK \KYt'AN\K'.'i WVA I tTI .VLLOREC I 
';l'KmK I V/TA; ! I TVtiXKK; tta ITO SPPKMn yiYTWriOVK 
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'kvmpokvl 'TCAm*K:rni<rwivscv7U'ADsr^AftnwixcoovLY icggoefgxa* 

tD^«LYnroWRFlJWYVBCTCPRCCn)HAM 

^V^VkSto^FLLDRKKIJALLSF lOCCYLPDHVRKFVVDYtEHVRSRAITRXZLSWCI 
PVTOH^J^WF^ 

LFCA^COXuj^ISY PE^VAnrtMlJPKSLENCNLOTMYARCLWKEEXLDVINEE 
FHUUPRLLFTTVt 

CPnJU23 155775 153774 
recD-Exodeoxyrtbonucltase V i Alpha Subuniti 

^SNTKYFO I HSYOS PLLYEYRGVFH YLTSKL IRC I ^ PK ^^5^5r»2Sfr^ircVrirTr 

ERt^EVSGISETRCVSICICOXEOXXLWCTtJ^ 

mPFLLAKEHEGIGnCTAOFIA^ 

owAio^ovFiyrpiTixEraTOiu^w^ 

LXRrLFSSRRIRSireEXAIAWVEENI5IDLAOT 

ITOAILXI FEQVTHK: ILAAPTGKAAWWElTOXKSNn'IKAU^YOFICrKSFWOOfflW 

IDCTLXTTOESQWDTHUJOIFtJCALPCrnTLVFtCOIHOLP^^ 

V IRUOC IFROVHOSGIVrNAHRVNEXjEI^ Z ^SETCTKOFLFFOIQIXJffi^LW^I WjVT 

KFVWKYHIYlN>DXO*n*PMIOC^ 

IRNNYWEVFNGOIGYVSTXNFEDKAVV^^ 

ESPCXt X P IHTSHFMMLYRKLL YTA ITRCKKLVI LVCTKKA I Al ATRNNRVOHRCTGLAE 
VXJCELDXIQQtYA&L 

CPn_0124 156575 158068 

No robust homolog present in Genebank/EHBL at of 11/7/99 

I RSKQRTVA XTLLVLG ILLIASGI XFtAVAIPCI^AVAIiaiXGKrAI^IVU.rrCLVL 

LIIISDCIJ^IXOVEIKCJARTIWNNKLDOLSC^ 

TM^DIEEIFlTLRDIRNAl^NEEITHniAICOCL^ 

RQHIJINDPRWSMlTKKVKglVVRFIYVS'rMYXQIKSKF EJU»DFGQLRXKIX>ft/YKTIEE 
VLYOSFOKGYNRAALLSEOTRI IHTSSIXHWEKDn3XHLyiKNECASRI»ENt f KXi''RTLFL 
CI^EETCDFTGASGWXSia^RKEVPLrXXaa^^ 
' QEEDPLDRUfflQVEQEATSVLKDQDRYWUnETSEAKFfc^ 
3DHX^/WA1JQLSAAEI^XEVTIM)EMNR£^ 
LEXEEIPLVAARNSLENS 

CPn.0125 1S8072 158605 

no robust homolog present in Ganebank/EHBL as of 11/7/98 

KISSCAZlMSEWPtJ-IJOraFDIATQRF^ 

KCFVKRCI EDFEARCLCVLKEELASLTRDFH DXAKAETSXL I ECPCIGFYYS XHOEEQRQ 
ROERWnUERYRXXICaVLEAVOVEOICEKX SS RVWDOSYF FFFXrFOK VSNTUqCEQO 

CPn.0126 158806 161085 

No robust homolog present in Genebank/ EKBL as of 11/7/98 
LLVFSYYOCLFFFSGAIS SCGLLV5 LCVCLGLSVLGVT T T T f AftT IJ.PKIQSHLREVPK 
APDLLDZ£X2ASERLRVKAS R5LASLPXEX SOLESYI RSAANDLNTI KTWPKKOQRLVETV 
SRKUIUJU^ONYXISELCEISEXLSEEXWLXLAOESUMXCKSLFCT 
HI^EVIlPYLAVNDPRIAEXTEESWEVVSHFINVTSAnC^ 
EIAETTIYKSLKRSYlUn/XXSE134RIIHl*PL 
EKTFTWLDEirAISYHIEWDFIJ^IONXW 
PTTEaiaLQOAORAFEROSOEFYTLEOTETXVR^ 

ma^ipy jnf c ygicT uk mvBVQKPQRt.wffiT IQRNlEOtUtfKGESmijQNRRIIG V KAiWDA 

IEELLSYESRCILPIRENIJ^YLOYNKCSEIl^XAKFFrPEIH«lXVSEA^ 
KOVOGKCOERAOKFAI FEXHXOEOKSUKEQVRS FDLAGVGFLXSELLS IACNLYI KAW 
KESIPVWPCMOLYYSYYEONEAVVTtNRIXNKrERYONFiaiSLNSIO 
P EGKETRLKXRELQ ETTLSC KXLKVAOORLSELESRLS RR 

CPiv.0127 162152 161130 

yztr -Cat ionic Amino Acid Transporter 

ES FMFPSANOESRTRNVPLG IFHGLVACLYWC IVFVIPNFLGSFGDLDXVLTRYT I FC I F 

SLXACAXKNPSVIKOTPLYIWKSlXWTIilNPVYYFGira 

AVLYHSM7<OTCEIJnfSLLrAISSVIXTC\rtLTOLSAL^ 

LWVIWIRWSIAEKHPNLTPDTWSYLIGISALIICLPHIIILI^ 

CSEREX FLL LCSAHG X F5S AXAL I AWNXASLNLSPALLGA I L X F EP X FGLVLTYLYSOS L 

P5LQEG XG I FLMLGGSLLCLVLFCRKVOKS LENSQVSSSNE 

CPn.0129 162262 163053 

bpll-Biotin Protein Ligase 

EDRGRMLRNOVLVYCSEGVS PYYLRHT IR FLKYYSTOEGAFD ILRVDGNFL I KN PFWEET 
TRLLVFPGGADRPY>mVLKCLCrrAK:FOW^ 

ORARDLCFFPGTAM3PAYRGNFSWSPSGVRVSPOLFSDFGLCYAMFNGGCFFEGSEGYP 
C7/tlIE3RYDDLPCKPASlVSRIVSKGI>Vl^PHXEYLPHYCRMVKEW0KTREFL0RER 
TTLORYCONLVORLROPAFSXAOC 

CPn_0l2'> 163747 163064 

similarity to CT036 

DEQY I LSH I HMOPRX FVTSEPLQKTYOKLGEKHVNNLG I ASOV2LTDLQNKT9YENNLI E 
TTTNE ITY Y FPWHNPDI LRSEWDP ISNQLYL X FKKFF X HYHNLFSTALERNQI LL IDS L 
NTfiSSNP t ARGMELLAFUCVFEOLOYNEO EYT XEPR DYFNRFVYKNSOTA PQ XO/SFGLLH 
GYCEICr/ASNN IRNVLTM3IVLCSPILY0L CTEFOTTKIHADOFOCL I 

::Pn.OUll 164251 16J751 

No rotjuur homoloa present in Cenetunfc/EMBL .*» nt 11/7/os 
.:^MVKC:::; I J HENKKPACI.LPESKFAA ITKLSLAILCLFLC IAAC IL t ALS^LLPNTLL I 
tALTXi;:t P/T^m;i;;Ll.r<TTC<r:;KWKDE0KPK.:iFPKETP.':LOPWLLNPIJWK IQ££ 

(-TiaxoPT:; f nlknfj.kfp:;feemxk iflxcpofl!k;^lanwk ile 

iPiiJiMl 1M441 li.'.'.Ktl 

r*» fAuKit it»*i.<i i<t.<Mit In *ton*il*ir«k/RMIIl. »m ll/7/*»M 
K::;:LKr>»K::U*.lAl Fl.l PFFTr-AWFP:: |(.TIJXFH»IAM:S^»VYM:r:wi IJ(Ti^/AO 

myyya o w ; i yvi 4^rr:^Hi t r icu ivtaf t Fr^YLi vfvi.t i au mm i ::i »vi >ll irsv 
u^iHvrm)*!':*:* jyank/iiw* uvt jivxhwj i i4>::i-Y ikvuaij* j-.i n<i t'eop*JOAA 
'ji4i«;iwrKi-*:5tvi vf^i j.i?:i'vkk»xy i w-'/i^n.iR i KRv::i.i.vH.:ArrLOOUC£0 
• vMHJaMKW UKKtNKKAt<KiK; low J<he rH::::t£rnTVPU>i-.ai:pov:?jAMpyvYRY 
i.i<i>m.tt::ki j« tiiij Krc;nwiN •uv:FOiPKW«M)::pf -knvwskf t jai;kk 



ALUCNPCClSIROUCOFLVi - 

StSa^htwacT:ew£wni^^ 
xmaryi^PNtTP^ 

CPn.0133 ^llHin U6564 

ICIHNPLASERF PYAAKEYADVRFSSXS1ILLT1C0LEXSELI XNGANFTIf PYDSMOI ET 
l£sLVW^POXETPSNl*IDRAPNrt.I 

SssaxelpkSealpsix'A^^ ikpi^coakxhfyssypcfodri^ 

OXNTPCTPTEEI IGF XRGLFFN 

CPH.0134 « 7 « 7 

?AI7YRKLRRTTMAAKNIKY^ 

TimGVTWUCEI n-EDKHENMGAOMW EVAS KT ATKAG0G7TT ATVLAIAI YS BGLRNVT A 

g^SSStcxokawvvvoeijociskpw^ 

QScsnVEEAKGFrTVL0VVEGMHF>CTGYL5SY 

E^ILTGSLVStnjGKKXBnTIJWCt^ 

TOI^BJSTSDYDKE«JOB«JUU-SC^ 

JVLPCOTAl^XPTlXArLPKtWJEI^ 

QVt^RSAKECYDALWUYTmil^ILDPTl^ 

KSSSAPANPSAGMOY 

CPh.0135 l^Ui 169143 -.^ 

oroES-lOKDe Chaperon in 

MSOOATTLR IXPLG DR I LVTO^EEEEATAROG 1 1 LPDTAKKXO DRAEVX-VLLj IXJM. 1 LLCT 
LLPFEWCOIIUWAGOEITIDDEEWIU3SSEIMAVUC 

CPIU0136 171419 169569 

EeV^SLKITSffEA^PTirTO^ 

SPCHYOIDNPESUJIiS«FSVERKri»LYIYAHLIHDOOrrrn»tt 

FSOEISWI0PAl.IAli5EEICVAAlASSSVLAPYRFYl.IKI FRLS PHTOTAUKKIIASSPA 

AUA^NXAFSSLSX^IPFGIAJCDSfCEEKPUH^ 

YOYIU«rTANXlJ4GXVQAKLPXAXARmSCXXASIJOm 

RYniLWCEALjaJCEFHFYIWAPISOTrSlWSYZ^^ 

LSIOOIVCRYtNiefgCRSGAYSSCCYOSAPYIU^^ 

OPYHXiAQYPlJTJiEIAS'IFNIitiXKiALSXStXJSXn^ 

AfrrXIKSAAlOGTPLTEEFWTYGlMKErifGCVVTSOSL 

OYATCI XAAWFAEKXXTOEPGAI^YUCFXJCS^^ 

ArrrnaoLLSsixsED 

CPIU0137 172263 171502 

ybgl-ACR family 

yrc^Mtr^nT T ^T.TTr^gglTTFQnYGP^I^VGDPQTPVKXIAVA^AIttJriKPAVAAE 
ANVLXVHHGIFVKGMPYPXTGMIKXRXOI^IEHNIOUAYMLPLDAHITT^ 
NWKDLKPrGSSLPYLGVOGSrSPI DIDSFIDliSQYYOAPLKCSALGOPSRVSSAAUSC 
«Yl^SAATSOVIx:FrrCNFI)EPAWSTAI^SNINFlAFCHTATO 
FPISTTFIDTANPF 

CPn.0138 174094 172700 

• haoL-Clucaaeca-1 -»araialdehyde-2 . 1 *ammomutase* 

TTaRLFLAITOU^OWWKLTTO^NSHLNCSNOKH^ 

VTPPIVSSACGBIFIJTHGREriOFCGCWGAXXHCHSHPKIVXMOICTAL 

EEILFATHLUSLiaKEHKIRFVSSGTEATIffAVR^ 

LGGISTTEETIDNLTSLIHT PSPHSLLIS LPYNNS0ILH KVMEALGPOVA5I I FtPXCAN 
MGIVt*PKA£FU)DIIEXjClCRFGSL£IMI)EVVTGFRVAF 

CLPAAALVGHRSI LOHXKPECT X FOAGTMSGNFIAHATGHAAXOLCOSEO FYDHLSOLEA 

LJYSPIEEEIRSOGFPVSLVHOGTWFSLFFTESAPTNFOEAIWSIJVEKFOTFYS^ 

VYLSPSPLEANFXSSAHTEEKLTYAONXIXOSLIKXFOSSAORFF 

CPn.0139 174686 174093 

SPTWiKLRODOC I PYARLEXGSLLVASPOINOGVFARSVI LLCEHSLNGSFGtXUOCTLC 
FEISODirTFEKVSNHNIPjrMCCPLOANO^lXMSCSEIPEOTLErCPSVYLQCDLPFL 
OEIASSESGPErNUFCYSCWACOLiKEFI^NDWFUiPGNXDYVFYSEPEDLH^ 
LGCKYAStSTVPONLLLN 

CPn_0l40 175140 174673 

yqdE 

PRS NOQC- X FCM SLEK ELL EETP LVLLNFYXLV S TCKfACM I LCT EEKK FA I N"GHVS>COA 
FOGAOTEGHSPORPFAHDLlJiFVFSCFDIOVLRWINrTYKDWFrTRLrLEO 
VOVDARPSDS I PLALTH KIP X LCVKSVFOA WPY EE 

CPn_0141 175817 175H0 

rplA-Ribose'S-P tscmerase A 

HSSSAVETDLHLHEKKCLJVH EAATOVTCGMI LCLC3CSTAKEFI FALAI tR tOTESLAVHA 
CASSOMr/ALAKOLA 1 PU^PEKFSSLOLTVDGAOEVOPOLRMIKCGGGA [FUEXXLLRA 
AX RSI I L'/OESKLVPVLGK PRVPLE ISRFTR5A 1 1 EE IRHLGYEGEWRLOOTGDLF ITBS 
:^[Y0IFSPNSYPNPERrUJai0IHO , /IEVOP/IERVEVWSSNSOCLI^KKY5V . 

CPfiJ»MJ! 1.>121 17SHK 

No t'****:: 1, »*«imiilt».i rr»*?:*nt in ';»;n*r»«nl(/l-:HBL .ir. nr n.lftn 
r;Hr:Y::K::r.TLEKFiiFK i wllctkn:; i u/fc.';hfe xrnvni iona iok i R:rk*n x p t aen 

(>. I NTl-TFFDLK I DYPKCWKRFP FLYn t'& I PLI IKLW^r/T 

•:rnjji4i ir:s47 r/'.2J4 

i*nr in rnxicnrLK: air^r^FLRPoiLrrTP E"LKDr;r; il'lwlmu t ku i a i oolikk 
tjHMt w rrOi:E^R.\Twi iylrwimk :i» a »Mi»ATBr.vFFU ;eram i nunntm iuv.t 

Hllli^UIFKFVKALEDKFTTAKOTIiPAI'AOFLr'^MIFMIfl I EVTR X PY PTJIOKI » I KO f V A 
< JY H KV I H WiYDAtX"KYJ <M J .THTr IJ ;j\ , \M*H " .W^TJ I DF.WAJJUl • t OOYI 4» t NNLV IAD 
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p rooLwm j r/TRCNY»oc ft A: r.5VOF t af ru . "*°C7Ii2^H2S5J2Jt L JK 

XEIfTV.-LGL'/TSinTTLOIKOEV tAR t IIOAAOYLFLERIJL-POCnFASCUCNXLTK 
GQWAWAI.VXe 13EEWK 



«^>fLtJUPVSLL«ArK ;EF*-LNrTELN t WCTKAAttEAKRYVtEJCCWrrnf 

S^^iAAn^^RNorTi^ccE : p lanuc : - : fsdlnlrscvsvexaauec : c 



TPn_D144 177142 
TlpQ-'.tO Pri'diS ATPas* 



tS05»»0 



fiP^ljrTAVKCV^EW/EREW^ 

.... j. i. •i.i.?i>4;.»».«i'r!;ff •«;.: - "■tw: yArJ-:.rvn-.rr. • 

AANUJIPALARCTLHCXCATTUICYOICYX f3CDAAL£RRF0Pl JYJ5™££^^dt'mdE 

EKYEXFHGVRITIXyUJIAAVU^YIPD^^ 

KERELA«JVI«»IKTOSPSYOtt^^ 

KKJGLE3MXFSEEEAERVA0YNRVAELRYSLI WJ^"!^f^SS?Sii2S^ 

RLIAQWA^IPVOKMIXGEAEiaAX^^ 

PORPt/^FI^PTCVCXI^LAItAlJu^^ 

VCYEEGGSLSEAUWRPYSWLFOCIKItAWEVt^ 

FIKTSNIGSPElJUJYCSiaGSttTKEAILSWSPVt^^ 

aiVKIVGIQMRRIAQRLKARRINl^WO^ILr^EQC^ 

3 KALLKG DX KPOTS X ELTMAKEVLVTKKVETPS 

CPnj>145 180717 1B2369 

ILTLG^CFSPEEXSLIOKI^PGLSU^ 
RADYYSNCIJILALRIHJlEWRYraSPC^^ 
EHFSDWSrLSSVTDRKTGVajCroSLYTSWOA^ 
TTAGCRKLPTASYCirtJL£3LCVRTP 

Yt^OEEUJEUJCFVoiLcaaanrau 

PGSOTEEXASYXEEIJCWUOCSSMKriaOR^ 
SLHLRIvCXItJCDRJlEYTKALKmXAnU^JEXXpF^ 



CPn.0146 182595 183095 

no robust hooolog present in Genebank/EHBL si of 1X/T/9B 

I IVCISMSSSEVVFOTVHGU3rOC2^KSVVPFKKSLSIlAPRVVCS ILVLTLCLCALVCC 

IAXTCWCVPOTLM3GICAXVLGAISUUSl*FVt^^ 

OSPSRAAPSGMGLPGSCSPRASTPSCXXELOAEXO^VTQAXOQKSOO 

CPn_0147 1B3213 183671 

no robust homo 1 09 present in Genebank/EHBL *» of 11/7/98 

KGCPKAVOS IWaVTSAATSVCCVNCSRXMPArWI^EWTSXARSVIAAXX AVVAX SLL 

GLGLVVIJfcGCCPLGKAAGAITHIJjGVAlAAWAXLXT^^ 

SATPPLECSVAgEACRCGCSPLTQLDLKSSAfiS 

CPn.0148 183122 185702 

pknl-S/T Protein Kinase 

SOPFMEAITOWVKLAKI/WPGXLiSIENVSESECRCFLVT 

LT2X£XVDXVSQLASI*IXYVH5ECIjA0EO{NLD5VYZHX IJCVPKVXLPDLGFASLIXER 

ILaSFISOEINRiSKIKZRVlXHTSBSlCCXaEinYAFCAXTY^ 

rSDFIYXWFI^SSCLSCFKEglUUCELFPLXIUOCTlJSEEI^NWTHCt 

SSQNLPQAVXXVCETXVStSXKCSAEKLEFVLVEACS X DEAKDTAXESESSSGVEEEGYS 

LAWSXAVREFVVSRin/EAEJCEEPKPgPXLTDIVL^ 

HSlTIJWrVTNBOrtRVl^CCGSBOOlCryW^ 

C^/TWYGASCYAEWIGKRXPTEAEWIAASGGVAAIJl^^ 

YPPNPYGLYIl<AG>IVYEiCOW*CYDrreiS^ 

LRCAKRHRNN PGAVNSTYGFRCAKN IN 

CPn.0149 185706 187700 

dnLJ-Orw Ligese 

ERFKKEENSQAHYLALCREX^DHtJYSYYVWRPRXSDYEYEKKL^ 
WSPSTRXiSORPSGTFSVVSHKEPKUXANSYSKEELSEFFSRVEKSLCTSPRYTAnEXJCID 
GIAVAIRYEORVLVOAI^RCNCKOGEOITSOTRTIRSU«UU.PEI^EFIEVRGEVFTSY 
STFOI INDCQOOtXXTIFANPRNAAGCTIJCLI^POEVAKRKLEISIYTJLIAPGCNDSKYE 
NU^LE>CFPVSCKPRI^STPEEVrSVUn , IETERASLPKEIDCAVIKVM 
ATGIQ4YRHAJLAYKYAPEEAETLLEDI LVQVGRTCVLTWAXLTPV^ 
DEIHRKDIRIGtTTVCVAKOGEVIPXVVRVCREXRPEGSEV^ 
VSVRCVNPECVAGAIEKIRFFVGRGAI/JIDHLGW 

MQIPG IRERSARNILESIECAKHVDLDRFLVALGI PLXCICVATVIAGKFETLORVX SAT 
FEELLSLEG IGEKVAHAI AEYFSDSTHLNEIKKMODLGVC IS PYHKSGSTCFCKAFVITG 
-'-ECMSRLDAET A I RNCGG KVGS SVS KOTDYVNrtCNNPGSKLEKARKLGVS I LDCEAFTV 
LIKLE 

CPn_0150 187759 192444 

rrri47 hypothetical protein 
irrnCFFYSYNCPYFISFFVUXT/NMASSSWSTKG^ 

N3LTPEAQTSRSWFSOR10<FLEVLOVSLED(ENNOLFJCYSRYKTI ILIATLVTVAITCI V 
PISMVFCIPWVPCLIUTl*Cl^AFLSHRWSKCKEIKLRYRAY0IYR00LLSOYPDLR 
K3TLYKYSITKVXPKKCrVCKLVTNLRPDU<KNKDOCrJkAACSR 
L0V3C>n^eW0RLASLXMSVKNDXUOA^EPIDFJ«RSALVVKX0iaCEX0PCCIL 
D I SROI LAICC YC»WC\flEAKKAIDQYKKWYUCSTF I AWNPOLPAIAOSYLLEOORKLD 
YAAKIFQDkSALTTAHGTCQALEX3XJSl»LCTYDQLI ESKCVCEKI XASXH0KHLDLAM0D 
: ^OOeMUCKWSNLYHVFSITIKEFTBCaCLBONEWSR IQRLRCKLEKSXCS I LGNCRTNA 
EY ATK 3 EX K LADY LLQ IGDREPFLTCMMKA I ATGKAIQGKVECV I SOHPEXQ IMMLRCS I 
EPUX»U)REDIir^IU)KNEDEVlJUJC3TMEA0W 

/^cm:^r»NLLNRLEVUiAESSTDDLVLHVDRMSEDUCKTI EE I OCNLF0VTPEELS LL 
AI*ir/0iUM4ELrLIVOECNRL0EAICrESVSOCDU4JCU><RDEKtNXNTESSRKNLVA 
I AfJAK; :a\J<N t D: V^LAPLIORNRACLDN I L£NMYLFNG3 t RN IHALDTETLVAT3SNN 
I'.tAMUTFTWN | YTN!.LDVLEtQSKPAPAPMENFDL FIALPEEVOtWVAEDVSCTHRLHHQ 
Vt.rr<Ri*AUt.KNM UVQLMS [NKVOiAKAIVLC IVA7LFCVL'.A IFIOONILSLLILSCVG 
t A.t,W» Ml. I FDU I :»K jKETEKQVLET*>OnL I TATK I LPSEFNNK0UNRLAKLO0NLNLE 
. !P IPTWAKN I V::01£l; I mUC:;i JCDLTKEFRKD-.:Kni.NKR I KRRFKBCLi JO EAPWRPT 
U VOI Rv 1AKVFAKUIRE UaiLQKOKEEI J I RGOALTVEPMCLl.' LEK3KYnNEKAHAAAKT 
KKVT.KU* I 1 Ml a«NNLTYVR I WFFRTL lOEXLCRPTVOE t DWKEAKELHELAA I IYC 
NTryX:VW*<AKKOFKU*VLII t At;KnOl*Lt£AYL:rn , A» i I jCHI K<HQA3 FR ER I LLNP 
I » :AKI k Ml -V : R F. f KLKTLCUr/LTP FVR F?J: : r-F..rtVTT^Ng | LKVREOLFDI EQRL 
ijf JOKTV: tltt lYAAVOAALAAYVRKllEr.L IV:TYi:i/;AVt? Vr.^rKVTTLMfiDLHAVEELV 
KMrA^KTYHIllfcSIV^JtRVIurVlJI^IILROCOSJl^F;! IDWKKLFCLLNNNT^fiNPMOPEC 



CPIU0151 l?* 179 
nnpA-f«onooxyosn«se 



l>2-i25 




ZS!io^lspl^lCLL-rrYNLVXSDENFMXKT3H^^ 

SSSSSy^JkSrk^ 
SIIlENCsruipLW 

NSLFII^RYICYKTHTnUJaXISYUJ 
CPlU ° 1 " tiCSl^Stiin 19019 

dkSicvlhlpntttpexxifptv^ 

PRM^KAL5VWAPlADOCIUJCQ.YENFSKHCXCDl ISVCKDFCFCPPPIIVCSCIR/OL 
LIRIOWVTANSLPTKPYILHQOG IDCTLVSRTOCTLFKKTAPGRKTFI SYPNTOOO-AT 
APOLCHXLDQIVSMPORTL 

CPIU0153 195430 197892 

leuS*Leucyl tRNA Synthetase _ M 
WYDWLXBOXro^ 

rwrncRAW^vwp>cwMrcu>AOTAiRT^ 

SSS^POYWTOOWLYtXXa^YliA^ 

YTWTOOCiWWXLKITAYADFT T FT7T TUr.rVPP^QI^KWIGKSg^VTFKLTQXSS 

r£AITTKI7rU^Fl.VIAPEHPOLDSmEE0Ri)EVTAYVQE^ 

TCV7TCNYAJCHP ITOttXPVWI SCYVVLCY0rrCVV>e3VPAHD£SI)REFA£KrSt*IRTVI 

tWwiHSVYNDFClKSLSOOEAKDWNYLEMMW 

rpxXRFDSTHRPLEXSXLPLLPPNX DOYRPECFCOCPLAXAO WVMI Y UlA 

il^SEsCWYYUUXlJAHNSQLP^ IOGAEHAVLKLLYSRFWR 

vwSISumEpnoxiaNOGLVi^sirRiw 
S3SSiucvopo^^IEE^aw^^ 

TSSXVraXEDROa.VlJUOaVFllITEHXEKHSUrtIPSSF>IE^ 

AVRVIXPIAPHISEELWWLCOTrciDOAAW 

XXEAPKEEVlJLSRSVVAICYlJNA<JIRJCEIYVPWaVHm 

CPIU0154 197674 199202 

oeeA-KDO Transferase 
TsircpiocJ«cvHRxncrri^vcA7^ 

PCBOT.VWFWASVCEVIUai^VLEXrCErrTO^ 
SllMFSXIIKSVVAIOJlPSXVNTSECOCWLNrXEEAICRICATTLVI^^ 
KnJWL030WSPV«FlXOOEVOKORFLSLCI PEMOOVTCNIICTYVAAflTAUILIMr 
WROtlJUJTttSIQ.^USWRSDACKWLPV^ 
rjttPYCD«RCAKFSYVPSWVVOEXCLIJt0LWACDLAF^^ 

VPUFCPKXTSQSELAORlXLSCACLaXEXEPX XDTV5FlIJINQEVKXAYVQttKVrVX 
AETASFOR7VRALX5YI PLYWS 

CPnj0155 199697 199488 

No robust honolog present in cenebank/EMBL as of 11/7/98 

NSLSFCVPTLEXUtlSLIP XEDiRKIXFMmtNSSSJCFSNOEJCCIRTYTXSIJIiCrf^ 
YTUUNINPN 

CPn_0156 200147 199770 

No robusc honolog present in Genebank/EMBL as of 11/7/99 

IOTHJJVR»««NIVVUEXPWAFli3^ 

FYFYRAATPOTOHPDCCCFirXXRlJCEliSAGFFYCOLJESrn^ 

LFXRDE 

CPn.0157 200753 200298 

No robust honolog present in Cenebank/EMBL as of 11/7/98 

FSFVTYTCEALWIYOFSPGASPWWQASLMAQLNSYFCIjGG 

KAWSTAEKILKIl^ILFPLVLIAIJURYU.YMF^ 

knpalvxeaaltvs plfyslpkxyolmkvetp 

CPn.0158 201463 200894 

No robust honolog present in Cenebank/EMBL as of 11/7/98 
PPK XTLS INI OLLLEDLDTDS I PWPK LYL5EDFDFAYYPESKA 1 1 DTVAKLEKNKPCEEF 
CLESKXILARYlXEOLFXLO^LNFPTSTIDGGRESFLIEFSHrTKKPTWAFIYFYYYH 

si«pkle»fkoaccevhnruj<u;lkyrpoacaondcrw3gpycp icflivweowosv 

LKDHGFIKON 

CPh.0159 201811 201467 

No robusc homoloQ present in Cenabank/EMBL js of 11/7/9P 
CCFOOETATR I FSMTPSCFSLATEEIT/OVSTAEKVI K I LAL I FFP 1 1 L I ALAI RYFLHRJC 
FORJCCFVIPC0TPKELXLILAANPOL'^KAAREVHF<3FFALPTKYOSMYI0TSKC 

CPn.0160 203794 202127 

pf kA- Fructose - 6 - P Phosphot rans i erase 

TVELI^LNKSYFEIORLRYRPEILTLLETIRSKHIOETSSPPSPPPEWWIPNLCRIPE 
VS I YTEOETSSK PLK ICVLLSCCOAFyjMNW ICLFOALRVFNPKTRLFGF I KCPLCLTR 
CLYKOLDrSVIYOYYNMORrOMLSSSPEK 1 KTEEOKKNI LNTVKOLKLCCLLI IOONNSN 
TOTAMLAEYFLAKNCKTSV ICVPKT I C1DLKNCW I ETSLCF1 fT:^CRTVSO< ICNLAKOAL 
nAKKYKHF t RLMOCCAS YTTLEOTLCTLPN I AL t SEL I ATRK IdLKOLSEOLALCLVRRY 
OOOr/.TTVL I PECL [ EM r FHTRKL I LELNVUANCDi:.'? I EK t U*K U7PETLJCTFKLFPK 
OtANOLUJWO;iinNVRV::K [ATEELLAVMVKKn tEK tKNWEF}l:n/.'aiFFrTYEARACFP 
liNFT/^f/G I ALC USAl^LVROKTr 7fH IT INNI AJJJYTEVO^ATPt.YKMMHLiJmCCTE 

tpv irrosvppKnpAwiJ w oj."a r 7:;.7r.pc.Yi'Frcpi/jYFT :keel r noRM.TU«W0T 
iiSPP>ALY:rpjriKni:t. 

'.•rfi.oiiii iymm 
litr*riict<\1 .icylti.Huiiui.j:;*! i.iimiy) 

i ir:z.t3Rkcei«llit« WLtixwjrrw::t:n a j«fttp;i 4-irrrurYNprYMVtLL«c 
i A:;r*7*iSKK:3<VRiAuaTni/; i aalwumj airnxJDiKi^n^a ^wvKy»* ^ei ieyt 

11.11 <l i» l DOER LA I Fi::;:;UlTTLAW/rM-FFNK 1 KAI JVVWAIT r Ji IKl JIAAFAOrNAPKV [ 
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206394 

in Genebenk/OffiL as of y/7^?_ 



TMTVIcnAITYACKrtWPOrYTOriJCIOtVKELM -NLP P t L YMQCEQ DIX VS INHRTL 
FTEAFANCDKP [T ILTYPOVOKArPrACSSALSOtTOWUOUELTSCt 

•;Pnjll«: 205870 204803 

no rooust honolog present in Genebenk/OffiL as of U/7/98 
FVYTLYNICSPFRIK1CLYS ISSOVCTPWI P0UffiKVDSYLTLaCNRIICVn«n«COJJI 
IGKVEWR irTI^ILKII^FLIFPLIUAIJU^YFLRAJCYANX^^ 
'» WIHCTASH YK LTTL'/PViSOKHLOWCIINPLCVEAALH n RPSI" FCWAKYROI X, ISSN 

:.::..*\;::»;:si..v.wT!-.'iiLrrM:r .^m'l.tf^t?^^ 
• . /!-y:.;!5U*:r»^r,F:::^\u , iirryn>i.T:.r! v*/ cyshttt '.»nr twrc/FTPr*. 
pLOEDRcacrciuoLoa^rpicpsocPWiwoCTOCMiYWEDsrom 

CPn.0163 205831 

No robust homolog present ■- 

rEXAmCIKOCQIIICCISriKTPTPATPLra 
I IRETLWCCSLVTTYPKBCORLRSP »LRVl^LV0SYPI«UUIEIJO^POTUGA 
TYI mADTSTYILSLRSYOANSPSDCrrWCIwrcS XOOPWVISrWDHCrAfcPSTLW 
DPLLCTNX 

CPn_0164 206444 206998 

no rooust honolog preient in Genebank/EMBL as of n/7/98 
LcrRCZYiKiirsFiJcoiifrRSTXCssosirsRsrsoiaJWtJ^ 

AFLTX^VOGAUALAGGGVUFPLGLILCSVLVI^SIYXVSCa^^ 

*KINIVFE1CORNKOIEKAL£NPOLFC»HXRN\^^ 

MYPYL 

CPn_0165 206983 207582 

No robust homolog pnivic in Genebank/EMBL as ot 11/7/98 

NVXLfMNWVPmDHVDPCSEIDXRJCWSCYXLIl^ 

PSKYOEOAim^DmMWLTIlSYTODSYLTPLPASPI^ 

KHF CYYARL AFNESVCVYRELrPX tRLTIOfC/ECXTfSKEQZKiO^^tSFVKTtJfEG^T 
LIDaa7rOLXCRCFTOVFCT 

CPn_0166 207594 207962 

No robust homolog pretent in Genebenk/EMBL as of 11/7/98 
NCUC0YWCSDS11GES1MISIK1XACTOTI 

LVWVAGILPLLPVLXLEX ILITVLVLLFCLVLEPYU DCPSKIIC&PKVDELSWETD 
STL 

CPn_0l67 208309 207977 

No robust homo log present in Genebank/EMBL as of 11/7/911 

NLWSHFPP^FFHLPrcPTIIXJUCPn^SENYCU^IUJU^T^ 
ITVEELSAKDRKJTCPQSK>iCTLYTEDPILPAHNSFSNCSOI(?giTPISPIH 

CPTU0168 208716 208417 

No robust homolog present in Cenebank 



/EMBL as of 11/7/98 

"AEXJCQKyCAGSYEOONON 



UCSILSFVQ XLOEXDCraOFtAT] 
CPn.0169 209537 208710 

No robust homolog present in Genebank/EMBL as of 11/7/98 

SFHXEOTIGENNM)g^VGSECSQPLVMELNT0PLRNLCESRLVlCX7^ 

T ALAGAGX LSFLPWLVLC rVLWLCALFLLFSYXFC P I KELCWYMTDSO XHGWOKORN 

TJffiTOPVSEDSIRTVlSCYKLXKACKFEFWLISEII^^ 
VSHIDAPLFCPTKSYYRDCYLTPUIACPRYX INRA1 

CPn.0170 211098 210025 

No robust homolog present in Cenebank/ EMBL as of 11/7/98 
NVWQWI IRGOCWTCTVIArvt^HSYtH^^ 
EA1 XJCVLPXADIHVKLPCTITrotJWIUT/KNGFU 

FRNFQDI CHEXDPDLSVUJYNI LNYDFNSFDRVKATVOCHRF PPOO IQNEEDLLLX FNNY 
LQOCLDOTrmEVQONIIUJuWLYPSLPEXKAJMm 
noCTFAPQIWTOEPAOEAVWWEVDSTFPC^^ 
DSGFGCEAHAGIEIETRTirSSAiCVNPinLIEITRVT^^ 

CPn.0171 212444 211149 

•guaA-GMP synthase 

IIKLOSAPJWLIfriFILDFGSOYTMAXOm 

SGGPHSVYENKAPHLDPEIYKLCIPIMICYGKOLMARXJ^^ 

CELFlWIVieESLD«IRM^PJ)HVTTIPEGFWIASTSQ«^ 

EVSOSTFTGNKILETFTOEXCSAFTLIOTLYXOOOLVSM^ 

AOCTIYSWIESSRSGHASEVIICSHHNVGGLPKNLX^ 

SSYLLORHPrPCPCLTIRVXCCI LPEYLAILRRADLXFX EELRKAKtYDXISOWALFLP 
IKSVSVKGOCRSYGYT lALHAVtSTDrMTCRWAYLPCDVLSSCSSRI XNEI PEVSRWYO 
I3DKPPATXEWE 

OPn_0172 213237 212440 

■impD-lnosine 5 1 -monophosphate dehydrogenase (COOH-cerrainal 
region only) 

APICAAXGICPLCISRAHHLVEAC»NVlVIOrAKAHSKCVFOT^EI 
tVTAEAAVSLAEICVIJAVKVCICPCS XCTOIVSCVCYPOITAITNVAKALKNSAVTVI A 

orwxaYSCDvvxAijjy^rx^LcsLLjyjroE^ 

ADR Y FOTOCOK KLVPOCVECLV AYKCSVHOVLYQ t UjG I RSGMGYVGAETLXDLKTKAS 
F7R rrCSCRAESH I KN I YKVOPTLNY 

CPn_0l7J 214041 213715 

no ron»j-t homolog present in Genebank/EKBL as oc U/7/98 

TIFOLIYKIDSYKHOOGnUJFSVFPDRFVESTSPnFIEDXOAiaLVSNCCHYCSRCLFIF 
l~LL3I r I^iVY^TTSGETASLVFGILSLIVLVLLI IECRNRECCRRIS 

'.Hi_0l74 214215 214724 

Mn tor^m ItuMoloKi prusent In rVnetwinK/CMBL jc oc 11/7/98 

K I F (WF7KX IV I L3MIMTT ISNSPSPALNPCLdLJ P PPTLVSnrrTOTlILAYT l PAQCRRS 

TLR I ILM Y I r I t/J LAT 1 1 3TF I V I F F LNn LNLLlTT P."« ( tSHSCXI IVCliFLlMCLYFN 

::^a«/a.vcLW/K£LSOAeEnEEEY iqei ealrgapraespte:;pstwl 
rtttjivr, at4M«,t, J 15275 

t*# ttiiiunr ivKnoluci precwr in <;*i«b>uik/HMliL •u; nt U/7/T»8 
UAAj^^UJMPLM&MTCVlOOrrTT^^ 

¥ I FT:i^C:m.EKVAirtU»;Y!lPKFYL^FrDRW>/^HYEVUXrVFLICTVAACI I EMS 



FLTDSH3PELL3EVKEALJ'. — - 
CPIU017< ^*U27t g 

^^^S^fIS^ 

uacunrroLFiRJr;s£ihUJiNrj»;KYi-\rrjt:L*;*i .ku.« . ^.-j— 7 • rr/ 

rpn 0177 217513 216608 

No^oiu.t hoeolog present ^^^1^^^^^ 
WCRXOTlCSKFIFLISEQOCOi^IFSSVaj^ 

VIS!jNJKSVSQLPHYPSAFlcTTOXrS EEWOPYWAI CTX^ PX 
STYCSLIWPIlSAALTXjmfPJ'HPrWINCYERSFNIDTC^ 

NihSsc^cHAiGiixrra^ 

CSKEVl»PVICCYYSLVWt5VSSS0SXJiAFC0SFAEX7)^^ 
OP 

CPn.0178 218052 217789 

No robust honolog present in Gcnebenk/EKBL as of 11/7/ 98 

VKEYUFLVCRWEACPOTIUWCTVSCKFtt 

ESIUCOLLALCirTCYINREREVWVYLD 

CPIU0179 218550 218056 

No robust hoaolog present in Genebank/EMBL as of 11/7/98 
PKIWWTETOXEATSVPICFWWXJUW 
WMliWMJWCTWlTSCIFESIGFT^^ 
LPRSLlOQSUCFKSSLirrn^ALDLSAPKKLECKPNLSYEa^ 

CPQ.01B0 218963 218355 

No robust hoaolog present in Genebank/EMBL as ot 11/7/98 

TSLH«XJX3CTTCPWX0mVWrrYP»II^ 

CIXCVYHTNHYSVTTFCVDNYT'NLRFTrVSS 

LKACKIWIEVPirVVCLDLI^IUSKIXUCTO 

LISLILLOCOAVLESFOETOSS 

CPn_018i 219175 218777 

No robust honolog present in Genebank/EMBL as ot 11/7/98 
VHII^IIXWYFTiaCFMKlJYNWSUISHHEra 
OOXS»XYCVIUU.YXfUWIVSXSWLSRXPWIUJaUlZJlYCrLP^^ 
GLXRFSRLAXSVY 

CPOJ0182 220704 219334 

eccC-Biotin Carboxylase 

ICXMQWLXJUWEXAVRXXIUOffiLGI^TVAVYSLAX» 

SYXIUSmLAACErrGAPAVKPGYCFLSENANFASlCTSCCL 

AKSXJUaUKCPVXFGSfmXEXJCSEGlJCXAEXXGFTmKAVA^ 

PJ^SAARAEAEAGFNNPWYIEXFXENP1U<IXX0VlGI7nOr^ 

X EETPS PX LNAEI RVKVGXVAVIK^RSAGYTSVCT^/EFIJ JJPd S jJlF tlMJJt f t K iqVBff 

XTETVTGIOLVKEQlKVAieaOCXJWOIWIErSGKI I0CRINAEOPTWISPSPGBLWY 

LPPAGPSXinraCAJC^SGYAXPPYTllSMXAlCVXAJGClQA 

IPrTCFIflJIinCFXJSSKWIWIDNXiAOGN^ 

CPn.0183 221207 220695 

acce-Biotin Carboxyl Carrier Protein 
Pjooo3LicoxEiaiUAic»saaatFAXiaiECLEXEX^Rxy^^ 

QEllPIPTTJPiaaJTIKrrrrTCETSTTTSSCTFISSP^ 
SEtTTlVCIVEAMKVMNEVKAGMSGJn/LEVLITW AXX3AS 

CPIU0184 22X814 221221 

«fp- Elongation Factor P 

QlflUJrFtXCroXHVLSSQLSVGMFXSTKX&LYOT 

WliRNFTCATOEVKEAOFETlrTLEYLYLEDESYLFLDLGNYX^ 

GVTVSAMVYXXAA/FSVEX^HFLEXin/SKTDFPGDSLSLSGCVKXAL^ 

ICDVTKIDTRTCEYIORV 

CPn.0185 222457 221765 

rpe/araD*Ribulose-P Eplmerase 

AEVKKQESVLVG PS IMGADLTCLGVEAXKLEOAGSOF Z H XDXHDGH FVPNLTFGPCI X AA 
INRSTOLFLEVHAMIYKPFEFIESFVRSGADRI IVHFEASEDX KELLSYXKItCCVOAGLA 
FSPOTSIEFLPSFU>FCTJVVVU<SVYrcFTGOSF^ 
EVlXSXDQOSAPLClUIAGAOXL^ASYt^EAiKLA^ 

CPn.0186 222878 2 2 40 6 R 

*suailarity to Cps XncA 

PIKDKILIISSPVNNTPSAPNIPIPAPTTPGIPTTKPRSSFIEKVIIVAJCYILFAIAATSC 
ALCTXlVCLSGALTPGXGXALLVXFTVSMVLI£LXL)CDSXSGGEE^ 
LTVITTTLETEVKCXJtAAKDOLTLEI EAFRNENG^nJCTTAEXiLEEO VSXLS TOLEALCR X 
N0LIOANACDAOEISSEUCKU3GWDSKVVE0I^I0ALKVIXG0EW 
CO XOALOAEXLGMiNOSTALOXSVEmXVQTOALTRV^ 

PTTPWEGD^CEDEGCT 

CPn_0187 224218 225045 

predicted nethylase 

\TLTYTRTLPMHSKFlJ»RRXI(NSSHKEETrWt>TlAS3YflKlVQDKnHYYIIRCT 
JLTLG5KSSVL0 XCCGOOFLEPALPKECRYLniD l~7.f-L (ALAKKMRrTVNSI IOFKVADLS 
KRIXFVEPTLFSHAVAIL5LONMEFPCRA ! KNTATLLETU39 FF I VIHHPlTFR I PftASSW 
irfOORCKA t SRH IDRYLSPMKI P lNAIir«K>KD::P. , ?TL':FMFPLT/WFREU:SlinFLVaCL 
EfWr^KT-TPrWRAKAENLCRKEFPI^C M [ :X I K I K 

iTI1iJll«M ^JOU'iO .!.!l.4U0 

iTU2 itvpoch.!t ial prnr.oin 

K-nier imfrklfpf::kkictd(/korijcnni :u/,»ai tor: ifvi jji»KA::KKA{^i.-TfYnLL 

•P VI' ( LVFFLRL^OflLFTNLNWFEMt >t IKKl WW V 1 7A I VEAAYI I ATtCTI 1 1 M SJF 
WVW. I LMLLJLEDrJLNk IFPT: HTf'UA JTfUVimV fTl.Vi.'ltl I VI \W SWIYITQ 

i hp f v/AKLF:;uai.-gfrALYF r -j* rvm.u.7 lai w :/aki >pva i ukt:;ai .i t ic 

; WP/FVKAFFr.t<Vi: IFN\^PTYIMIA/AU:;FI4JI4.Y IVTM rVl^TXW.Tfl ICNRfrT 
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F I Ft/TDK t LHXYLOL XTST.' I LALTTROfWBTX ^^JS^^^SS 
EKICFWf NWTWPVPNFSEiT t KD t AOIU-LKRE 1 FKKFNPDLC XTF I EN5 FONT FNQA 

:^kkkenltue;aj»r:k 

OH? 224391 229625 
rrill namolw<PossiOl« Transmertfflrane Praeim -xcri* 

tncgkisntam^^ 
scwkk^a^ 

FXXMOPTOLRAWLDAKXEYOLiraC^^ 

anwhxtofIuSjs^ 

I^ILNMILLR^TA^ 

IICKRU^IDRRIMJU>CinT)IAMDBLFHrDSID 

GSISSPEVD^ 

SXOQC 

CPf\iOX90 22*901 23X274 ,,.,,«. 

no robust honoloc present in Genebank/fHBL as of X1/7/SB 

EP^XCETHOmTKDVNISS^^ 
SKOri£RyVSSOLTIDlCVEt«PXT^ 

SASicswxOTrsAuuocETtsnrtxTOXTsm 

SPICSRTSLOUeQ^FQISPVDrYIVKAAHSCHWSCr 

CPn_0X9X 232039 231114 

glnO-ABC Amino Acid Transporter ATP* it 

3HKFPVFL5YQKRECVMTI RVRNLAYSVNXKK XIJCV^SLCROT ITXFVCKSCSGKTMI 

LRAl^GlVOPrOTXWXECEAPAIAT^ 

FOJJILtaiEDrAKWPWUOC^mXVRSi^ 

Ll^TTJTOELTVGLTTHrKOrWCLDRIYWOOCaVA^^ 

AO 

CPn_0X92 232643 2319B4 

glnP-ABC Amino Acid Transporter Permease wottiWtw . . uvw 

EVQTOCAIARIXLSOCCYTLCVSCXCI^ 

TVIRCTPUFIQI U IYFCLPEVLPIEPTPLVAC I IAl^MHSAAYIA£«i^lNSLSlM 
WESAHVLCYKXYOIFVy IIYPQVPlCNXlJSLTNErVSLIJaSSILKVVCVPEITICVTimi 
VSlEt^PMEMVLICACLYTX^SrSCISRLSEXRRfiYIW 

CPnJ>X93 233X44 232666 

KXHGVFWOO^ 

AfiERaARYSLPSS7EXTTTW0.VI£IW^ 
LWOOTXnfTPIDBCRLPIiKVSIANliOVrLO 

CPn.0X94 233X62 234241 

gcp-O-SU log lycopro tern endopeptidasa JL ,^nrtmr 

EVPHT IKGNVTTSWrTMLTXjCl»CSSC0CrACAIVNBDKO H-UII IASODIKASYOCVVPE 
X^SRAHLHIFPOVXMULQOANIXXEIWLUVTQT^ 

LXCVNHVEAHLYAAYMAAONVQFPAlJCLVVStSAHTAArr IENPTSYKUCaCTRDDAICET 
FOK\RWFLCLPYPACPLX EKLALBSSEDSYPFS PAKVPMYX5FS FSGXJCTAVLYAI KGNNS 
SPR5PAPEI5LEXQRDIAASF0KAACTTXAQXLPTX IKZrSCRSXLXGGGVAXNEYFRSA 
IQTACNLPVY FP PAKLCSONAAMIAGlfiGENFQKNSS IPEIR ICARYOWESVSPFSLASP 

CPn_0195 234X72 235785 

oppA -Oligopeptide Binding P rota in 1._-.«h 

YS^SYMRKrSVGICITILLSL^/lWKESSHSSTSRCElAINIRDEPRSUJPROVRL 
LSEISL\flCHIYEGLVQE>NLSCNIEPAlAEDYSLSSDCLTirrFKlJtSAFWSNCDPLTAD 
PI ESWKOVATQEVSG IYAFAUNPIKNVRK IQEGHLS IDHFCVHSPNESTUVVTLgS PTSH 
FXJOIJtfJVFPPVHKSORTLOSICSLPIASGATYPraU 

7ITIHFX POAMTAAICLFN0CKLNW0CPPWCERI POETLSNLOSKGHLHS FPVACTSWLTF 
N I NKTPUJNMKLREALASALDKEALVST X FLCRAKTADH LLPTNXH SYPEHQK0O4A0R0 
AYAKKLFKEAIiEL0ITAWX^HU^XFWSSSASSLLVOt»XREOWICES£i3FAIPXVWE 
F ALL0ADX5SCNFSLATOCWFAOFADPHAFLTI FAYPSGVP PYAINHKDFLE I LQN I EQE 
QDHQWlSELVSQASLYlXmUIEPIYTOAFQFAKW^ 

CPn-0196 235906 237519 

oppA •oligopeptide Binding Protein 

KLKSYSKERSFMLRPFAVF ISTLWLXTSCCSPSOSSKG X FVVNMXEXPRSLOPCKTPXIA 
CX^UWHLYECLVEEHSO^ElKPAlAESYTISEDCTRYTFKIIWIUCfCDPLTAODFy 
SSVOC£IU(EIV^SVYLYAFl.PIKNARAIFDDTESPENL^VRALDKRHLEI0LETPCAHFL 
HFLTLP I FFPVHETLRNYSTSFEIMP tTCCAFR PVS LEKGLWXLEXNPHVHNKSRVKLH 
K t tVOF ISNANTAAILFKHKKLDWOGPPWCEPI PPE ISASLHO0O0LFSLPGASTWLLF 
N IOKKPWNAKLRKALSLA IDKOKLTKWYOCLAEPTOH X WPPXYPCTYPERKRQNER t 
LEAOQLFEEALDELOHTflEDLEK ET LTFrTFS FSY*3R XCQKLR EQWKKVLK FT I PXWOE 
FFT I0FMFI J*JNY:XTVNUWTAAFICmj YLM I FANPOG 1 SPYML^HFOTLLIKITQE 
HKKHLWOL 1 1 EAXOYUJICH I LEPU'MPNLR XAUIKNl KKFNLFVRRTnDFRF t EKL 

riliJM^ .M75U .:«»MR2 

KNVKHK;:NMJCf*li::KKJKV fFKHFSRWITLKLLF WLTt XSCTaGKMKOtt'l iriHOOPV 
AF:;i'hVVKKA«UU; I AVI4.Fl\ U.TKETHRE5NOLEUA t AlJRVTVi lEDFClJYTFF I KDSAL 
W! aXHTlTilKD t RNAWKYAOENSP1I CO I FO^LN FVTPSSNAIT IHLIWPNPOFPRLLAFP 
AHA I ^PKNrKlX:>*.nTLVEYFPfJI IN IIILWCNftr/YDYUCVS (NSIKLLIt PDXYTAIH 

ijjwnrvrwv* syitexM r rvra 4(koc vvi nrvrY pvbtjapwi tunics phludwnr i ir la 

T. • n*P:: [ I KKA14J rfvVPAfTL:»i'JVr\»PNOYKrC;KPLT POKKLVLTYP.'WlLRCORt A 
1 ■: 1 1 JW<KAA : t M .1 Mi 51 ^YlUJ'VNKRKVOOYA ( ATi/fr VAYYf Ml. :::KROKUA)NF 



e: ip tYYurron.TOOF : et. A.r^xtwfYPs 



CPn.0l9H 
oppA-Oil 



I£lol^IAMSldtSJtSf«N 



• •"^vJ'T.l.iip-A: • • • • >: r : :-l; h?*-* 1 ."'' '^"5 -.^: JTt :. 

JJSSjS^iEETTPlIFLYHC^ 

CPn 0X99 241016 241963 

Sy^^OWU«XAJCt^SLVmaWCNmiX:STW 
SSSlASS^IWYXI^XWISXPArTFATlXCW 

SSSwrTWAionrcxre 

DUQSI IDPOIRYAKCKBKKWC 

CPn.0200 24X996 242868 

StwSx^xjyypwwslw^ 
cpn_020x 

SISTITWX^FLPEfCUICrCSSIlWIDITia^PK^ 

DicxxiSSLirTCTVExin^Pia^^ 



2426X0 243715 



CPn.0202 243682 244500 

£™S™*5xtSb^^ 
SSSS^wwwoja^a^ 

»o£E£lJWJoSfONT^ 
^DX4^PXrSUSTStmEtYSL0VASK 

CPO-0203 244966 245802 _ _, -0 

NorobuRt homo log preaant in Ganabank/ EMBL a» of X X/7/98 

^^Montn^oenVrFSPTLOlCSf SLFXXgKLDSYF f ^'» TOILVIT9T9IIttA 

ilSScS^EKIXKIX^XLX^VIXAPILRYFUOO^ 

SRSA^VWISPArFSIPIlKYO^RIOTIOTAJ^ 

Nl^PUOUKOri^PEZXAimXCSXEmiMXSlJS^ 

FNPOCRCYrSEISTAKIHFKOKGRYCPIRSSCPIIOCZX 

Nrrobuet homo log preaant i^Cenebank/EHBL as of 1 1/7/98 

P RDttWVFTRNKYSKDPFS SARSIWANP FFOTHH ESN I K X KGKJYO X PTRUCJttflXaTSS 
YH81NPNl^FPOEOCTVYWtS0rKSAWOHCIL0^ 

CPO.0205 246073 246327 

no robuat homo log present in Cenabank/EMBL as of 1X/7/98 

xiDiScYWASA^POliXl^ 
YRJUCOILOLKNYKCFTILNTSPLCS 

CPn.0206 246346 247161 

CT203 hypothetical protein 

IVDRRSPACYUS INSDAICVSLLKDISHXLEOIAYDECILPREAXEAAIVKOHOrrPYXX 
H I LMDATQRVPE IVNOCS^CCH LYAMYLLAO FRESRALP L X I IUJAFTXJOTP>WIAGDVL 
TttLPRILASVCNODSLXKEX*! ETPKINPYVKAAAXSCLVTLVCACKX PRISCVXRYPAXL 
LNYRLEXOPSFAWDr^IAGICTLYPCEX^PISKAFtXXLVin^FXSXED^ 
ESC IHTLCSSTEL INtJTXXEMDCWLEDFPX EP 

CPn_Q207 247208 248617 

ytmx / sodiTX -Oxog I ut ^rat e/*a 1st a Trans locator ... . 

VNXXKRFLSLLFLTAVLLJ IWFSPHPAS INSNAWOLFAX FTTTIMC I IFOPVPH3AXAI X 
CISTlALTOTLTLEOCLS5FHNPIAWLVFtJFSIAKCXXKTCI^CERXW 
GLT/CLVXTOFFLAPA 1 FSVTARAGG ILYPWSL^OSFCSSAEKCTOOLXCSrLXICVAY 
OS3VrrSAMFl.TAMACNFLVAALJ^HVrWSLSWVL«AFAAI IPCLPSLFLMPI XLYXLYP 
PKXTOCEEAXWAJOJlLK!>CPU«EEKTIU4XFFI,LWU€rro 
LLI LTNILCWOKOVX ANTTAWETF rvPnALIKXASFLfCLCFIPLWOSAAALVSCLSW 
ICFPLLFLXYFYSHYLFASNTAHXCAK/P IFUAVS I SLCTNP I F AA LTtAFASNLFOCLT 
HY<33QPAPLYFCSHLV-r\*CEfc*<P.SCFAL3 1 VNIVIWXG ICSUMWKAUCLI 

CPru0208 :4R935 250602 

pekA -Fructose-* -p Pho^pnotr.insforase 

SVAVILMJPLYVDLOT t l^S^PPLPKEFOEAA.SLtAVPtTTSHSKPWPCVKTtfPC'rYH 
LPYUCPWENVVlfrPLkV^FStJnPAWWmVIOOUFNSLKOFHPOSSLVC^ 
LTtttOtS tDlTEEFLi*KFRN.<F;FNC ITP3RKK I VTPEArEAlXKTAEAtOLDCLVI IOOO 
i EJffTATA I LAEYFAKRRrKT;: ! WPrT IO!0L0' ff FLDLTFnFDTATKFYSi X XSNISR 
DALXKAJIYI IF I Kl J% W^VUI C ALB.YJJUrnilf J I AL I'iEEXASKNLPt JtT IX ItKICSVtA 
|jRAANEKYYih/ILirEi:i IKFtPEI Cin.ITFjr^U:r/F.OKl:;m^PEnORLLKSFPAPI 

i f-rj rii*xmoAiif^Tv/::K::r/r,rLi.uii:/:MiWYFwn/PFt4Ai:wFurrTO 
FMrrYT/ra/wrDv : 1 1 Ai«ii ~r i'n .rr I « a a« rfFwrw UA r i wimPTVKOOADOTLO 
r-r. t r *ylvo u*H-rrAV*R kfklyi* r t wai :yx*ku n»i /y I cti i-wmtJ wfppltlllhhh 
fWivaiiEtprfrrY 

tn, t'*»*r.r. iM«m»l*ki i«i..::-fu n< •«iM<Uirik/KMlil. oi II/"/ 
MrirjoiiHi jo4ETY::i':rrv2*:rtrn^.Y i mkkij*::yk:>v y irrnv i a itva* :u\iaybon i 
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t.'HiJHlU 2Wi* 25 1440 

mo roouat nomoiog present in Ceneoank/EMBL as of 11/7/78 
YOKLXEREREYFICT I REXEKAT I STMLVELEALWIErAHLItDOKPTSrc EITSLYOCLDH 
LEFVLLCLwSK FIXATES EE7VLFESQKA X DAVfflALLTKARDVUCLCS ICA t YQT I EFLC 
AYLSKVNRPAFCIAJEIHFLXTAIRmJtmLLGFRWPL^ 

TFEKETKEUlSnLLREOUMEKCS rOOLORKUX I : lELKDVSLTTFSrTPSOEEYOKO 

•. :rr -rrr-.-r-:./- tt:r k»* r :f;?k£1SP.\: 

CPn_02U 252765 252463 

No robust hofltolo? present in Genehenk/EKBL as of 11/7/98 
ECVKSYroiSNVOASStOSAlXHKTSDOIOOKRCncOSTFVILAVSLVI XGSLFlLAfiVA 
I LTVFSKCVLSLVFCVLC IVLSLLL1AOCVCU.YEEAXSLL 

?Pn.02l2 254011 252888 

No roDust notnolog prumc in Genebank/EHBL as of 11/7/98 

EUYCWVSXYSEILSFSELTSCKHSWroPIETASIRIHhVyTIWI^ 

LCMVTLGWSTIfLU^SMILGLXXJSICXAlXKFl^^ 

I IWOWAXXARELDXXOIOCDTLIRGreARXX^ 

AOEONSIlSCCKEAiXrWUCSAOEIFIOtYI?^^ 

YICXCVFECTAPHFIJIEAYAICRTAXNIJWYVI^^ 

EIETIX^JJETinJTSDSE3\a££YOIHCIPA/TM^^ 

KAVEKIETFEELOHCVVHTlCiaEIJWLYVDIU-EA 

CPn_0213 254345 254190 

No robust homolog present in Genebank/EMBL as of 11/7/98 
rLVVTSRVIFSNTM3ICIPRIiLXLPLI«XEKDPrcrtrSRVECTFIILNIK 

CPn_0214 255768 254446 

No robust homolog present In Cenabank/EHBL as of 11/7/98 
nXSJSXWEIlPTYaU PPAPHP0RVD5KTCX 

TSCGVLXGAALP I LCI GL VLLA VAL rVFLCHJCKKTRO DLDYYDCD LDSLVI HKXZ I PKO I 

SELRVTFEiaONLFOrHTKDFSDLSOELOCKF INOfEKWLTLEDEVTKTLIVRORrLETR 

R NJ mmjV KG IOSN I FOLHEXXS5 LYLILYRLRKDLQVUJ^TIX PPG I LKVDYPE I E 

AXKGLFIRI.TSRLOXlMCAOERiaCFXNEMSREnCEVEXA^ 

ARIJKSRTESIiDOQTOlXALKNQGLOPEl^HPEL^ 

LISGTVTSCLTLEECENRMRAAgTGI^ALLVRJaon^rxS^ 

VXKSLELE2.XKKIXDXVTEE7 

CPn.0215 257039 255759 

No robust nomoiog present in Genebtnk/EKBL as of 11/7/96 

LTSSIOCOVKSSAJARICFPSPSPCPSSTLCVHPPXY^ULSVSIJ^^ 

VNAIFSFSVLTVGLGCAGVFI/KLLLII^LIFFVSYHRXX^EAT^^ 

LRJCEUIE^EVSNPlXDEWEIiFXEWAOHKSOFA 

DVAXiLrnjOMWGPLEIXRJOCSX^ 

WE0ECYRDXRKVEKl^VrPEGYlWEII£VIJCT1^^ 

TVFSEEELQEAIJJKAXAXIXDIGVraSVVE^ 

TFSSTOEKVIJEYZAIJtARXlUCTLRVKLDC^^ 



CPn_0216 257623 257174 

No robust homolog present In Genebank/EXBL as of 11/7/98 
NXARTJOWrfWICVmPEXTTSUtlNSYTVAGC^ 

GAAFTLGLGCL I FALFLTS FSL ILLLSOEJCRVPOVLS LYXEKEVPO YET PL YXH3LESER 
OKSAXSERlfillEEKLRZAEKFRYSOSVFV 

CPn.0217 257881 258579 

YPdP 

PKCClOJCCFIJVm.IFCF0TFSVVVLCr^FASRCKAWLT^ 

>GFEVTSADVYVICLLTCt^AREHYEXNDINDAMIXSWVIS IAFLVLTOLHLFLI PSPN 
DSSQEHFLALFSSTPR IWASLVTLIFVQ IVD IKLFTFLQRVFSKXYFAMRSTI SULFSQ 
LXOTX X FSFLGLYGLVSNLCDVMI FAMLVXC IV ITLA I PTLTVTKAVLDRRS5 

CPnJ)21B 259064 258582 

No robust homo log present in Genebank/EMBL as of 11/7/98 
rFl^XKWFESYEOFAWASSWPXSUlALVOGRYFVDSEXKrn^ 
RStJ'IISTlOCIIRLIE^HSCPIHPRDKKXYKFI^^AVIElLCI^XIL^DriGCFLA 
FLVAI IL31XLYCNSTFTCVONLSFTERMLEG I GEAVNFLA 

CPn.0219 259348 260472 

tgt-Oueuine tRNA Ribosyl Transferase 

GSSLAIJCFKLlHOSiaCSOARWOIErSHCVXOTFAr/PVATWAJJ^ 
TYHLLUiPGPEAVAKUSCtHOFMGROAPI ITOSGGFOIFSLAYGSVAEEIKSCCKKKCMS 
SLWITDEGAWFKSYRICRKLFLSPEI.SVOAOKDLGADI I IPLDELLPFHTDQEYFLTSC 
SRTYVWEIOSLEYHPJCOPRHQSM^^ 

UJEHSEWKITTSFLSKERPVHLLCICDLPSXYA^FCinSFDSSY FTXAABHCLILSK 
AGPXKICOOKYSQ^STIDPSCSCLTCl^ISRAYU«LFXVP^AAIWASIHNW^ 
nVMKEIREAILKDE: 

CPIU0220 260660 261236 

no robust nomoloo present in Genetsink/EMBL as of 11/7/98 
FYSFLKKKC IFYMSKESIRSYSEISTPTPIFRETPSKEGVAYKLQLRaPAKDCILRNRVS 
LKGALLRJ r PFYGSFLGAKR I H SAWS AJCDA PCTnT/YHYLVGGLELLCLCVVVLAC KVLA 
TALKFLFSKASSK IKOMKWRFJCARNLAAKDTVCS 1 KEFCSVDLT3CFTRCFRLRNRWEE 
('ASENCTVREI FV 

ailj»22l 261621 2620M 

Nm cotjiiitt homolog present in i5*neb.mk/EH0L oj 1 1/7/^8 
ftAI .RYKYF.IC I QMVNRYK C.lAEFSATWfYYOOIL\Tj4 r 7/KRNLRnL*\ r\*EN CVC LFEO tJL 
LErn^TIP!H^:tu:UWUU^A«TOOPKnSKI.:: t niTALC t LETU LG X IVLLIKtT 
!T I LL I LFTTl ?( -U 'Y FHY JtAA Y SI>FKP I 

l'tHJU322 ^»«.2474 

•wstk ^ iBiil.it icy co Qacr.»tr ioptwi*' niro (ou«n 
•*:KPi.Kirai4tt:LNAra/rjPEr/nMPA*VLM^^ 

KIS *PLT LTYPTKI ILfVYC:; LVK [ J I UJ LKLKfcLR KH I PH K [ R Y F Pi .*i •-AVCTK LOR PH YH L 



r*i rtwiu' nr»r>io«i p. .if. i« ■ ^nwmi EHBL as o! ll.">>99 

:TMLrnRY-r.DOcrrcATKm r-r 1 1 k^fvrdmxgltnpi se rvsrrsss ikdsvlasl 

P I LCS I LT^L^TL^niMLOITOEJtlMrT IFCALS; LCLO^,L>tF|rtIP^IU«:ir 

klvxcfcx *' i : * 

»!Pn 0:24 263402 :t?3-i74 

No robust nomoloo present in Genebann/EJQL as of ll/T/98 
YTFKNPWWKKMKPNS I IFLEKnCHYPCrFREGFVRDRMCLKLASWL^STEITIXRSIL 

....... ^. . • . . ...-V.-.TW r - " 

t:pn.o^'- J»je4c :«»4>4i 

No robust nomolog present in Gencbank/EMBL as of l 1/7/ 98 
NSrTIKFUJflTNAXNSOTTTPOPNLTriAEPIASRACCKSIAVI ISLFALGKLLLCLGII 
LISIPIPCLAAOVALCLCIVSLILCIAi^IGrUnilRCKOVPWPCT^ 
GSTPTALPWOAC EFLCKVQ VS AT P X LLPXNCDEEXSAXVKK EGAEAASS XXOAVLESTEX 
UOAJUQEESPJtEARJUCXVAEEA£ASPJaiXCCC«tAA£»EAIJU^ 

CPn.0226 264545 264967 

No robust homolog present in Genebank/EMBL as of 11/7/98 
A IFtflUTRMPYYAKTLEr Z0GTQSXXP l*F1CYGFVRHHYKGQLE X EOASKDMOFLEPPSTMC 
RTLLAAJPILGSVIGLGRLFSIWSIRETODSOEYKSIFW>n^AVLEXLGi£X^ 
LATF IMAMPGLKRVAT FLFY S 

CPn.0227 265467 265009 

dsbB-Disulf ide bond Qxidoreductese 

KERreilTSCKLLKIUmiNFIRSYALYTAWAISCACTLXSXnSYIL^ 
ICUT^WILGISAYRmSSIKLYIUK3AVI£U:iSIYOVFLOEI PGH3LDICCRVSCST 
KI FIJTrVTIl^ASVVAFGAIVCIiVLTiaCYRG 

CPn.0228 266242 265412 

dsbO'Oisulfide tend Chaperons 
V!G2RAflrT*aJCEXFSCSILJOCX31AFEFYVFC^ 

crGTLX woofT i lppkah i ptnaxhfptiqjpyap iNrrvFEEPscsACAiJ rrrvrPL L 

«HYTim:EISFTLIPVCFIRGSKPAACAXXrrrT^PR0AOir^ 
CSWATPEVLTKLAEGLKINSGRSVNPICLEOC I ASGQYNE0 1 KWWLYGS0VLGG0LAT 
PTAWGDYL I EDPTTHEIERAIOH IROLOAVDGOHDD 

CPn.0229 266163 267560 

CT178 hypothetical protein 

NSEAFS rLRI EOENFSFXTIOCSAI^FT^YNTANLTXSTFTF I LLIX1JX3G3GGLRFMXXXCT 
LSIIYRHniYRFIJG^ ILPAFLGIXLXXSPKTLfnrrOVCVI FSORLCSCLLI FLAIASLT 
KRSLt>/I^PLGrWmJACVAGRSPTIFANOTLICFAItJkV^ 
EOTYWPSRflCRIUUWtrLSLXJCMX^^ 

VLSLAGS ERRWHTRPK IVI ATALALTCVT I LTLLP I X LHQLR YDCWLCLCLT I EPALAW 
FAYDTTRATLHY I SOFl^nXRALTRASFFGSr^nrXKn^TERTVL PLRXAYXOAFtCI S 
FPXH0LXJaLVA7^VKVNSS*t3LPTTPRNTLNICCWFI rVIJILAf AEg^OJhSWLI 
FSAMLFSPVIJHXPA^PKrLPIIVTGLILIILSIGKRRRTTC^ 

CPIU0230 268277 267576 

CT179 hypothetical protein 

RFEXALI YMSSOPLVTTSSSLTOYVVLTGIEKVACYXKArNH IWHGAPAXILAAALLKTC 

IFGfVljGSILLGAinXGA5ILYWILPWlXPSILVFVIiVLPL^ 

ITOaNYlttI«JKCTr£10CTIWOCAl^YXE^ 



Ji.2'i5il 



.!•« mi 



CPh.0231 268996 268253 

tauB-ABC Transport ATPase <Nitrace/Fe) 
PCAfVSX0ORGFSKAAMRLCYSCW0VIlJCX>ASFOAS 

AGFLPLQEGEXXWNG S PL^^UG7^/AYKGC^CEALLPWRTALKN^^^LSTELC INTSHNALSNE 

RLEEIIHNTDLGOLLIJRYPDEI^OGCRORIAIJU^LSLKPILl^ 

Y0DIVAlJOCKX3«TVlXVTRDrKIX/SCLGI7VLWIKNXTLTP 

0LKXMLYT 

CPn_0232 270134 269232 

•similarity to 5 • -Methylthioadenosine/s«Adenosylhomocysceine 
Nucleosidase 

KKFtMRRFI^XLSSLPLVAFSACNFTXLEX9(0SPX>3RVSXXFAIJGVrPVSPlX3ICPXP 
WFSKSKKTLBGQR I YYSCDSTCrV'rwSALWPNKVSSAWACNMIUCKRVDLXLI XGSCY 
SRSGX^RFX«5VLVSKGYXNYrJA£7VRPFTTRFTXPOIKXSWATSEVNREAXU)CCEE^ 
THKQEIEEIXICTHGYIJCSTTiaEHriJaraVATC 

SCAVSQVCYEYS r PCLGVN ILLPH PLESRSNEDWKHLOS EAS KI YMDTLLXSVLXELCSS 
H 

CPn.0233 270439 270348 

No robust homolog present in Genebank/CMBL as of 11/7/98 

EXARTNFLGKV1XFULJIISRRSTV0EIGXFFHLETPDLKXVXXAFV5TFIVVEI^^ 

KBOS 

CPn.0234 271246 270548 

CTlfll hypotnetical protein 

FIHL03CKKALLS X WGILATHP I PCHGr/EAKSCFLGKVXGWFSKKElOEEARXLPVICDS 
LSVKP^rDYTSSSGFSVEFPGEPOHSCOIVEVPOSEm 

Y PEXVD I GRPELNLOECFSGMMOAL P ECQVL FMQARQ IOGHKALEFWIVCEDVYFRGHL I 
SVNKTLYGVF>^rfKNKNPOALDKEYEAFS0SFKITKIREPRTIPSSVKKKVSL 

CPn.0235 271395 272177 

kdsB-deoxvoctulonosic Acid ^ynthetese 

VrVRYLtJ<KPE£JECLCIO/LPARIAJ*."RYPGKPLAKniCK3LI0RTYEKASCSSlXDKI 
WATDDOHI t DH\nPF07YAVKr*PT^.^JICTERTGEVAR KYFPKAE 1 1 W IQGOEPCLN5 
£^nmALV0KLRC5PEAELVTr^ALTTW'EEILTEKKVKCVFDr i ECRALYF.'^PrPFIU 
KATPVYLH I^"^AFKREALFRYLOMf::rrPLSDAEDLEOI^FLDCGKtKVCIVDAKSPSV 
DYPLL IAKVEOY XTi.'U JNAYF 

«*rn_oju -?72iiM wtriut, 

t*Vf«; 'TP ^vnrhct.iutj 

L'KTIYIlMPPKi' I FLTf? Wrtr;LCICCLT/iAiJt AL1 LEROR LilVAMLK LOPYUA/DPCTMNP 
KM * <E I YVTOtt ^VETOLOLT3l Yl I»F.':::aAL: -.mUXATnVj t YARV I KRERBGOYLGSTVO 
VII1fITftEII0V[LOAAKEJCPr^lJVKI^rri^DrEnLPFLEAIWFRYaCEOCU* 
?[^/fhH^f!fIl2?r r V M:;,yVr( '' l "' M P,iA « W-TISEr H.TOBVK.TK IJJLFTNVPNR 
AVKNVIOVKHTIYE^4r*fi<LAOCK IAilFi' < :FJCI.KI^TVPEfllJ)OWXVI.VNOt < JU(lLPKVKI 
t W « YVOHRhAY KJ*. I FEALT1 1 AAI J> f/^IAACI I PI DARLEM LTMEUV :OMXVFfX 
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imwivrnrprocLCcr lEvswp^t^FHPEFVSKUSPHPtrtA/ ieaalvysxsa 



•:pn_o:)? 



271741 274214 



mi^A^ 

.. ..... ...... - ; ; »,:*- % t .:-\kx: — *>* trr 



•TPn.0238 



274210 2758)8 



PI^LLIIRXOPOBCVALKnOCVPGTWrVRP^ 

UGXJRWTOD^^ 

RPL 

CPn_0339 2 7 5B6) 276672 

SSx^ISSk^^ 

jSS^T^DGAiacroaio^POxsra 

FNSra^S^TffWO^^ 
RSPLTWIISPESYDlADrCNISSIYKMDIL 

CPn.0240 277861 276698 ...... 

No robuit hooolog present in <*"«^k /EH8L a« °*J*(1 / *1~ ITTS 

LVYFWFSPSSESWKAN5\OTSNFCYT/t£}« 

RPTQOWPGFXAAONl<GNIJNSFGXLIMCFS0CItSCOTFEJl^ 

l£PT!X!ALVYCAYKVYTLCaO<IYSUOCAKAIC\^ 

XtVI«AMTCSLVVgI^LaLIALTAGrW,V^ 

SUCIWMVRICAJOlOEACVtSHLT^WUfrAVSEMX^ 

YGHLPSNEEVAOLVQGGAPCOCSRPSOHYOCSSWOWWaaOffOCSHP^^ 

fCAGYPTAPTNPSAPPPrPPPAYOTIYG 

CPn_0241 279372 27820) 

No robust hoaolog pr stent in Cenebenk./£KBL as of "'J'" ee 

Tirt.WrgMSAKISLS^HEASIASWTOVRgVLVSl^^ 

TAIIDDUfflWETEGP<HFqVYSNISIJ^^ 

LITairAAVlJTVLMVFCrfUJXLCYSAYKIYOLTIW 

HRS<»VAAAAAS0STIKAC1CVTR0^ 

VKTA>MICCCAAOtnCXUOTGmJ^SV0ICS0ECVHHKW 

LPITPGTlOCVLTOSIRRYCXJFFSDCfcYROIESE^^ 

VIPRESPPXYSTIOSSNSPrPSSSPPPYYR 

CPH.0242 279975 279487 

No robust honolog present in lGJSJ^'HS.^.^L^vI^Jwta 

KsuwcsLYorsgicPTviLMACsirTncacayw 

W5VVLULICVLFLI4^F!TOjC3ASSAI«ERVSI^ 
IGTAXHGFX ISKRTSEDMLSSAIVniOAPX WLKWSGLIKP 

CPn.0243 280609 280133 

No robust homo log present in Genebank/EMBL as of 11/7/98 
INYNYLVFUKFVKGRI IMACS XCVKU3«AN«Pt3R7VASICWVADIU.YPFMAVXCAW 

FAvurnncixriAiKrLVNTCiXACK^PLPSCKZNrocLrG p 

I ICTLIYST X XTVOSDTNRLRYF X ISPAYOVCST AX INW 
CPn.0244 280906 281556 

adk-Adenylata Kinase 

CAFEVTKGSVFI I>CPPGSCKCTOSOYt-AhniICLPHISTCI)IXRAI IRECTPNCUCWCAY 
LOKCAFVPSDFVWEILKEKL0SOACSKGCI IDCFPRTLXJOAKIiCSnJffiVHSNYTVXFL 
ilSroEIUCRVCSRFUTPSCSRIYWSOCHTECPIXHVPLIRRSDDTPEI IKERLTKYOE 
RTAPVI^YYDSLCKLCRVSSENKEDLVFEDIIJCCIYK 

CPn.0245 281627 282499 

ydhO- Polysaccharide Hydrolase- tava sin Repeat Family 

TCOKEXMCHYLSFSPSADFFSKOGAICTOVlTCEKVL-vnCCSTCyAYSOLFm 

CHSFRSTLVPCTPEFHIKPNVSWSVDAFLDPVCIPLPFCTL^ 
^IVCSCTPQCDPRHLRRL/A'^rAELLrKDADLUJ^PYVW^ 
GFINILYOAOCYNWRNAAtXyyAOCKWlSSFENLPSGGLIFLYPKEEXRXSHVMLKOOSS 
Tt.IHASQOCKK'/EYFtLXOOCKFL^STYLFFRNNOR'jRAFFCIPRXRiCAFL 

CPn.0246 282955 292551 

rs9-S9 Ribosomal Protein . 

W^STIOESVATCRRyOAVSSVRUlPCSCKIO^FJ3FnWPLErORTTIL3PLKK IT 

EOOSOYCL r I RVSCGC I QCQV I ATTUjGLARALLXDJEENRQDUCSCGFLTRDPRKKERKK 

YCHKKARKSFOFSKR 

CPn.0247 2834)0 292969 

rllVLl) Riboaotwt Protein 

KJY I [MEXRKCTKTTtVK£SETTKSWYWOAACKTt/7R 

M.",IX VI V INAEKVR LTCAKKGOK I YRYYTCY I OCMR E t PFENMMARKPNY I X EKA I KGMH 
IKrRUWWUK.'XR IVKGttrVETFESOKPtLLDl 

• TnJKMM i«44SJ JilCSO 
ytV/yM* At** Tr.insportitr ATr.Miw 

K :K« rr » 4 •: '« Vf< Y ATF. at tFR.CPAi.'KK: *RKIlAl?LRHFK"RLAM**LL I EARNLT-TCT IQOQN 
I ;:i |.TIIV::i."4 JUV KttCtV »:* tf*«TTt JJlU/TTLOVrrKroLRFFDKHJCNODlJS 
flKhl M 1 1« llWjNVYU J.:DUrrVLKNVI >H*AL I ARMI t .'Jl" JU FVYTRALELLDLVNLEDXV 
im«-:KI - 3 >:Kr^VA I ARAUNETA I UJVCFJ\«ILDEET: ;EO tllNLLLEOAUALCCXL 
! VTI INK til A :i" *lt U W U W.K LFFI W:'. 

• ihjIjM-i jhmkh js4 in 
■TIM liyfxi tf>*i pmrtjin 



• .: . \ y: ~~ M awv * *W r — F ? • * * 

{^^^iREim^ 

CPIU0251 M6036 287559 

ITCTJySYOLVORVVKX S O N QariTJ r rSKA 

CPn_0252 288112 287576 

CT144 hypothetical protein it rame-shiec with 025)71 

SJ5«TCOTreYYRA»PMCW 

epn_0253 288474 287950 

CT144 hypothetical protein (frame-shift with 0253?) 

SaCRIit^IPTTCKniSIPTrVRFKIESXNLTOECm 

SJSJrVPYG^ 

CPO.0254 289268 288459. 

oSTTLStWrYSKTDVSOTlVCW 

cS&pimri^AAraxsv^^ 

Sct^TillJJrVSCNNlCTIXCTS^ 
JttXSTIWCJlitSLSTCVT^rPFPSNFVrVP 

CPtt_02SS 290183 289339 

SatStsotSSIna^Isoisi^^ 

KPLIOJFt>FHSC»SYI^T<2SEYTLYOSRWmSIFRFIGWK^ 

\^pK7M>PNAnRYRACXFYlAX3SSSAC XCNYSFSLiLYYPODRC 

CPn.0256 291282 290398 

SSSRliSpT^XSIPTFVR^ 

SSrc^LWOTINIRPTSTtlSMV^^ 

ScWFGYYjam>Dt4K>RAAiK 

NOSTKLVFEVtMCVPia^ISVXMajm 

?r1SsYYCTPL0VVCWYAQN^^ 

CPIU0257 ^ 292136 291267 

SvHKIwSkXLK^pS^ IKDONlJWATLNVIXaWt>X ENTLCTRDLKV 

AOTITSPCEFTVOOa^AESSOFKATTt^JGLEIT 

wwwnTCCOAI^YTYYSSSOPTTVCKP I ETVCONPNPETYR ISASAKIYOAVTWrPY I 
JnuS«Y0VTWRRI3W XAV 
TOTTLTEIVATPPHOTPWIJTXrTICIJIKSHSTCVIW 

CPn_0258 292534 292133 

CT142 hypothetical protein (frame-shlfc with 0259?) 

CFSTCRLCSiraXTIjCWAlOIiAACTYI W 

CT*^SmSOOCTYSTXCYl^AWRDHSDXDPOPNNPSDKY^FLFVRN^ 
NYSFTLLYFAOOXV 

CPn_0259 293031 292441 

CT142 hypothetical prot.m (trame-ihlfc with 0259?) 

IYPWKRICrW/rXDfHHnW0DNNECYFKti)S?TVtX3X^ 

3VO^Tn(EXVSATCLTSASTYIU*ATCPAPSSm 

YVP^P0YFFCAJ(PIE<rT7>1FlX:SSRYLPITC0CSNYTLY0SSK^^ 

HWOTOPYKFLLOEPIS 

CPn.0260 294090 29)548 

•ecA-Protein Translocase Subunit 

AYLOFSKRSCVEEDHVSKK INRllDtX PCCSNKKYKQCCLK KEEOTARYTTfCKFKFSAEV 
LSASEOCEACO»JCTKl^l^SLTSEOKAAVCKFHOITFJIKE\mSKK^ 
VTEKLOOHNFEILffTCENMPPKESTATLNOOTNFVCEOFIPTOEOFRISENSOKPPVEE 
D 

CPn.02*Vt 2'»4272 2^50 U 

vri.jO-Pf-«Li»ot* ,'iiipiirr.imi ly ATPvtsf 

YSFMIPF IVFM:rrUXNPrWMK ATIKR I E;;LVRKALYT«TMLANIIIIK IWAi JfJUKOSLTL 
IXKLKA I.TOTCFf'TiLOUIAVN fTlKY.I'XSAEVNKPYl.Th ['.'OQI^Tt PFRT t PflPVAPETP 
ECYPCTOARRRLLFOAAKE I0A2A IAP»aillR W3LVOTAllJILUIKAKFAH<LrVI««VIIF 
C/T ILR PLI FTFEFW I RKFAK Kir iFAftVTfRLM W::c 4«:r AKO::i JCUilVFPLAnilNI A 
LAt4H6^X:30F.I 

»;iti_«2*:i JMMii.i, jv>»n 

rjiirE*J«rE- liK't Ai:i>) l+^nintur.in** 

I . ( FN I NKEVKVf LVU<NKR t.K I ( LTNfX/ ; ITAKf >4. M.V.'IAf ,LFAN [« JO f Y I AAPOAAU:' 
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■ nC.iMAI. :LNUWCAS PYAY POPVr CAWAVrT^ p: . xLKJrrVE3V3P0LVlSGINCC 
fiN ."JKNAWYjrrP tGAAXOALVOC [ PSMALSOONHISTFOODKAPCILKALVtYLLSOPf P 

-.lttxn if^rrjpTO»n^Lvppa)erfYEEPOYLCSw»cmciciscvRicm 
-EEtACML£Wir:^prF:wspiajcx£Eroro 

CPn.o:«J 296H4 2971J6 

vntu nycotnetic«ti protein 

TrAL^RKLRVRPP:XAX*rXTPCniKS>CPRPTKrSrpLYrSinX5W?ILOCPXAACSV0 

■•• •„•.•:• : :* : ,\~*ft*airnUi v ::.v:.:**:«rr.Tw\:—'. :*a"f^2JR.?.v.' 

KSVTX rTSSPRXLCTILMTri^ICLTYIHAgSCYSCEPRt^YVWE3U^LJ0LXEIVHR 
EOPSAf lAIEKLKCVIHCIWT 

CPn.0264 297730 297155 

uDiO-Phsnyiacrylate Decarboxylase 

MKRYVVGISCASC^/ILAVKLIKCLVNAICHO^^IISPSCRm.YY^XXOS^lAIJ^EW 
LEYIHTHSIQAI ES SLASGSCPVEAT X 1 I PCSKTTVAAI S ICLADNLLRRVACVXLICrRR 
PLXLWRETPXXTIHLEWXJCLSJCSGATWPMPIW 
PSOLTKQWSNPE 

CPn_0265 298632 297730 

ubiA-Benzoate Oct apnenyl transferase 

FAirraivvN0Cir»FiDWQ7msicRvx,pj^^ 

3U3IASLrwMIWPYWKRVTFFCWGLCXVYTVW 

SVGMVI AAND I I YA I EOTEnJRElIXRSVPAHYCEXXAVElAJCVNliWVSYI.AY I FSGTVC 

SUWEFYTrAIXPLWILKVVIWYSNYSTaTOEeEStf 

R 

CPn.0266 299181 299876 

No robust homoloo present in Genebank/EMBL as of 11/7/98 

XNAUEINNO^PSQ0XASST5QT5KIN0ORXTFAC7^TLLWATI^ 

LCLSVPLSC ILGTFAVTVCAVLF ITCLTI LVRXSIC XEOWCDLNFIJCIICTPTPPARPUI 

SKF5VTCSTTSXVlA<AUXCAWS^FI.TGYIOLSLCASLVGX^ 

LAIgZSSCSADSQSNrVCIGEP7CAA0EQKWYXMAWRG£DCXPTAXRLTPDC 

CPiv.0267 300122 300910 

No robust homo log present in GenebanJc/EKBL as of 11/7/98 

V5 IKSUOCnOLLNOPEPAVCLNAWDPKY INQDRXTT ACTVTLLVl ATLMl L7TCVTVLL 

AKCSPCLSVLVSTIICTSVTTOTALrilCLVia-IKXSLAWXOYOKYFQEV^ 

IPIOJtMVHXLTSCLPSPUSXESPSPEACTPVSiaJUACSmXVXX^^ 

TGYtgLMj^ A^^ 

CPn.0268 300914 301318 

No robusc nonolog present in GenebanJc/GMBL as of 11/7/98 

SrVPlM04PTPPWPIPSEDZritAYEPFVLPKTDPE 



CPn_0269 302466 301476 

Dipeptidase 

VAP!CV>mEWaJLI^HPHPCRXDPAVRCSPra 

C^SXJTSLPWP0IGLI£YEEEE»:55S0XX5L5LXItS XENASALGDOTAPLGTLLWCL 
IHLTKQGPLAYIC IVWKGDKR F U,bTL APKEI^NIX3CVUi3 IMYELGVP IOLSHCSDKLA 
EDXLDVTADiaPNLKVIA5HSNniSVLt>aUlNLVCAH^ 

LCDLEKHVUtAENLGI LSS XVLCSDFFYANEDSNFTFNECSSAEAN PVLNQLIHRI FSKG 
KAESXLSSIUEXFLXOVXVEQVNPKXTCVXL 

CPn.0270 303343 302468 

ywlC-SuA5 Superfani lyre laced Protein 
SIFCA/IWDKKAOITFStPEVHSAWQGKIVAJJTim^ 
EPSJUtfAi.YVNSIEIUENISGYPLSPTAICKIJ^C^W 

DHSVVRErVDHCCTLICTSANl^ErPSALTAOEirADFAI>{DLCirtCPCSHClXSTVVA 
SDPLY I YKECLI S RSV1 DilACTEAK I FHRTSKAFSKH I KIYTVMCEOLVSFLSCS UDF 
KCVVCmPKPKNTYTlUJ»EAUa(KrPSIVriYXJIOT 

CPn_027l 303628 304362 

Lysophospnolipaie esterase 

KLMIWSFFRJlKIGNIEAXECPOIPOOPIIXLCHCYCStAONLTPTOICSFSK^ 
FPNGIU*U3TOFICSRACFPLNVLLLQn£RW^ 

EULMjmpYNEIXICCFSOCAILATmVLTSONPYAflAXIFACARLFNOCWEXSU^ 
QVPFLQSHGYEOEIL PYHUCAH UNDLLLTXLNGOFVSFKOCHE I PSWFQKMQVTVPNW I 



CPn.0272 305272 304340 

dnaX-DHA Pol XXX Garoa and Tau 

FNROSDAT"f A1WHKLEEEN0GWEALLRKVYK0 EVP PAI IXHCFTCPVLOOXAEOLASE X 
LLS3SPCSEHmOKIHPOI\X3FFPEOKCRlJiSIOLPIlGXKKOIYISPFEANYKIYXIHE 
ADRMTLAAI SAFLKVFEE PPKHAV 1 1 LTTAJCVQRLPKT I IS RSLS I FIERCER I LCSKET 
F5YLPRYACCEX PVTEVSQ I IKE^SETOKOVUUJKVORFMEVLLELYRDRYTLWLCLKAS 
ALWPEHVKEILOU'LLPt^LLtVESACRSI^SSAASVtXWVAIOLVSWYKEKEL 
VSVSPGOCLEN 

CPn.027) 305853 305227 

Mk -Thy* toy late Kinase 

• 12 IVFr/IEOCECSCK5SLAKALC00LVAODRXVLLTR EPOCCLICERLRDL ILEPPHLE 
LJROCELFLFLCCPAQH IOEVT IPAI*RDCYIVICERFHD2TI'/Y0GIAECLCADFVADLC 
. ;KWRPT J-FL.PNFVLLLDI PAD tCLQRKHROlCVFOK FEKKP LSYKNRX R EGFL5LASADP 

: :rylvloapej:l^;l iokvmlhtoljlct 

■•'Hi.0274 IUe Ifift 105852 

•jyiA U«A i5yr.i:;u Sutruntt A 

Kl.TUWIJKDn 1 1 VkKNL£EEMKE£YLRYSM5Vt XSPALP0tRa'4.KI':X)RRVLYAMKOl. 
::U';i*;A«IWa:AKI^JDTSaDYHPtttE^^ 

1 1 AAMRYTEAI<L , ni:yVHYUIEDLDKDTVO I VPNYDETIfHEPWFPSKF PNLLTNGSSntA 
V MATT 1 1 CI1INU ;EL C EATU.LLANPOAMEIUJVHFOPOFPT03 t ICCCEGl RSAYTT 
« KV* At< U tVEENEDKI IR E5 X I ITEMPYNVKK.'IPLI EOIANLVNKKTLACISDVRDE 

• !WW '• l V/VLE. iW.KZllC 1 1 1 NRLYKFTDVQVTFriAfMLALCXNLPftTKS t MRMSAWX 
HIIHKBV 1 PPftTRYRLNKAETnAMVLECYLKAUXTLDALVKT I RE3CNXEH AKER IIESFC 



FTEPOALA 2 wELRLY5L.T*Tl- EYEESXNK X AYYXCVL3C ECI/.T2 1 tRNCGSL 

SS«VARRT?IErD^lRDrtDi XTNE3VI ITtXDDYVTRMPVsVntEORlsXNCVT 

p^mi AA f ^ I|Qff aaACiTri>77cw3vgtKvs ouc 

uTtfcnrrirv»<t.FTHiriXAVRfPHCXV*PWC^A^ 

Lrva£c™ttLvn>nOT 

XR UfC0Vir/>CRjr7T/R LVHUCECCALVSH EXLSSNDCCEVLS CS EEECSTTVSXJl 

B TPVIT I L E X3C C rvTVtrae:! P I tVH ERESAJCOCRr\»'SALrVVLTVliUOClCFCTI3SYXV 
S«WWEPXYICCTIWTOEIEPEAA1^«SC^SILW^ 

ivlSv^AtTxrrEPi^ 

SoSropTCiiwivErosAa^ocR^ 

iCTiiAALCcs:&*aaT^KiJtYrtiiiiiff^ 

VYIAQPPLYX VSAKXDf H YmOCE>CSYtLKlgTVtSS X LF KSTEXXXJtCEAiXSP IW 



EVWYXJWLCRICCX EX0mOJCII«aXWOTI9WE0^rHVSI^VTA« 
HCEEVPPRRrriESHALSIRINNLDI 

CPOJW76 3U140 310793 

CT191 hypotnstical protein 

IMruaudblQeSOVOMOTASPXKHAOT 
KaCSOAIGrWJHIXXVKVYNSSLYAXilCOTPONtt 

CPn_0277 312003 311404 

No robust hosttlog present in GenebeiU/EXBL as of 11/7/98 

NisrrYiwiKaaEvuiwx^purwixja xwn^ 

KXV»SIU SSTYAESXEEVAOOUXSTHSKSSTSVNXJiANRVK^^ 
EITALVLOVX IKLIKa^XMjCVOJCLCVCVVAI IGA1 AIXVWVmXAFCSQGBLI 
PIEVKTLXSPXJKPYPtVVYV 

CPn.0278 3UB84 312060 

•conserved outer sMBSbrane lipoprotein 
RxaMom^LLvgjrvx^CHTOAQrqciRxvASPTPHA n r rsrorT ^ ynrfl rxxjcxL 

pVDmiPNHXXX4)K0VDAKrTOK0ArLD^ 

SLERLKSQXJa-TXAlPVIJRTNAOR^T Hf 7 FTTCLXVCKCPANLMfTAKIJVaaCPIRJXNI 

LEVSAPLLVGSLPXMUAVIICNrAIAANI^POTSLCIXXa^VS^ 

P10C»XOKXJT3SPSVWFTT7rWfKCNILTKTQDtC 

CPTV.0279 313546 312875 

• possible ABC Transporter Pemeese Protein 

KKllCSOUOXLXJCETVNTLYWSTAm'SCAXCOC^I^ 

MILSFtTAI PFAILIVILFPITRWIVCTSLGPTAS XVPLTXCaiPrWTIWIStflWSAL 

NYIJSAVAl^XPKWIIJCII^ESYPQUFSIJCSLWHLISCSnJ^C^ 

OYCYyRrEWSVTTSVLVITLVX^ESVRIIiaJPIW 

CPTUJ380 314593 313550 

dppP* Dl peptide Transporter ATPese 
XKWWLVSEX3HSPIXSWOVSIOO£OHXLUICVSFSVYPCEVTC 
LX^IJ»<PTSGSISVACFIWLPTWSTUWFSnCVAYI 

KHSEKSKSEVEEOVYUrLNFLNLYHRMDAY PC^LSC<WOXVAX ARAJVCOPEVVLCDE I 
TSALOPKSTENX X ERLLQLNOERCITLVLVSH EI DWKK ICSHVLVMKOCAVEILCTTEE 
in«SENSITNELniEDINXAJa^SCYrAZX»EEVlJU^FSKXIAXCCI ISKVIQTGLVS 
XNXLSCNINLFUKSPHCFLX XVLECEUESIUOCAJCEXXXEXXWIiarY 

CPn_028X 315033 316103 

"dhnA-predicced 1,6-Fructose Biphosphate Aldolase Idenydrln 
faallyl • 

XSIJUlHTI*LNXKDXUaJDDENLL5Y0Cim 

U/mFSHCRIJVNSGYl^ILPVOOCIEHSACA^FAINPIYFDPE^ 

YGTX^XXSRKYAHKIPFKLiailHNEXXSYITICYHQXFrrOV^ 

ETSNEE mVSNAFAKARSUXATVLWCYXJWPAT/A^ 

ADIVKOXLPTCaXFKAINFC^CroEimSELSSWPIOLrRY^ 

SCJaffirrEAARTAVlNKfLAOOC^IlJCRK^ 

CPn.0282 316084 317529 

xasA/gadC -Amino Acid Transporter 
ItIWSLNTSKKVF>WSHSXPTKPl^FTV«LSWWISIJlNLPL^ 
AVICFKlPYALISAEL^FTPOCIYIWARDAlJCKWWCFFAIW>O^MNMTWYPAVtAPIA 
ST IVYK INPELAHNK\*YI ATVI LACFWI LTF FNFU3ITSSAUFSS ICVI ICTLIPCVILV 

NY PKAV F ICAX ATLT! L'.IaIS LS I AI VI PKEEI SLVSC L*/KTFTLFrDKYNLSWMTCIVV 
VMTXAaiLCEUMfcWFACTKCLFISTONOCIJ^FKKVTIS^^ 
LFLCLJJSACLVYWILTAI^VOMYLAWXCLrLACP I LRX KEPRAORLYSVPGKFLCICTM 
S X US X LSC AFAX.WV 5FLPPRELAQ I S ECSKI C YTTFLLLAFS LNCLI PFC IY FTKK RLS K 

KS 

CPn.0283 318581 317532 

no roeust nomoi^j present in CenebanK/£M8L as of 11/7/99 

CRRLSYFODL IKNAVAKI ISFRKSPPNPVKLLI KFAKKDUTNSS IAPLYEVtLEXLEAPC 
EEILEVlJ'SLDPtSrfLKSKLOPXKM^LCXEXSSETAETIESCSLCLISINUiiCLClJlS 
SH0R30AVK I IOOFCPCFSSEEVCNFVEORNI LTPFUlHLFECOEVALLNOLHLRLOLXV 
PNALY PEPPPJCHJ:? I NSEIVAK DAEDQQ EDFNKTK E//7KECLK K LVLPAX^C ITCIPOLL 
RARRFKQOAE I LMA r PRKKHKONrF X FLEALLE^'EEF" I .T/nKYUCUJ4PIIIUW)KLLIlA 
IYLCYP7PXIC0I *K I CTFORRANIMPEAFOAA rOOnHi^FLFrKMUP 

cr*nji2M4 »l«»0S4 um«,si 

Nn i^ur.f twmil.M pi. veil » in <ton>i»»jnfc/KMUL «•« I I/7/«»M 

KU#vi r r'Awrviivi\\T«ir:w:i^uc;x:u<pir/:.i la i i Arut vi.yfvwi i a 
v« 7Thvif*L i ru.voTvu VAYirwr; t rrrrvrr. tT::ri;vFF::PKni*juii weed:; 

VISA I OELl JCNFT'Al Jl^R rKHLftDtlFUDOJ *f«NFJ.> KEI)r:HT::K 1 1 . 

Nil n^icr homo I. m i,i i^<i»rf»mic/M«'l. .k: .»» ll///'»H 

KKT>TnJ'FFTANKFXfA:yiw.r^Kr^::F::i.;i'/riu:iJA[:nrtJa4*^KAlMVir/i. 
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AAPLCLLVW3CAA3VC3WA IV3LMCLYKOCKPi iNEEX IDPTKOLE IKOPESLXPV 
PVECOSLPKERICTVSFlCAK t PS tVETOFXPYVlOSTFYKONXVYSKP IAERNQSLEXEXT 
TLIVDf PRAt iFFTiXTSfi Sf ■ > nRfTV ISEIKXLFLWfLCT XVKY5L7ACXJIRXXB I VEXYA 
SSOLLILLLTKPEPLWVTQQLIAHXiiSUCmiUCLTPKICKLVLSI^^ 
I EKXVAYDPNLLTIJEIJWWZACN 

YVPAINSSTfMYOncmJVLIOCSIJERlAfCEXIPSPSSWmSVASH^ 

SNOOSV I LONPFLLXELLHEHPKCOTftKCliOCAH WSl*<AALfRPmJOILCSC3 IARX 

KELKX I AEM LTVPFXEnOA t ASCXILOLLLOKLFOr 

ragtE»Mo»» Transporter I CHS Domain I 

SC1«SXGXI>l\«EC*mNEnCLOTAI5SGHL^ 

DLSJCmEYWXOUYAVSCIJSESJ»ILTTCN«XTAXVArXI 

SEVUU,XEOfPTOEAVWVWXPt»HY]UlXt£UQSIO^^ 

FFAFUtETTVKDVSAC IRSNPG IDLTRLVFVLOFHC ELOCWTORSLI XNPPEHSLKOIM 
WXfHKVX^OATREEWLVDnfWAAl^VVD^^ 

MA CH EDVCYOTO ftM^rUWS^mJACLXSASVMAYrOKI spaualxxffipl 
XNCMSCMVCVOCSTXLVIlSMATCTrUFCRIUlETXFKEKS XCIXTCWLG XLCCLWYUC 
FLCXXJIIFSGGGXQLCa/TVAT<WLWLT^^ 
IKSHIXFPLIACCINFLFFN 

CPn.0287 324210 322089 

No robust homolog present in GenetoenX/EHBL at ot 11/7/98 
RRCM^gW 

TC 2 NFELYSOTO1WCF LSEWDNFRL X EQXilOOiPQLKNMJCWEHnilCAJCOEEVSLVAS 

yowafrSgSSvhgeli!^ 

CIAXliS.YLErPNIJiaBIOOU.ISAMRTriWSAl*VKALCV^ 
SCFOSCliWOLOATCaW0iaJDAYTSWIJa«SAWiW^^ 

RCIALKCIACATXDEAY1CEIFUESIJi<yQRAWXU^8IUXXX>EX>COSKytl<AEX40S^ 
HYCTAYTLLTKVDLTLSSSRVKL ILAAVLLCKGRLLQDTOPAEEARE XX*EPLVBYYUD)E 
NFLLLLCKVYXPLFVWNTOATCLCXlARTYl^tA 

MV IRSAOYCVR ITEAXWLNDPYLANLREI HAFREVVENQXCRLWLC30CTT2IXRN 

CPnJ>288 325785 324571 

CT288 hypothec ical protein 

XSmREPITFCFECWUCrifWHSCTNLTSTNESIJlPISPKA^ 

HRStm^VSVCXVNXYtWn^VRLTVtAL^ 

ILimLLTO*YILHRX£XXVIVISGVCIPPFSPJ*CW^ 

LD ISTLSADGSCIAAVYOCPPLLFlUtfTCPCI PCAHPFVALUUlXYNURFtVVPrYX IF 
W XY E H FFCKHLPEPDRnYTOVAMKSRSLAAnJCAJF^ 

LHcsvERWNn^xjuisvsiwa^siJWEtaacwa^ 

RCEIVSCAHPSIQLPERRCLCrrSGRYPHISVIPDSGNDSAXNFIV 

CPn_0289 325797 326996 

CT289 hypothec ical protein 
NTMRXMOOORSHYKiaiNUIII^XLVCLCLSSVOSPW 
VFOkLOTITSI0fffTm<VTUnCRXLYYTU.T^rA^ 

TKCNPXXnfLEVLSOTI^ENXFICPFIOGNVX SAA£XAVIXGTASXJI£EX£XHPVHQFTHS 
FFSinJOJUGCIJUIJ>IA«tfPSVILrXEXJaX3SNLTi™ 

ILLKIKKVS PUtVAXAMSPALVTAJTSK&SAATLPLTWFT AITDT EXNXNLSRPSFPLCS 

VINMCC»AFILITVIJl/ATSJraXISPI*51^OTriAT 

TCKNVPl^ILCLILPFYT^WIITSt^VWSDCCWSLAN 

CPn_0290 327027 328523 

Na -dependent Transporter 

RSAl,WMCKHASFSSRMFIFSWGIAVaACNIVRFPW 

I PLII I ELSIGXLTlOCAPICALIXTAGKXFAliAGGFITtV^ 

YAVSCXIKLGNWAXX^SHYOSSIPLiiAHLTSLGIAYLV^ 

Fl^IALLLRAVTIJCAVQGIKOI^SCOKSCFSWYia^ 

GFASKKTCWSNCALTAIClJNLVSLIJCr X IFSTCASLDILCTTOLOOGACASSXCXTFT 
YLPEXJTW-PCCIYLTTXFSSIFFWFSMAAl^SMXSMlJTJ^aTlJk^ 
TIXAFVIXjXPSAI^LTFTSKQI7TVVCVALIV1CLIFXYAAI<VYCFPKIJCX£VXNA 
RLNXAFUYI IKYl^IECIUJjCWFYECt^EZW^PISLYSLC^VU^SLCLIIL 
WKFNXQLYLRFSRYNHEXL 

CPn_0291 328658 329194 

inc$- Inclusion Membrane Procein B 

EKKMSAPIPTWEl^ITCUIVOYOOVSEIJWENKCDI EGLKTLT AALTADAG 1 0 PSAO 
EIYSLOTAAALILSASKK^PSCSTECSVTVDSPCKFKKVl^V^TIIALrAIAVLIA 
CIIAACOCFPLLLSAl^YTXGACVSLPIXACTSVALICl^FVANSUKPVITVRTTR 

CPn.Q292 329201 329836 

incC- Inclusion Membrane Procsln C 

VXNTXHSDFMTSPI PFOSSCOASFLAEOPQQLPSTSESQLVTOtXT^WKHTOALSETVlA 
QORORLPTASIILOVOCAPTC»A<^PFOPGPAODHH«PIPPPWPAOirrE:TTIRSELO 
UIRSTLC05TKGARTGVLVVTAXLMTXSLXAXIXX I LAV LCFT<T/LPCVALLMQCETNL I 
WAMVSCSI ICFIALrCTLCLILTNXNTPLPAS 

CPn.0293 329940 332723 

CT234 hypothec ical protein 

WSMQR VLRLLFNU<HGEEKRAFLFFLLGLVWG IGCYCTLS LAEGLF IEXLCS AELPX X Y 

LCSSLXLCVUSLILYNX^IOCNXSATALFLXPVSUXLO^FYLXLSSIFAIDPPRSPLrF 

YR1V IWSLTILSYTSFVCFVOOPTNLOOCKRHFCIFNAX IFLCOAIGSGI IASLVHTICI 

QCILXLrTAALVLTFPIVFYVSXSUSLSDDHDLf'IOTCHPPPL^ 

UTr/FLHQLUAXATEFNYLK XFEIOFASKEEPELVAH ICKCSLW! SLCNMCFALFAYSR I 

VKRLCVNM tLWU'Fl^L*LFVrPKTTLSXAVlV\KVVRECV™ 

NX I ntj IR I WESF t EP ICKLVWSt. ICFLSSQOYVFCL { ISL ( AT I LVCLVR.T/YAXAIL 

KNLT^OAtJOLTRSHCWI KnMTVKOKRQVELFLLAMLKH PSERHOTFAFOMLLNLASRSV 

LP"JLl^lMNKLSLPMvLXTlEMVK:^LWAXOFLTlXL^ 

IIDLUI IT1I I AEOLYDTVGORLLAA ILTVRROEAYGPYnOLAOKRUCELLNSOQPEOIVMC 
LT ILTLEXNPONFP I LLDFLNTKNEOIL tVTCKALirT.'.VRANUKPYCPELLKRLROCSHN 
DEA^ni jtt i:: I AU)I:*VKOULKTT^^ 

' .TIWPCR ( LAAKALCK ILttWLLXJCItAYK IVJWKAUKALFY^YIh^lY IOKKY PTYNLSLLA 
NTI /WIYYAKVNFMl^UU I U3SMEHSCVL IRALTSXNOK I KAOAl£SLEWI'.T3SKLFSL 
l.U*FVNtyl*^,Y:iEKYYPKCTVIPLTU<ELI/©WEH^ 

ynvwr i ynooiedkrteketl i skus i 

CPT* Jl'A'tA 111077 IHSII2 



cAMP-Oeoend*nc Proe* <iru*o ReouUrurv :»iourui 
IRNFFWUDRAFLUCrr: t>vSLDKDU.LT:AaiCTET: tFKPCCN\Tr: TCOP^FSFYI Z 

veer it t^xEiux^ u^ ^o c raga>»T^RWja^AmvM^vu y c^ ; ty vt 

BCP3VALSFLELTOUC3XXFHEP 2J ^ { ' • ' ' • * 

CP*U029* 33386* JJJ«7 

acpP-Acyl Ourier Protein 

AKStXDOVtAI r/ECLCVDPXEVNENSST IBBLr^St^LTr^XKTLEEJCFAFEIStlDA 

*••?*! ,**tv **yp? A "I 

CPn_0J-.»o Ji4-*V- jj4-.« 

CT296 hypothetical protein 

KI PIROl ICKDXTLVCJCKVIVTC3CSRCICLC XVXX/XEJ»3AMIlOJ4I»a0AVXISL 
TGXi2CTVSFARVDVSHNGCViaXVOXriJXHOT^ 

XSTmTStYYTCSSVXRmXKARSCSXXNVASXVAXICSASOlWAAAIUQXIA^ 

KEVAAW«RVrCl>PCTirrWT^VUJIWIJ^^ 

SSYWTAOTtVVDOGLTY 

CPn.0297 335724 334774 

CabD-Malonyl Acyl Carrier Tr ana cyclase 

stsHiasaNFHiamATiJVGazs 

CPEtaiWTVHSOLAIYLHSMAWKVl^ORSSIOPSLVSC^LCSYTALVASD 
l^VRJCimiiOrEAaiOSICAHAAXJjGLPSEW 

AEXVDOAI ELF1U3IjQCX3XAVRXJCVSCAF7frPLMQVAOIXUAPOX YAX£HXX3S 5ULVSKV 

VCKSLVmXEKREOJkROKISFTl>rrOSCY^XESEVt^ 

ITSLGTFAOIEKFLSEV 

CPIU0298 336742 335717 

CahH-Oxoacyl Carrier Procein Synchase XII 

YTSrFLYWWFSVNKMtKAAIWATCSYXPEXVl^NADLE10<VOT 

CPQEYTSLMSAI AAEXAI ANAGLSKDOIDCI I FSTAAPDY I FP SSCAiAQAHLC I EOVPT 

Ft)COAAClt^YCl^VAJCAYVCS7rYNHVLLZAADKLSSF^ 

XCXSJUCSX^NXLSlJBAfiSiaAXXX^AaC^CPASKETXASCXK^ 

VRJQITTAAXHSXAIACXOEECXlMn^PHOANERXXDALAXRrUDE^ 

ASSVCI ALDELVKTES IiaJLTTIXLVAroaa^W^VVLXOV 

CPn.0299 336726 337415 

recR-Reconbinsclon Protein 

RXKLVYTSESLYSlILNMPRPEClQOCIHITKrRYPDY 

FELX5UDS£0LKXia<AFWVASERSHCPU:FTlJ(ESXEACX^ 

KDVFTLERSKVTXC^YW^^U^PITX^XENER^ 

COATAtnJC0EX^SWISRlJuXlJICUIT)YVDSCTLAR^ 

CPiv_0300 337768 340152 

yaeT-Onp85 Analog 

CIUiac.IiqU^IiaXSIIALIOTPLTXrSTCICWaxanyvvK 
PKLKTRSCAI^OIOTECtJUXAKEYPS V EPAVgSECKTm XRNIHX6C 
NQWPEHKXUCnflXYJWDLFTOEJCPLsmDDXJ^^ 

D\aJKIKECPCCKIKOLTTSCXSRSEKSPIQEFIQTKQHi,l 1 ISWF lli ACLYHPOIVlOD 
SlWaT^YLKNNOYADAXVNSKYmWlOIXlXYIOXDRCSRYTtCKW 
EJCQSOWSPNI^YCPTOXVDCAHXXKOTYAICYaY IWTWVPVLF XPHATKP 1 > WT» EVSEU 
SPYKVCUXXTGtnWTKSXTVILKETSLrPCCTFNRlJCL^^ 
SQLDPI«3NAIX7rlOXrVEVKETTTCNra 

HC2noju2aci?a^xj»NrctucvTtrmj^ 

QTYO3MVSTTYXXlIEHLKYt»^l^0TSLHE^ 

VQSPRTFTroiROCV TFEVSC UXfrtHrf KL SLNSS IYRXLTRKG I LKIKQXAOFXKPYS 

ISAFVTUlSCrVCWEYXISUCOLWSACFCLPJDV^^ 
XOVSORFTFALaOKF 

CPn.0301 340163 340762 

(OnpH-Like Outer Menbrane Protein) 
IJC£U£K£IFVVFRJCGIWPFSXFXLVOVXM!CKIXF 

LEESOLCKXETEELXAKK00FVW4AEXI EEELTS IYNKLODEDYKESLSDSASETIJUCJCF 

EOl^GEYlttYOSOYTOSINOSNVXRIQKLIQEVXIAAESVW 

CTOJCTTE I IAI LKE5FKK0N 

CPn.0302 340766 341866 

lpxD-UOP Glucosaaine N-Acy transferase 

SKFXETSMSEAPVOrrLKOI-AEIXOVEVOCNIETPISCVEOISOAOPKH IAFXXNtXYSSF 
LXNTKAGAX X LSRSOAM0HAHLKXNFLITNES P SLTFOKC X ELF I EPVTSCFPC IHPTAV 
IHPTARIDO^IEPYWISOHAhlCSCTYICACSVIGAHSVljaANCLIHP^ 
ICNXVVV0PGAVLC5CCFGY XTNAFSNHKPLKH UTWIVCTDVXIGANTTIDRORFICWrV 
IHECTK IDKOVOVAHHVEICKHS X XVAQAC X ACSTX XCEHVX XCCOTGXTCMIS XAOHVI 
MI AQTCVTKS ITSPGI YGGAPAR PYQETHRLI AXIR^PKTEERL^KLXKCVRIX«STPSL 
AEXPSEI 

CPh.0303 342982 341921 

CT303 hypotneclcal protein 

REOKGLMHMOVSRK INRHTCFYVDS X DCV X KNPOHK PS ESKSROHEELEEKLLTXTKR XV 
ASAQEFONB KTD5K>Tnr LX KTOWLPF KWEEIX0TK ELF AMLTSMDKX IAOL FFYSPCCSS 
OWVerTEVICHLNSSXCUSCVLLCCCLFECOCEKVVT^ 
TYRNISU^COSKSEU3KEI£DVLXOKCVArrLIFKEI\^IOlXKY^ 
ARIYDNDVPTLPSVSSSPIALRYSLA>ITIRCLALHVDFSSLKFISPSILSNTEMTAXALli 
SOGECF I FSNLD ETNLGKK IVMOLLRTCK ISPE ILNKWl WCILMIKRRVRSLY I 

CPn.0 304 U»09t 304 

prlhA/fjdpA*Pyruvar.p OAhydroa^n,in*t Alnn.i 

OOKPLPKPLPYKKVMOr^APYN tA:XyrrEK.7TVF» I LO LYC PA:TC t K F LKQMVL X REFEA 
RCEEAY LECLV15 »FYI l!IYAl10EAVATAA I ANT' ILOFWF.I.'TY RH !AU\ I LIN X PLOCXAA 

EurxrvT au:ih»:»o<mi ^pnfrv ;r ; i v* nj i plamuaft i kyueoknrvslcfig 

jrJVVAf r/HIFTLNFV:;iJ»OLI-LMLl £ENW fW'^VtTftfXRAVAK^M' I AE30a~?YDXRAV 
1VN» ;Ffjl J- N: -l J ••FHKAYRYMVt/TRJU'VLVW.'lJt .':;hFRiai: I riDrW .YN;.*KF»epCLF1tK 
W* ( Vl/ir I ML I RLFVl.TFJ%EFyn I R9RTKTAVI .F>F;WAKUJr;nP;TVTTLEFJriVYA 

':Pn.OMi*. U4 142 i4M;7 

»*lhO/'*«it»l»'»Vrnv.ir« UunyJri^i'^t.i::» h«.r., 

M KESMI «F« IKTU: I RE*M Jl EA 1 1>F.EK'.>&I "NV'I I / '.FJF Wf TW| FMYKVTFf 3 J.fycW ;PKR V 
(UAPIUVAAK:* Wilf'SAAlM^UJ'PI I KKNi>«» TIIVAJ 4iy » t^ilAAXHI IW I ;KF.',VP ( 

v^Rnrtsy^AAov:u.v(i:mi , vFj:i.YA^iir«:i.i \ lA^stU'YrwK'iUJjj^iUNNm'vi.PUH 
KkEYMJ^f JEVITEK7I.VP i* :kajirvow ji .t i nv: ^nv;: i tkkai*::i akxrwx;; ! Et 
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I0U»TlKrt0I3T:L33Vl»HTSRCIvrE0CMYr» iEXIALITEHVFDSLOAPPLRVC 

oketpmpt.;* r leoatlpnwr iujt r ervmr 

CPnJ)30i» 3451)6 346431 

pdhC-DlhyrJroltpoanide Acetv it ranst trait 
QCFVXSUJtMWUPTMEWIVKWHiaSNDOVSfCCVI 

E 1 LRHECEK IV ICTP I A\H<STIAWCPW<LEEIXPlCTEt > SNLEASPKS5jCEVS PATTPOA 
ASATrTANH'FKrCPPl^PLVFIOiVCTnOCLoPtATOl^KEXNIWSSlOCSCPCCRIVK 

■r;.:rv.; : r •:.•»:■• rv: «vr «-:/ri.vr/iiFENf.r r r^rAAPr.^A/r^rrMrvvPOCVY 

VAIPOGHTPI I RCA1#JU4U*I^IKSLAUCAJU*^UJDTEYK^ 

ETTAXWPIWXWVCSVTBOALVLramK 

KXLEAPAVLLLN 

CPn_0307 " 348998 346515 

glgP-Clycogen Phospnorylase _ „ 

NCCI Vgtf SS FDCTnCVSVDSlOauaLDRfcYLSVVOSPtSASTO 



SDAGLQQGLGRLAACYt£5KATLAVPAYGYCXRY 
NPWEj^KCEYLWRn^ 

atiodi nwYTKmciiNXJUKvvvoi/nn^ 

VXFNYTNHTItJEALERWPLnjSKlJJtt^ 

'X VEECYQXft tNKAIUAVVOSAXVIOV5SF>iS0UIXX>TU RZP ¥11 1 PEW INVnCVTPR 
PWIAXJCMPRLSKLLNTTICORYI XBr^HLSURSFAEDSCmHlflCCVICXJOaOOLTSRI 
YNTCEIVDPNSLTOHIiaiXHrVXROLIOT^ 
AFGYVKAJCLIIKLIMSVADWNODSJ^ 

CMEASCTC8#0(F AI/CALTI77HX3GANIEMAZH IGKXMKFIFGLLESQ XTOUWEYCPQT 
ICDKOTKIROVlXLLEOCrTKSNDKIJLrKP rVHRIXHZCDPTTVlADLXSY XAAHBfVNX 

CPnJ>308 349213 349S96 

No robuic hooolog prtttnt in Geneoenx/EHBL as ot 11/7/96 

nrTOENKMATVAOTPOTTQroPSVSHlUl^ 

SCmSXACIVXAZOZVtJUZZALVXJtfMHXIQnCQAirrAa^^ 

ldwyssk 

CPn.0309 350977 34959S 

CT309 hypothetical protein 
n<RANEEFtXXOEmCmiTVI»MCJtSIJCVX£XVAC^ 

SCLVMJ>fl«PIHVHVrSVDKAAPrY1CP!X>OQ EJUA I< F I WYCSVWPEMTTSNTLVTPtM 

t&nvMjazrTKsmnxxvTrmitiFGnGsa 

FTEHLVSAIRSGEWKFWrTOtfDAIJriEDIEWSGra 
VSSSYAJTOLVAVlXIU^ISRFElCVAXPIHI^VOEGX^ 

NVAOTYCVS0SS ILGRSOSWEYWRCVAKYFCRO^^ 
UD3ME!3«HDIHMAIODI5KNLNSUOCSLEPrPStEMII 

CPn.0310 353472 351049 

60XM-60kSa xnntr Heasrane Protein 
mxLLSunwoHNiomxrvsuGXArux^ 

AVESVGLSVASWXmMCEEHOWAVRV^^ 

njNiHiALYiw^nnmmacvrLPTTmB^^ 

KXOTIFGTALVFTOGSI7YIPXi3LYDSB*X3XVS^^ 
YVLTNX»MOXXVSEESGSIEGINLFroT*^ 
LPCGOWXNSIGGYYPLLRRGIASIISJ^^ 
HSXOLESX^RSVt^CVYKX^SMPeZXX^rvrm 

TXXYRVXlQNQCSLOKVIXIVVICEPLAXfaCVYP^ IL7PL5EXA5GTS5 
LYXSGSTAPTfU^SPIQlOLYFVSlCYPTC 

P Rm qpg a wciftPSiSFwcvpAriwrjuy^ 



IXIQLPn^AKraJJCSSrU^GASFIPCWIDNLTAPDVU^W^ 

ii3ivKnjWKvrsiioaa3PVTTOOKOoow«3»ow 

VV00WITNKIlJ3SKHIJ(NEVVU«iaCHH 

CPIUOUI 354453 353575 

CT311 hypothetical protein 

t>qUfK\vrvVDKSKIVVISPTPWSLRLTVYCVyF r V GIPX«ACLSAJrci*ISYYGXJCOHLS 
FSKSOXJWALQiFFXYSILFXVFGARIJUfWFYGWSFYWPEEI lOXWHCCLSSKGGVL 
G FLLWAA IFSWI VTOCK ISKLTFLTLTDIjCCSVTC I AAFFIRIjGNFWNQE I VCT PTSLPWG 
VTODPMQCVXXT/PVHPVOLYECISYI^ I AC! SVAFI 

RPFAEYVKSHQCKVlAEDCUTICOILSIPIJXrGVALLI ICSLKARRHRSHI 

CPn_0312 354518 354976 

CT101 hypothetical protein 
CTHAWIKYI^ILFPGILWISAGMKUJJCATAI 

YLl5KTIRK0WLSSE1TSQKI7VlAYXK0TFISRRFLIMVIMIAFSLVlJlRYrSNPOA-L 
FV XRATVGYALIKTA IAYFS KLQNALMENPEGN 

CPfU>313 3549S7 355355 

acpS-Acyl -carritr Protein Synthase 

WKXLKEX5ANSMEXIHICTDIIEISRZREAXATHCNRLLNRrFTEACCKYa.EICT0PIPS 

FAGRFACKEAVAKALGrcxrawAWKOiEVFicvsOT^ 

C* EYAT ATA X ALA 

CFnJ>3l4 356285 355353 

r. rxfl-Thio redox in Reductase 

HIKSRLX I ICGnPSCYTAAIYASIUliHPUJECFF^ISOCOLKnTEVENFPCFPECI 
LCPKlJOe#4KEDAVTlFCTTCTLAOOIISVDF^VRPFILKSKEETYSCDAi:r t ATGASAKRL 
E I rX^fTOEFWUW^ACAVClXySPIFXNXDLW^ 

P.RM(U<A:PCAMEARA0NNEKITFLMN5E [VKLSOTni VR.TVD IICNVOTOE XTTREAACVF 
KA I CI IK WTD FUCtfLTLOESCY I VTEXnTSKTSV FTVFAACO^^OkTYRf^-TT^AGSCC 
lAALOAERFUR 

•ItiJMr. lSf.977 l*«H7hi 

it:l ::i i< il-n^il Prot.un 

Ml-KOACYT** ^:KKU£Ht BCLTEOV AEFXDLLYT AH P IT^XEfctflNE tOKW t LKLTW 
I • INKOFVWIAA ;UC:tFJ !V t PMlTEF r DSr.B ', LVLlIA EVFVY LDOA KPEEi I KWL5 R EK ATR 

•.*<xky I ijur'Kfj a: i via 73 rrRma a. 1 voaa«!AFLPi :rx>t unkk t kklloyvckvc 

KKK ( LK I tWKRRM I WtWHEtAEAPJI t.TKKAELIEOI * X( :GY Rft IWKN tTt>F>lVFLOLO 

1 ; i nr;u jc msnwot 1 km p.:f*vbuioelevi i L.r/MCFj<r;RVAu :lkckehhtweoi ek 



KYprcxRVLr* ivTLLPYc* : cec i ecl i h i :jemswvw rvprnr^NraivxAXy 

L3 10K0ECXX3^S«0TERNPWONlEEICYP tGLHVNAEI KKLTWCAFVELEPCIECUH 

VISCWTXrrATCATBnJ0ffiXCM5IB^S«XJOKP3RAlCICPX xsxznMMVOUMH 
KKVSLSVKEYXJ^I^YDCOSRTEXJFICOSOCPICEWaKCK 

CPH.03H )«7R4 Jfi0l21 

rmsA-N Utilisation Protein A 

. ....... •. ! ler:^ •.•^•-" , "«^p.^-. f ?wrTrr^^rr^Ar>•IAAlTr:r♦rcA.•::^.^ 

"i.-v-f.:r*-r-.>--Kr=::vKr '.tirvKtru fareyih »» vi- VY*tyvrr/rTOOT»:AAH 
aaku: i jgfcLRiuEfcLV irtb-i kiiia-nltu^vickkaku^li i ouac/EA I UTRA' ? 

KTEJOOC ICOK X YALLYEVQ ESEJCCA E\'I L5R SHAETVKOLr I3EVPELEES5VEXVKXA 
REAGYima>VR55I»KTOP^PKa0C5llVINXXRIlNOEXXOtm 
LYPXEXOXXAXLESCICVUXVVNCUroAWXCaatGXNARJJSHX:^^ 
LEIORLQLAE fMPWWPLBqSXSKLVIOIgJHACYDTIRRVT I »8AWDLIfVfOItL 
ELAYKXX4SVSXYCX5XV0EXPEXES 

CPO.0317 360045 362750 

uifB- Initiation Factor -2 
SIXIMLSKSA»#<EXVXLTJOnja^ 

AKDtSVKVAUkAT^TPTASAEQ AS PESTSRRI RAXNRSSFSSSEEESSAKXTVXTrSKPAP 

VSXAX)PEPEL£VWEVCOSSPEVHPVAEVI^EOPVUETPPOEICI1XPKP^ 

HXK WG PT CK HIWHX4JUCrriUa t AKEX«WACSl^ ILL I C I T 

FJQFNPAKRS PASGI*RIMGKIOILTOFWJWKXStJISXJCArrCROTYG 

RVYXZ^OWYDEASXORmlKXSX^XTVXSLAAEIOCXJCASE^ 

SETAVQFIG1JJTXTIDIX>YSEOOKU^NCTVTU)EIOSTO 

GXTTUQSLSUCSNVAATEAGAXTQHH^^ 

DIWI.WAGDEC I KE0TL£AI EHAXAAD I A IWAINKCOXPNFNS ETXYROLSEXMXXPE 

AWXSTVTVVrSAXTCTTTr S ST J .FXUU^AZVUIX^gPSARARGLVI ISELWCCLGPV^ 

TVMQMtaiJgjCSgAXVFWDrySICVKTHHWt^ 

VVKKlXTAWIIEARSAGOORTAL0OKKRPKFT>SMX^N^^ 

X$XXXSKXVDVEIl*TJISVCEXSESDIUJkAA5IU^ 

FTVXYHMIUXKPMrSLLDPXAmKMCSAEIim 

VRVXJWXEXXJfiam^SLIOWKEIlVX 

XL 

CPn.0318 363704 363126 

rot A-Rlbosone Binding Factor A 
VM5YttVKXX£XXKRfYNXJCYCKrTORAXXRVKAX^jQE 
RVSLSia3LHSARVYV5VKPHZ3maEZAXJAIJC^ 
IFSPCWIENLLMQXOEKEXS 

CPIV.0319 363133 363879 

truB-tRNA Pseudaurldine synthase 

TXTFQ4l^XXEtfTMQLAVEXJCXSXIXVDKP0G1(TSFSLIR^ 

FATOVMVHI^GBCTTIttiSDXI«LFEC<CXCYEAXAKIj C 1 1 1 UM IX1A>* WQROTUPSLEXV 

LSAAITFOCEIQO^PKFSAKKVOOKJXYEYARJQ^IERXHS^ 

PVVSCSianYXR^XAKELGTIfiJCCCAYLEOLPJUJt^^ 

LRQAHGNSL 

CPn_0320 363824 364783 

ribF-FAD synthaet 

TTPISIFXJTYEffl«WYSWSSFSVDSVI^3FFD0« 

rosHPcrm-suwKLiNrKrnu^LLOTF p 1 dwlcvlt fdu<fanosaee7t.tixwoc. 

KeXKULSYP SC XC J CE OQSNTEALPTIGKPLG I EVIKI PPYRMDNIWS8XAIB0FLEAC 
NX^CAHRFLGHPYAXSGXXTKSGXGGSLSFATINUREE5UPL^ACEIir^^ 
GVMOJGTAPTFGRESLYAEAH X FSFAENLYGXEV8X X PRXFXJUCX9CXF05KETZJBAXXX 
OILDAODWFAKGSFWZCTA 

CPnJ>321 365900 364767 

ychF-CTP Binding Protein 

YSIOCHVIXFXFIlCLIfSHTEGGXVGLPNVrcKSG^ 

VTC^RXJAI^XISWSOKIIYADKKTVIJIAGLVTCGASDGAGIGMIFLSH 

CFDOPOVTHVSCKVNFVTO IEVTKIXLI FS 0F55AKNX KSKX*EXlAKCXREVGALLPt«Ft> 

TX IAHLEKGLPLRTLELTPEO XVAUCPYPFLTKKPMFYI AWDX^LPOMM7tVAAVRE 

VAAMNSKVVPICVRIEarVSU>IEERLEFIJ<SI^IJ3CSCU<R^ 

TCPQES RAKTWRGSSAMEAAG EXKTPIQKGF IRAEVXTFECMIECQSR AAft R ELBKLH X 

EGRXTTWOOGCTNLFLKN 

CPn.0322 366231 367328 

yscU'YopS Translocation Protein u 
SNLCa^StCEICrEXATPXRLRllARXXGCVAX^OFPSA^FXVSKFTAFS 
FLVSKLSOAPTR HDPVTTLFYLWJCLKL I LTASU»UJCAVAVVCV1VCTUVCPTF»TTV 
FKPOIKKFWPII«IIC0KmiCTLIELIICSIWIFGAALILYrnJCfilC\^UIETAaVSPI 
XTAOXFKEXFYKAVTS IGX FFLI VA I LXXA^YQRJWFAJCELKMEKFE VKQEf KP1 EOM PEI 
KCRRRO IACE rAYEDSSSOVKHASTWSN PKD I AVA ICYMPEJC YKAPW1 IAM3INLRAKR 
ILDCAEKYC IP tKRNVPt>HOLLOEGKELICF X PESTYEA ICEILLY XTSUMONPIBIKMT 
NOPOHL 

CPn.0323 367322 369460 

lcrD- Low Calcium Response 0 
• SF IPWXU1IFVSRTL0C0TALNH INK3SDLILALWKHCWVLMI IIPLPPPIVDLPIITIKL 
SXSVFLLKVALYIPSALGLSVFPStXLITTTfFRtXjINXSSSRQItXJCAYAGHVXOAFGDF 
WGCNYWCF IXFLIITXIOF I VVTKGAERVA EVAA R FP. LDAMPCXOHA IOAXILRAGM ID 
ATOAROXRAO XOKESELYGAMOGAMKF IKGOV I AC IVI3LIN XVOOLTIGVAMHCBtDLAO 
AAMVYTLLS ICDGLVSO I PS LL I ALT AG I VTTR VSSDKWWLOCEISTOLVKEPRALLLA 
GAATX/jVCFFKCFPLMSFS I LAL I FVALC I LLLTKKS AAGK P^OGSGASTTVGAACOCAA 
TVCONPOOYSLTLPV'ILELCKOLSKL IQi IKTKSCOSFVDW I PKMROALYQOZGXRYPGX 
HVRT0.1PSLEGYOYM I LLNEVPVypOK X PPHHVLTNEVECNLSR YNLPF (TYKNAAGLPS 
'M^VtlEDAKAILEXAAIKYWrPLE*/! ILHLS'^FFIIK.'I.^QCFLGXOEVRSMXEFMXCRSFPDL 
VKEVTRL t FLOKLTE I FKRLVOEQ 1 3 1 ROUTT f LEwL^EWACTEXDTVLLTEYVRSSLKL 
V r ;JFX F.WOr.A 1 3VALLDPC lEEHIKJA I Yfj'f.'.N LALO PaTVNL I LKSKMT ITPT 
PA(rOPFVLLTAlOVRRYVRKLrETEFPOIAVr:;YOEri.PEinrOPLCRIOIF 

rMiJM24 1*'i44R ItOHHH 

»!Ti34 hnx3ttwric.il protein 

WVAKPPI rKAASCXTrcGLCCTa^/I JLAAV KA/k/»AK A D AAE , /VAHO ETi; EMNM tOO-JOOLT 
Nl 'AAATKTK KK EEKFCTLE.IRir r.TEA/aCA Kr Y \ :i Z TTFJir. fOT01AORYA:m'XUO0EL 
l& U>UA I i OA1A.1PEO I LAI .Vy EF. I KH'Maj: TtA\ J)V LWTrP PI W JKLK EAL IOARJRWT 
W^WTA HUKH I U-'.V^EYADOUIVr.-l-S :i .p-J .7 LC/n3TTimWU^WU>0RVTY0O 
HA tVr;. , :FIXKfJMATELKHa'.PT/T'::A/>l *jVI WirrWJI/^AVLTI WJYKEilJIVPI lloclk 



87 



WO0027994 [ fi(e:/A\dcwas03\firmdata\lp\FolevPat\PatentDocuments\WO0027994.c Dcl 



Page SO of 330 



WO 00/27994 



PCT/US99/26923 



arc cyrr.ajLMFvKVAcrrMK : ir jokpptasxv. anl iGDSHosmvutirrsAL* 

CPn.0325 J70*M J7U49 

CT325 nypntnetical protein . 
ttUAJ^QWEC-IXSUPLUHTLAPOWNSCLI^ 

gtlpenvprer: FKAALJVNCz:nssirciurrcrm»LYLro:w^^ 

KLFSU4AK IWMESUmiLPOUrAi: IWA 

oaiQ-Clucjnotraiwterese 

PSCrOfUJlRVrMjrnTHSPSAHAWCLICTSPKHC IYLPLTS DfTXNSCC ICETLDLI P 
LIS^KOCTWIQUJIinmmrrSPYWSISSVAt^PLflWSLPNIPr 

WKELCSTPsvrrTOVxwinaFUtrTfWCC^ 

A t KMffHCEP DWKPK5LTWENFP0LT10tFKDEVLrF^YtOn<CYWI^CVXAYAD0HH 

VOIJCroUXLXSI03SCIWWFRrnTS3SttS^^ 

WWOBUJIYAONPYSVYWJHXICFFWJ^^ 

ASSOPIGEI^XXFODVrmTKLGXCCTIlIPRWEJWWESX^ 

TWDSPrrAOWWUfSPKCAKOrAXrUt LPy UM LI IMQ XDXIJCLSH8SASXPHXKLPWPY 

UUTPC^VSXia^RfiRIKnOTISlOCNWSYR^ 

CPn_0327 J72927 373311 

rl29-L28 Ribotomal Protein 
RIHR10MIWC3»LT^P*IUtfSYTLI^^ 

es«m.iajusA5AUuixoufitccvuiuu^ 

CPn.0329 373220 374992 

CT085 hypothetical protein 

LKYREIFMSrXJUmSLrRSOKOLIDVTAIVSPNLE 
SnWNIjnGTKHPA/IXJLreQAPnfc 
AKFWtFPPVSMSSVNIIWLPlXTSWEIXSCro 
OTHQUIFOIOKOC3GHHLYEAiniO50NIJV5VPX*SG^ 



VVKBWVRESLWALRIfctaXHl^TK/IJ^^ I 
f SETANX7T L£ K T GfS LNXSSKG IPHSXCKWIU3UWYOOaaHCVOMAPrC»»VLE 
TSXJn>RCIKSI4JWPX)LICSWPLXILAIH^^ 
ATHWWNFPmDWJOCPSYPKSVEVDPSTKQKVSEW^ 

CPn.0329 375095 376146 

Phopho lipase D Super tend ly (leader 133) peptide) 

wMcroianajcxcvxisriaLVGirARAM^ 
AiEKADEiariJiiYNLSEmooSLTROAOAranrvrriYYOKr^ 

PA<JIUaj«OXAX^IDXiaMWLBSANYTt^^ 

KOOT3KYrVLP0DPJUAIOAVLEia0TA0rr IOVAMPALTHSEI XOALK0AJC0RSIKVDX 
I IDRSHSKLTTT^LROUnNKDFVSIWrAPCTIJWICFAVlCMCTliACS DJWSKERPSLN 



CPn_0330 376930 376202 

CT083 hypothetical protein 
FIS IEKiI^RQU r SVU>SPJ^DU<VY^ 

PJCX*VPJO«aomRIPRVl3RrDWRXIRn)PYCM^AgSIATO 

ATWEAIRIjaJAI UUKIOtiJUL OKHRYKLIiSQI^^ 

OODOmOOJUlSIICVTlCKiatCISLOAAASOAlAAAA 

EEA 

CPn_033l 378452 376701 

CT082 hypothetical protein 

XORXXIIAVSC&XroPSSDPGXHNPALQGIOA^ 

SGCTCCTATESQXNKAKYIUCAODP^STSPKSX^^ 

KPASXttGCGTSIXPTEXDVAUOCamiS^ 

FSOUWLSI^SESSSVASPGSPOKAIEPIiSEZJCVNAWr^^ 

LVRIUWATCNB»IDLS0XI3EEVSTAIffSPW 

WVEJtZAEKOESTOUEDQMNIJklUMAGL^ 

PTQR5CDXSKKXSPGIEIC5TNHTNF^PUt£CnVXSA£^ 

PEAWKESTAFT<NPENSSONFTJ>1AVESVFPKESCTO^^ 

CPn_0332 378676 379536 

CKLTR T2 Protein 

YLDSRIRVI PLARORCTLLHLLAVLCPPIS 1- riVJV SPCVFFCFLDF 

CPn_0333 379117 378800 

ltu8 

\mrrVPVmCKPKKSRTORAIJWEI0ICKSTCVUCKPARXI» 
AOeyDOL'/RSLLOSQKKOTDXVLirNYDCPVrmXOHTSKYSXRL 

CPn_Ol34 379309 379823 

CT079 -Lmilarity 

T*SVHITPRKCFIU:iLSMFTLPTtJPKAHLILJ^ 

.T DLALG2RCVFLLLYPLTAL XTKKAH LX FSKESXAAL.VX VNM I PYGVPliLT X PHCALPC 
HCVRWSXOVLMXPLKC^UMLXFTSVXYIt^AXNSCXHKMISFFRRLVCY 

'TPnjm* V79»08 380674 

talD-Mtcnylene Tec rahydro to Lite Dehydrogenase 

EICMLLP'Jl PAAEKXLOMJCEE ISOSPTSPCLA\rw0.rCMDPASEVYVGMKVKKATEICI t 

.~KAHKLPCDSTL5CVLKL(CRI>C0PS[HGlLV0L['LPKHLOSEVCL0A tSPDKD^DCLH 

PVNMCK LLLCNFDCLLTCTPAC I IELLNYYEX PLRGPHAAI VCRSN tVCKPLAAUtCXH 

I'JTNrr/rVUt^SENLTErLKTAOt I r AAXGA PLF I KETMVAPHAV I VDVTTTTRV PAON 

AKnYTLl/iT)VnFNNV\iTK(;i\A ITrVPQCVRPKIVAMlXOTVfRfTYONFS 

"PllJItH. IH05i"> 
/'Ml* 

W TUKMJ' : 3 fl i : !: "WH RW: H H J :RY AJ{t^YMAKLPKFFL7LU:LC t < -XrJOKTTT I EUEOKr I 

kym i w/,t;u yuaoArUw-oo t oh:pmk id:; t ynnhwpyselt. i i nrapadvp itujvel 

::KFI4Jf//l/ri.YKU:OOTI^ITWP!jnXWl^lLK*5m.PW 

f mcTL i rrmiivo iou\awTYA\^u«icwF^iwYVEWcJiJF. rrrJauipacRPw 

I RIKAA* rr 1 1 J j 1 14 *W l ATT> Wl U)KMWf7fK rYTII tLCmmWI-LELSSYP ioiwswh 

-ay aim i Atvt Krmnc i fak^aeoui i lty rtix/vra 



^EnrpSoCKRiS !5iiiorU.WRL3P tieSCDIJIAOKEXVSIAKAIAIMpVnTL 

UimT R M t^QKCMTL I P LO<FT^RCYVXVRLJCCRGJCXAYPXRRTI i Duxmv 
AAAJKRRMH 

CPn.0339 "2272 381375 

»mrr. : .vNr;:».:KK: . -.-w.Trrr.-.r:r.-:.:rr.w !::.\-j«riTi'i.r.w vr* 

AKVYEXUAIJ X kJKR>'^ LVFX^TtANU; ^WtKW lTJw^*KU^KEXCDrPtC 
P DIQNALRJSUPAX3UCT«LCRTSFAVSREESRYVLTCV'^ 

IDAXVTtScSFSGEYX :PIKAVEEXX10«SDCGEAAXPWOTXAVtCt*miJTKXXSG 
trPOFSPV I STXSKVXLDLH R£IXITUJ«^Al^TNESSHSWSTtPCE3LTLTANCTTCV 

OT icvsi<AWYSGixi£iAFTn , rT7^ 



CPn.0339 313405 384034 

CT339 hypothetical protein 

VTTLPKPMKXCSXJCXJO^FItNHSOLeXSLAPXlJnrAOCX^^ 

OTITPGSSHPPl*TOFn©HLPCAXJXYTOKW10KXC^ 

IO»LUSaAPAPPJU^lXXU0CCNHYTLCl^YYHP^ 

WSWAPTYPSWJrSVVPJirOIYITOfFCLTr 

CPnj0340 383842 384156 

t(ra»e»shift with 0339) 

PLYTUXVUSRSSAEICSXJQCOAIILKRCLHDEOLVKKTTYI. S IORPLCSOXLSDUXEL 
WS»OJCEOLAIJ^mSLIKKSDISrrAVA£En<K0l^ISU>RDLr 

CPR.0341 384160 384495 

Ureae-ehitt with 0340) 

GSTSVGPHraniJicioKPvsorssiDoiwsx^ 

HAOJ»»VOOLLDPAPtT£QTLITS7W«3 X 

CFTU0342 384619 385062 

predicted OKP (leader (19) peptide) 

VALAKPCCPEXAXXEia«PSWLXitrOGXJl 

CPn_0343 384999 385595 

(fraae-ehiCt with 03427) 

U>M^JOUCAXXJtAPntAGSTIJWnrRC^ 

KKEnJCEDnTPXIEHVPEAAIJnTV^EOXOJCNSIUJ^ 
PKXKSLNQKKEXNPEXLEX 

CPn_0344 387432 38S558 

yaeL*Me cal loprot ease 

S5RYMTX xmLAAXAX^XLVLXHCXiCKLVVAICAVQlAVESrSXGraPALPX^ 

RicciprocrrwiRo aJffiujaiu^iL iDsvY^ 

AVLArSIX>Y^COCP^IQVySDCSlCVVCWNWI ^ OA n^ 

SLXJJM/CXIKRPCYLTVPSKErAIDVU UPTRfUV PUSGASYXXYSNQVPLTIXSIICt 
NStXAPKDRFVWKDOniJ^MAOISOXtllESYAFVXVAP^ 

YUWlLIDTOY^AfXICtaCWSSLYTLPYVIKSYCYI BSILTAX DPESPLPOPOZXLQLBDR 

ILAXXCTPVSGSVOXLRLVONHRVSI XVOQNSPOEX^EWSPJXAOCVXASYllSIOtLOX 

UWLOESKPVEVWPYWADPVOPRPWIDVreSES^^ 

AEWPSlJCISUaJIJCVRYNPSPVVMI^TKESLITL^ 

UTlGHSVGr5CVXJWXGU9eaAVLNLl.PIPVXiX^XLL 

LVPPTPXLX X PFX PLTFODLPRPPC 

CPn.0345 388587 387436 

CT34S hypothetical protein 

UCVACUOaAVLC^TGSICROTlXIV^ 

AVYNEEVyNEACQVPIOtOm^OECtTOirXKtmrnvVAASSCXD 
AXJUANIU^XLVCACELVSICrAKZZCIKVLPIDSEMlALYOCLES 

^HPO5UHCMVn7X£CWXSXieiPPEt&rpX0YALTAPCRFASPlUXMDPSia(^^ 

PVDEERFPSIRtJ«3C^XK0CSSCSriTUANEVLVWFljCre 

VYACHSLEOILEVDSEARALAOEX 

CPn-0346 389690 388704 

070-croD/ytgD- Integral Membrane Protein 
KXGSLHAljGPSPYYWSFFOFrSVrPSRiJ-SWL 

TTLVLRW4AKY ANAVSMT\XFCLVCVCLTTHOLTTLS LC7TLT LAAHATAKLTX;FI.XYTI n 
1/rrin/SEeSSTALVFSlXFSLSLVIXVntTI^ 

ANAVTTIFAFRSLVCSSFDSVFASSLCIPIRLVUYLIITOL5ACLVGAFKAVCVIXUAF 
LIIPSLIAXVIAK5IRSLKAWSLVFSICTAFl>PASSBAIl^AYDLC^^SGISWrLTM 
K/XWKFXSYPRCYPSKKrCXXSEKSSOY 

CPn_0347 3*1079 399679 

049-t roc /ytgC* Integral Mem b rane Protein 
TPTrNPEALSRKTXWXVLXKLSCVFSETI Ft^SFLAVTLICKTTALWCTILLISKDPIJLS 
ESL^HAr/PCU-VGALMAOVFSLOASlFWIVT^OCAASVTCY^I IVFtdCVOCLWDSA 
LC FVLWPF A ICV I LASYVK ES S PTLYNR I NA YLYCOAATLC FLEAT1AA I VTCASLFAL 
Uia^ROIVVTrPCXDFA\nCCLKTVLY£ALSLIFISLVXVSCV1(SVGIVLISAMPVAPSL 
CAROLS DRI^TtLILSAFFJuISCALCoYISVAFTCRAI ICOOAVPVTLPTCPLWICAO 
LIW«SLCLXJ3I»Ki»WtRnT>RKHPSPSKOOEHLUCVF^ 
Qr/FGPKPFP^\A3ILBV1^/KKEODYYRLTKrc 
Sr. E5VHEIAEEI EHVLTEELDIITLTE I LNDPCYDWIP9 1 IPNXKKEV 

CPn_034H 3-»l%l5 >9lfW7 

0ftfl-croB/ytu8-ADC cranapnrr.wr ATP^se 
frWIiNWDETFW3VHNLC\WEHAAVl.YII IttrcLCHTCLTA r I /',PtJT-V.lf.7TLLXASLC 
L I FPCSCr/"/PF»IOKrKK\TOP I AYMPORA.*r/l>mPPKrVI J)LAl J* X •YSYKlWkCP. I3S 
DDfrREAFIl I LfiRVULE^VATRO I^OUV V WOI'AFLAI«ALMOKADLYIXDELFj :A [ CMA5 

FrrsvnvLOEtROocacT i^-vmmdl^hvrolfmjwllnkrl icrcrToer : t j*xt ipot 

•r7TE I ELLEQTLKLSRTSXCPi'C'.' 

•:Pnjii4v ^&$nft r :-n v.n 

(««*V-tr«iA/yroA*:»lut** l*rfir*iifi hir^lirff Kimily 
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WIL»IA:WEMUAIO<rftrKVMRWrr:FVACC:TJ .iSCFONANSRPCtLSMWWXHDC 
VERVVHOTUVTAVLIICGJLDPIIAYDr/KGDXDKI 

PNSVKLCERL I ARCAFVPLEEDC tCOPH IWMDLSIWKEAVIErTEVLXEICFPIMSAfTKA 
M5EELVCEMS IU)3WAK0CtJTtPOlU«YLVSCHNfcrSrmRYI>TPECVASW*mS« 
( 5 PECL5PEA0 1 SVRD IMAWOYINEHDVSVVF PEXnT/YQOAXJCIC IVSS LKKSH LVRXAO 
KPLYSDNVDDNYTTrPKtfNVCLrTECUZ^ALCCOR 



j<n«94 



— • - ; .<v ; im p::k : {■>•;, ;»yA' '■ ?•>' •:tEtLrRYT.r*" , ;T?r/*Mio'r: jckmk.mVm 

a'LUMUCTriLESCITRTlNPKPhAPRUSK^^ 
ARFSPIOCPLTSLKR^XASXRNGXVSVEI>aWYVEAVKAm 

CPn.0351 193861 395432 

adt -ADP/ATP T rami ocas e 

KXKVrORV NfflMl^ PFClOJtfFWIHraEXja^^ 

rvCAPCSCAEAIPriKFWLWPCAI IFMLIYAJU^ILSKOAXXfAVCTPTLXFTALm 

vtYPutovufFTsrADRiOAXLPPCixcLVMUWinrrAArr^^ 

ANErTKIKEAJCRPYALFG IGANISLiASGJ^IVWASIttiRASVSEGVIJPMBISLRlLKAMT 
IV^LVU<ASYVWW(NVLTOPRrYNPEllMOKOIOCGAKPKMW 
LLVIAYGXCI^XEVIVKSOLKLQYPNMOTYSEFMGNFSFVm^/^^ 
PGWLTGALVTPVMVIATCXVFTALVXFRNOASCLVAMFT^^ 

KYALFDSTK EMAYI PLIEEOKVRGXAAIDWAARJTHCSOGAWGCCLLVXCGS XGAMTPY 
LAVXLLFIIAIWLVSATXl^KIJTAOSAIJCEOEVAOEOSAPASS 

CPnJ)352 395478 396630 

Mo robust honolog present in Genebank/EMBL as of U/7/9B 
WVCI FT INSHFTNSYAFFNQKV X XTVRHSGCTMKCS PtTLVPHXIWDCECHRSCSUCX 
RTIARl,IU^VUU.VSAI^FVPTJUtfISYAICCT^ 

PNEXJOKI XYNRYPKEVFYFVKTHSLTVNELia F XNCWKSCTDLPPNtHKKAEArCIDILK 

SIDLTLFPEFKIIXONCPLYWI^HFIOXTESVACT^ 

IFHSYTI^TLISESQYXFLYSXASWOTOPSVXICTC 

0FtJOTSHGITMEC*C*I0LINPONlfla4LCOFt»U«ra 

SNYEXTOJFMTWKEXJCVLLEKVKESPKHFASALV^ 

v^SSUWAFKAOTYKLEKXIESSLPIRSSL 

CPn_0353 396893 397135 

No robust honolog present in Genebank/EMBL as of 11/7/96 
U^rRNXXXSLirXK^IRYSOSCKZX^KCMlPFinaCSITSSLVTIXIXAXPVD'rSSX XQNN 
PNXNFXNXNISINRIFVKFTI 

CPn_0354 397062 396507 

No robusc honolog present in Genebank/EHBL as of 11/7/99 

miSXKXXJCXKTFlXXGFlI/ajlYNTOXDEPRXQfSNXTCPVI(W 

T T MnnetTf ^.it Ttrrn/titn/cvTt T/rj/^f r TffyrTrtTirirfTPWTgfilcg 

OVFJGXI//NRIRAHYFKFVSDFVSEAXPNXXD^^ 

QKIDTPECIARIJWEVRTASLKRXXSAASSRJLFPSLFKIWOT 

VELHRVKXIGGSr FTTH .WXKPEKLPTYWLX PLPWTNSS ILNLMTLVLARVLTRX7VP 

0HLKYAAI1IGEMNI*MSI>LNTMRGWAXYHAAY0S^ XFKH 

RYSMKOMSLXKTVPADLWENLCCLTUWraR^ 

LTLLSWrRrXPJOSTNXAMTLENIATHNSTfRSXJP ITVHPtJO*SVrSO?FFT)ESffT,L 

xc 

CPnjMSS 399955 398591 

No robust homo log present in Genebank/EMBL as of 11/7/96 

XRDrn^XXYTAFNRSISKCLAMSKTlVPHAIJTQWC&CHSTFPLSSRTXVRIAIASIJ'C 

ICAIAAljB CL APPVSYIVSSVLArtArVILSLvXLALIFGEIQ^ 

EAYGX^ISArVRECCVriAEFRQFSTALLCNISPEmKGU'^ 

LE10*WIFEDLLSCTCPLYWL0KFISAGD«>\^^ 

X PCKETHH X LQQLTXEDVLLLXNXALQEXWITrOEVXAXVCR IYTrYTARGTLICrEAGGX.T 
KmSKEIAIASLKCYSroLOLITjOLPRIlAWDWLCFTO 
LOESS IDFTJVM^YVIODIJCEAVOAFSASDEPiaCEU^^ 
HR IALEHGNARAR VYUV-KFVTGAR I HRKTS I FFXD 

CPn_03S6 400465 400109 

no robust homo log present in Genebank/EMBL as of 11/7/98 



ELIEMQLMKPTIWGVAKDUNVFErairo 
CPn.0357 401341 400469 

No robust honolog present in Genebank/EMBL as of 11/7/98 

ysswcasmvniopvyrntovnysoatqfsvcopalsli x vswaavlaivalvcsosll 
jielctalvlvslilfasamfmiyxmroepkellipkkimelioehypsiwdfrrdqev 
s i yeihhlis ilnktnvfdkapvyloexllofg i exfkdvhpsklpnfeei llohcplhw 
U3Rlvy*wsdvtpctycyywccpu:lyenapslftrrslllij^^ 
ntwssselvo i ronlftryy adkeevdeaelnadyeofds llh l2 fskkls 

CPn_0358 401757 401578 

Mo robust bono log present in Genebank/EMBL as of 11/7/98 
ED/LSVSKKLXPTC^SXERETDSKROXXIFTXYXCSSKVIACHFFSKLOKKNXXKESICV 

CPn_0350 401994 403617 

LepA-GTPase 

ITLQY I LKEYKI EN IRNFS I IAHI0NGK5T IADRLLEJTSTVEERD1AEOLLOSHDLOIE 
RG miCAHPVTMTYLYECEVYOLNLIDTPCHVDFSYEVSRSLSAOT 
0SLANV/U\LEROLEX IPVLNK IDLPAA0PVRIACCIEOY IGLOTTNXIACSAICTGCGIP 
A t LKAI X DLVPPPKAPAETELKALVFDSHYOPYVCIMVYVR I ISGELKXGOR XTTMAAKC 
n.^Fr/l/5IGAFLPKATFIECSWrC0VGFFIANLXKVKOWICDTVTKTKHPACT 
KElNPV^AGIYPtWGDFCrTLXDAl^WLNDSALTXEOESSHSlCFCFRCGFUXLHL 
KIIFEP. 1 1 REFDLDII ATAP.WI\TvVVLKNGXVLDIDNPSCYPDPAI IEHVEEPWVHVNX 
lTli)C(Lrji lKNUJl^KRClC.VKTEMI^OHRLVl^YELPl>JEIVSDFNDKLK3NnXGYGS 
KDYRl/il/YRKCS t IKLCVL INEEP IDAFSCLVHR0KAE5RGRS1CEKLVOV I PQQLFKXP 
tOAAtriKKVIARETtRAI^KNVTAt^YGGOtTRKRF.LWEKOXKCKKRNKEFCXVSXPOTA 

-lnJM'.'l 4ll*i 4U»'iJJ 

'Tuiii tr/r«n.r»<r u;.tl jiror.»io 

vac um f »». ti jlavmchcnlvixh iokcfsv! jvynrtpektrdfuceypnhrelvgfesle 

Irf-VMrXFUIKK IKLMtU* JKIVO^JIHALLPFIXf^iOVl ILGflNrlYFKOSERPCKELQEK 

. :i m$tn uix'.EnamtiK rHpmNPFjwrLVAP rrw i AAiowRPccr»vvrTroRAG 

I f Y7F AVI It Kit RYf IP tOI • 1 1 .'KAY' • 1 I JlDFLKiwATAVAT t LKEMNTLEXXS7L 1 RIACEVL 



AIJCDPECIPVTSTILCVVX JRWTAXDALKS7- PLJLI Z TAVS^R FU ~IVJCE I REC A 
^5^PLIFEKPraP3^ IGDVFHALVAS1C I ^WF^LL^E^A^D^XA 

rPClVUSArTTYiyrmASSSMS I JM»RX ff FGAKfffE^ 
K 

CPn.0361 «*«0 405382 

tyrS-tyrosyi -.pja Synthetase 

TLVKNADWUJE:^^.J LUwZ jwikku^ml'. kit ; K^kVHJSCUXJYTtr^'LIltitt: 
FYWlJTNYCT:UCXSDC«^TSCIOFrPJUCGLG^ 

TVWLDSDLTSPF EL'/CTULRLPDCT I PKIARTLTLLSNEEIOniDRRVQTtJPVAVlCEFVA 

LJXWUOCSKSEIRJU.XEQ1CCVYXNNWXAKEM5VCEEOOX^^ 
YLN 

CPTU0362 <07B43 407055 

fliA/rosD-Signa-28/MhlG Fasily 
UjjoacrVKTQQTONXXEVimFYVETOSXEY^ 

OLYASGVEGLVRAVERYKPERSRRFEGYAVTL I XAA 1 1 DDUUCCOWVPRSVX QKAMCUG 

j^uiostcitgrrDLiiriYx^isoQELsawrvsA^ 

OTiroEWk£3CrT7VVlWEFSLCXW^IOEl£ 
ESRVSQIHSKALIJCLRAALSAFK 

CPn.0363 409700 407943 

flhA-Flagellar Secretion Protein 
tAVTVSGXiaXXVRGMXFVPLSXLVLXFLPX^XUOFGLCXSFALSU 

;jJ CLrrr nnn^T a T nT * ffAgTPWTV<?C:r:TA ' tgT TVCT/TCrrCT/?gTJJXATTArTJJJ 

F\^NFIKVSRXXSX9XAEVRSRFFXXAXJ^AIG0MALX)SDLVSGR 

FSAMECVTRFVKGDAI ISC ILIXVNVVSVTCLYYTSGYALEC**'FTVlfitiXLVSOVTAI-L 

TSCAMTUSXXCXEESXXNrXJTYYlOOLJtXOT 

LWLAYWCE£PA5CDSCXEMFSYVIX»CI>XE{^SC^^^ 

tMEXRIWXJtVFaONVYXJJEifrPEAVlJrW 

XVXWaSUSLVVI^RXXVRfltVSXJCLITKXLXAVAVYQNSGO^ 

GRStJWDQKOTI^ylTXDFHVBaJMSSYSR^MPVMOW 

SCEOTEKJOQOI>WPCU,Vl<SHD£L* 

CPn_0364 409954 410236 

eer4-Ferredexln XV 

KEMSMAiaVXTSODX^ErELEtMSEXAEPCESMCXPFAC I tG V C GIC VXE VLECMIILS 
EfTEPEYDFTXWPEWEIUJiCOCIlIlCGGCVKVTF 

CPn.0365 410498 411544 

No robust honolog present in Genebank/EMBL as of 11/7/98 

PKOWJVNSLIMATISPiaTVWPLVOTiaaiS^ 

IGXXLHIIVIYFLTCISFIAWLSKnLYiaUTTtiW>RA^ 

ISIAlMRSKDWEMOPlOaONLPAPSALXVritlPYXIMCAia^ 

SASSMLCXrXXXEZX^QMAFXSSYVimTSPKSLLNEAIOETRVEIN^ 

WOPt3raGR\nTJ^IPTTPEAIYQYYYALY\mfIOTAINTVroi XO Z PLYSLRIXLYSREL 

PPG^RMCC^UWrrAVXYMAEX^PEYPLTXACVI^lJ^LPOES 

CPn.0366 411976 412440 

No robust honolog present in Genebank/EMBL as of 11/7/96 

. CL IVl*SLXAI R PALQFTLETGH PAA IAVLAVSGT ILLVAVI I LFCFLAAVPF AAXKTTTY 
VKTVDOYASWHSKOOTPtlirriFSGIVYAESOAOL 

CPn.0367 413076 413836 

No robust honolog present in Genebank/EMBL as of 11/7/98 . 

SFPLNRYFMTKT^IPDVHENOSKLSVDERilSESPVLTKKEVIAKIIKLT 

VCTTAWAGVlfiMPLMAIATGAALLAAVVLSCIiJJ^^ 

VOPSVPLITYOiaaiWEWrLVNTLSKNISWTt^PNORYY^ 

PRUCrSGRVMIWAANSNMOSGGASTNAAI^AATHFTC^^ 

RSAPWINRDWTNX 

CPCU0368 413766 414107 

No robust honolog present in Genebank/EMBL as of 11/7/98 
TLAIOWLWVNAACMPCSIETCRIinmiPGEA^ 

YUCTOEALNN0A7W0VPL I SSS I YSPCCKLELEPVNOTKPNSSAYKLYHIRT 

CPn.0369 414345 415562 

CT058 hypothetical protein.2 

NXMrDSNPLPSYTCASLYRTPAKHf^PXRLPLNRTORXEKtLKlVTXT^JklACALCFSXA 
ACXLAMPXFSAV\VrTLAXAAVSLYSLlJCKPKLYEXLPOXEPESEOSSLSP5raPP6XX}0 
LPLQIDPLPDPESL?CVSIJU>LTTPPEELTAIT\TPCrr 

TETPOOPCF IWXLKDSKLI F X STSCD XAVPR X KT0GRVM X VNAANEN X5REOGGTMKAL5 
U^TSLOCMNASRLPRAHSRSGSOLOPGECRSAKWEMSDHTSNDKVPGKAHF 
AKCNNDPKOAFEV'rKKAFHNLFOEAEI XGVDVIOLPLIGCNLFAPSRLLNLCICTRAEWIE 
AXKLALXTSLODFCWEOONOEEOKX X XLTOKOQPPI IPPRFOLTTP 

CPn_0J70 415755 416912 

CT058 hypothetical protein_3 

KR IFFKLIVFYLKiFMSTTEPNLTWNLTNLISSESMPTOLASHia^CLDLVAFXLX IGX 
AVSSOTAAX I LCXPLLFI LTALAVLAFSXLLYFLLREPKSPISVTMOPTPI IKTTOLPPV 
PPLALTPVPTEA\T.EEPPLPSPRTH0TliOEfft«roRiroU5ANTWPFIAADN0TCYAW 
KKSNLTLICTLGPIEKPRyCTCCI WIVNAATPf^MANNVKCTSlJVLAKAT^RCWENSKK 
SPDPLRSKOPUJLOECRSAWENLrrTTTNAGKAGLPOFUXLl^ 
CROAYLNCLNEAKRRKTT^-CLPLl^raiFPa^PKDEinTSLRLCWIfX^LA^ 
COEAENONOPWV'I - LTTLARH rL IT P 

CPn_017l 417141 4I7VU 

no robust hoak»t»>u pt'rts«*nr. in < Urfurrvink /FKDL .»t: nl U/V/Ort 
rn&wrs APLvr aai9snrNu-ij*riHv:KAtyf>rtv i ki.(, V k i lva i lv i bvlo 

II ! AAFF X PCTPP U*L 1 1 U*'4.I LTT/l/ VIJJ.7 1 X LAI .V? IKT R TTTAtVO IKRKUISK3 1 
t:Pn.0172 4 1 *»••; I 4 (hum 

No robunr. hom.*l«M i.u»»»:ttfir. if, <»iM4.mlc/KMm. n| H/7/'»H 
NYRACHRN U^HK.^ PWTiTT.'U-JC^^vrr I / IKFI.VMJ'/ M yn : 1 K I AFAAr?rALLLLN 
TFVflGIVAtAMI rVKTtNiHXmVM* I J.;:.I UJK tMI.IMCYK ItllPlWl* CtN 
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':pn_0)7 » 



41*15* 420218 



NSEt FZ I FKT- rTPAINSSRWCTWTVR IGNLY ICSDHSXrTOSHTrTLTTDXQSTVWIY 

ALAEHNCD I VRVTVOC IKEAOACEK IKERLXALGLN t PLVAD XKF FPQAAMLVADT AflXV 

RINPCNYtDKJ^nCTKrrTtASYWUJUXEJCrAPLVTClW 

ER I HOIT/CTT I JEMVAOA I £Y I A\^O0>Tf ROVVFSWCSSCTKIKVTAYTOUUOaXARC 

WLYPLKlC/TEA«CVOni rKSAWI(7nXAECLCOTrRCS LlU.^ I I E IPVCDglXRKT 

• -Wr '* •* • ;i ».-r-,-A.\wMy»»wT-.^'n if- trinf r-fff * -n 

...... ,^ WAfl ^^ v . A , .^.KrvAi-rw: :*ntri;mD«icvf ^LvnMKEtvsro* 

VHv^P^^FHA^OPFIHTSRDFKDCO^OUKPTlC.VrSRCraKK£CAAl JIATCTCAIX^ 
JEIjSEAVVLlXP^PWOVlJCIAFCTW 
KRTOHUCLKUXIOCXVNCICEMAfiAOPCPVCSICTQtX^ 
LXRXXOEHCVWKDPECTXLTV 

CPfV.0374 420309 420961 

CT0S6 hypothetical protain 
VT3SMTXJFHTHPLNYWITEEWX4»IRHGVrSK0^^ 
UtORHCTSVRCVTPTSPTYOPAIXacnWPU^UfrRH^ 

WRCLiCNIYAVTVCTHKKl^KTKPOOLFVAICPS IC POYA IYPUYATLF PRSPLPIWPX 

mFOUMXARKOLTNLCISKmXPXSOLCTYTGffiArPSSRYXJU^^ 

NVTAVLLLPRD 

CPnJ»7S 421112 421615 

No robuit homo log present in Genebank/EMBL ai o£ 11/7/98 
fU^Km*STNHKWEPWPKJU«tJ^ ' 
UilAJfcXTFWFLAWFPLXOAYSIAfillTLVClJUCLV^ 
LLPLTIXVI B00PITPKPEI PYSYLTKLAXXTSL FLTLRRSSSQRXTH 

CPn.0376 421680 422294 

No robust homo log present In GenabanJc/EMBL at of 11/7/98 

ncWTAJCAFHLTEIRlMakRVPSIJUSFraHWTO 

TOO6RKKAG0WAIFN5PTPGVSSTLVIJOffI^^^ 

CrLI«XJVDU£NQrrSVHtHA££AFTPU»TCKPKn^ 

VSA0njFTLTLTQDi3Q*PFTOJOTlC 

CPn_0377 42)441 422)47 

s ucB-D ihydrolipoamide Succ my It ransf erase 

I MTTCV RIPNXA£5ISX^mrASXXVTSSAtIQX«OCLI^IESDKVW0LIY 

VSECDVVWCCVVCK I EPAGECEELCmSQ SKETI EA£ 1 1 CTPQSGVRQSP PEMCTT I PLR 

OOWXXaOCLSACTRCETRDWreiRJCTISRRliSAlJl^^ 

KOEKTI^RYGVlCXitfMSFFVlUWLEAIJttYPR 

WPVIIUKlJlOiSNCEIEQnJUJIWtfJU^^ 

PPOVGILOOOCiaOPVVIINEJVIAXat^ 

SLLOL 

CPn_0)78 42619S 423445 

sueA-Oxoglutarate Dehydrogenase 
IWXiniYFMD5EFV[WVYSSX*CJWIESMY0W»»^ 
ASTKIS(ZNETIAm4EOKS0FlOTYRYYCYL0SOIST^ 

VPSAStiJXA OVSVR ELI EAIJUCCYCGSLTIJILTCTTEIOEIVWNIitEXRQVERrAEO L 

iasyioxjcxattfeefwikittoiwfs:^^ 

!CIU/IVLTtlVWKPYWrVF»ffiFEX»PAA 

NASHLE SVPPXVtCVVAALQHQCHACKEOSSLAI LVHCDAAySGQCWYETLQLSRVPCTY 
STBTXlJflVVNNYXCrrAVFRZSRCTPYCTDIAXMXiCX PVnn/K5EX7WACX EAZCYALQ 
VTlDtrSCD^IDLDCYRKYOCiESDDPSVrAPLLYDOIKRKKSIRZL 

snrtAsiacnQE5uro£Fovijccn^PEP7PiacBCHHCDR^ 

LFHMSSRi£GF*DNFHPHPKIXIXL£iaW10j<A£QGW 

SCODS IRCTFSORHLVHSl/rv lU/lTSPLYW^AXOCSVOfYKSPXJEYAILC^eYCYAQ 
OAUCTLVLWEAOrCDTANGAOIirOOY LSSCIQKWDLH5DIVLLLPHC Y EUJU PEHSS5R 
IERYL0 LAANWNFOWLPSTPVOYFR I LREHAKRDLSLPLVI FTPXLLLRYPOCVSS IEE 
FTEP03FRAIL£MFNYDASXLVI£SCXIYYW^ 
OLVSUDIOfSHl^FVWWEKIWM»YOYMniAlwODILPD^ 
LSROELVTCMETLTSLR 

CPn.037» 426268 426765 

CT053 hypothetical protam 
KNKKKl^TCSRIODQiIWMKSDlIJCKIiSELHDLTtJWW 

KXYE£KSIUALLKEICE1E£V\TPRJISPAX1VYPIX;PSM5DIEP/EPTCTCXOXOPCX?TV 
EiELT9ECREXX»VEVDYSHEODEDPrSDRWRWRRGCX I0POANEW 

CPn.0380 426671 427876 

nenN-Copraporphyrinogen III Oxidase 
KCTXPTKTMKTLSAUXACDAVVSLiniL^ICAPtJC.YXHIPFX^ 
SESVSLYCNAVIOECLRXI^IQETHFXETVTFOGGTPSra 

FS^ICXrYCLPTOLXIFt^0U4OAl.TLPITHISLYNLTIDPHTSrfXHRKrLVPTIA 

OEEILAEMSUAmXI^FXJRYEl^AXPDYPAKHNLYYWrDW 

•jERSKNYSH I SHYLRAVRKNLPTOETSEX LPKKER I KEALALRU LLST.A>OL^£n>STL r 

.^LTODVXWt^SVKCXXLJkUiROCRLTO 

CPn_0)8l 429836 428037 

«.T326 similarity 

-n.PfKrRAUfrAPT£SRSSPPTU^EPLSPNPIPAOIOIPRXTISPPSLOVSWA 
EO rT/TlAGCPRSSSSASVASI^rELVCU^DEDPEPPDSEVRTLYVWSWOTKOEAVO 
EULY I3EVRCEAVRLLYN0CSCMSPWPISPCRT LPTLDMPLCOAU.7\VEOrFSAPEWN 
REPLVXF*^COA£PYI00AL1X}SRHSPRXVVVCXSPTVFX0C0F1M<NYRVSCDrF5SLI)C 
PCTP AENTTILP YSSCLECVFLPS IRCPSPTWAVR FOE0CLVANRGEDVEDROCLS0DAE 
P5QLPIWEW»AWI0!nT>PSSMSR£VEWLN^ 
t'V:^LA0EV!UV™Er.U)WlCY5Ll 

.1. ore r r.vLvnxt wijiklhmpoeilra I p isa^t tans ivfvectrwmtwilrhrvo 

VK V»V IGCfXPVnTVnAilYRORAUr t ICTLOTVHGCLYLPVSIKVUIO IA IOV PR I LV 
HKITfrAWDU«NK^AEEM«dCa>VLAVnOTLNF lUTAFVU^OJrfTFVKSVUUISRRRRR 

'.d.JM*.! 4J07S3 4l001h 

v-it>./vi-W.-."JW-r>»!|*i»icnt M»fr»iyr.r.innr*»r.iae 

l rt m.af.Mn , f/rrRAVCTLn»VICELV!m^LIVEOTROC«Al^t^ir'EVf!KPPU^I 
'-•^HAPLi'KAWJKYLEI'IVKHCOI^LISOAflLrCIADPGAilLVnRARAtyJIPVOAFJCP 

• •:: rru«i«Lu:Li':vu-PFunfLpy:;HCEnvKj t kkaat rv:jtsvi * t et.*; yrnvytfe 



<:Pn_0381 4. 

CT047 hypothetical protein 

VO0TTTLTLPWa^TSrW3QA<ACXV<>A I A^r^A^ETOKOA^^ EOT^CC 

ocutPATtjcsvriTrAiJocHiTt^HA^PLATiren^ 

ECfMLSKALPSALSLSLTCWPADROKRI rRUiORAERVT. rSCSCSUCtrUULWT 

SLPOitSErao^r^iorrsLDHSDiKELVViocEK^ 

lledceopu: : rrrurrocLYCLRS x eecsk enxm rkfvl yt; k erlhc Ai/cuYAm. i 

KNNVQDP I VAVETLV t RHVNL 

. i .■ t .'■ ' : • 

ncetf-Hiexonv-i&ft* rtotein - 

VXTCtXK3XI04XGAOIOXOSGXICfAJRAVTtKPAKir/A^ 

AXJCWAXXTTAXRTVRjnVAJCXPAVXKVAAK^^ 

TWAJOCSITCICAAACALACRIWHJOn-SSCmrSSTA 

SR 

CPn.0385 4)4042 43252: 

pepA-Leucyi Aninopeptidase A 

ri,VT1QSErVVLFHAOASCRN3lV7iAOA t VL.PFVHFKDAJCNAAS rEAZTEP SYXPALEXPGG 

KTGnttLYSS PKAKDCRIVI f AT flKWTEI.TSI!AATgTYATLTRVLiUUUCCSTVMI tLPT 

ISEXJLSArrTLVCLSSCXI^I^YDYPRYTIXVCT^ECPl^KV^ 

AA I FlCVYLTRDLVNIWAOErr PWO-ATVAIJ^KErPS IDTKV^WDAlAKlXltaii-A 

VSKCSCVCPNf IWTtYOGXPICSKIJHTVLICKCVTf DSGSULKPCKSMLTIOClXllAaOAT 

VUXI£AXAVLCXJXNVTCXXPATEKAXDGA^YlQ«IO\nr^^ 

ADAITYAUCYCKPTR X I DPATLTCAHWSLCEEVAC rFSKNDVIJkXSlX£A5ArrSEPLW 

RU!.VKXYDICTTJtSDIA»aWLC^^ 

EEORYPKYASCPCVRS ILYYXENSLSK 

CPn.0386 434543 4)4046 

«t&-SS OKA Binding protein 
KSMCTIJOgtreACYbGAP PUJU ff Sl u ra ^ 

IS0)fl^tJ0CBSCVZV3^XSVESYIISmZSPOSSLVI5VDSXJCrSPPCPJ4GS^ 
NHQC3VS YESVSVCrTS EAIJAEA nCDXafyACYO0EQQYVCEiy/Pr 

CPIU0387 435229 434699 

CT043 hypothetical protein 

NNWLXamSUtSRONAEBOJOTAJCEXJC^^ LrVDCETSULTYCIKSD 
RLYVYAPLLDCLPOWTORJCLALYEXT .f SHT fST»QKACGCVCVATK EQL ILMHCVLPKKY 
ATXfOIJCA/AOIJ-XETVVICHirTVCA^X^^ 

CPn.0388 435323 437)20 

glgX -Glycogen Hydrolase (daoranching) 

STIffXVSSYPSVPLPLGASKISPNRYIlFALYAfiOATEVILALTOENSE^IVVL 
TGAJMfXEXECXSXX3SSYArRVHOPIOa4CH(3YSn(CYl>^ 
YArOfLKEEPrPWXXTOPU^KEIMirirDOfVRSTO 
HXtAXNAVEXXPXFSPDETAHPPSNSKFTYLCimi^ 

EPKTLVKttJtQECIEVIXJVVFHHKXljQCTTCSLX^IXnTSYYI LCA OUHf T WMJ Ottff 

LlrT*OUPTT0«l^IlJnfWVlD0WlJCfRft3L^ 

A^TXZZAnWAOGLYOVCYTTTUPRWSIMCXPYRI^^ 

ODiYimpTNSXNYVscKoeTTtaTvnrmma^ 

EOTIlXVREROtJ^NTTLTtVIVSOCIPMIOSCt^AKTAECW^^ 

TAKiTOotrLroLXAjTuaofKnjNRcn^ 

M(VYVAPNVGAOOQLATXJ»IUkS8mPYOXVACQ0CI^ 



CPIU0369 438254 437319 
CT041 hypothetical prot ein 

lVFNFICRrYCKD50RONSNTTCLRPF KKTCKEL IEFTUlRTVXLLRlWLLCLFFSMSISCf 
SEVKWDTT^DTVVEmRVUJ&iESTTALIEAXCPYRIYC 
AtYlCIRfcCITYPaOTJCIEPVDITrASLFTWIOYOC^ 
YXJCSVUXKYX^EXJXKXgUMXXIXRTAXYEiaiJUlNF^ 

FYCVEEAIDWrARLVVCSPOCU XQAQCIX05NVDRLAI ECFNARQ X LEXFYKX3VDFWI 



CPn.0390 439171 438134 

ruvO-Holliday Junction Hel lease 

r ksdrecsykthovavlhookk fuvs ulpktteeftc^knlkerlolrtjcaalorcevpc 
hcxjfcpp«x;ktsiju«xvaytvckclwj^polikpsdllcixtsloec^ 

RKXKVAEEYLYSAlfXSFICVDXT XDSCPGARSVRVDlAPFTl.VCATr R^ 
FSAALSYYSWOUCEXLVRSSHIXCXEAOSSAU^XAJCRSRCTimAN^^ 
RECaCINCOVAEXALAMLLIDDWCLNEIDIIUXTTI XDYYOGCPVGIRTLSVAVQEDXICT 
L£tyVYEPFLILXCFIKiriTRCRH\rTOIAYDHIJCRHAXWi.SIf;rCO 

CPn.0391 4)9701 439510 

No robust nomolog present in Cenebank/EKBL as of 11/7/98 

KDOLYKOEXPIPKATIl^RNLCV>G*DNPKCKROTLFXCRTSCRSALYSYSRRILVX,LNA/ 

HMSP 

CPn.0392 439814 440393 

tlcd-dCTP Deaminase 

MSIKtOKWIREKWJ4A»<IHPPVNCOVtAj74EriWLtSYC^ 

VYNS^PKCFTEOXFISITOOWIVPPNSFAl^RSVTV'FRIPRWLTMCICKSTYA^ 
1 1 VNVTPFEPEWECHVT IE I SNTTPLPAK rYANECIAOVLTFESSTTCEVSYADRKOKYO 
KQQGXTVPCV 

CPn.0193 440229 440723 

<T0)8 hypiithetic.il protein 

KFLTLRHCORKITLMKOLPR5Y3LSLVP.PARFLMC3TEKE3 1 KSNXASPYLVSRVSVIUOOjl 
VCFRLLEEVM I KGfcWV I FS I L I OCFVr DA A I OELnT EELR L05KV3SLCQ 0 X LSA08X08 
OL0UILOHW02AA I EAAL lORLnt I r-KCYKKlJCVnpKO03ENX0 

J-Pn.fli'^4 440727 ML'm.H 

tlvi/.rrB:; Ouowiin prot«»nlMt)»»*yly:;in ih«aiiliv|i 

!^ TnUVSi^lS: mf fl 5 p ISLri0 ' A »'»" *^ : ^ i -*:irYK«::K:;KKyf.wVATrxLMP« 

t ai'^vapl r i*vnc i FKPt«LHM , ;iv r ; r/rrwow i l:k(>j r o I KH\)iXKKvi^:XTtOPm/ 

VNOEESRLLYCYLSLSDCSVKERMO^VO I ri / Y0t0TI1.Ott.YIAI>;:KW* , :;HVPrC>CN 
ITN»I I AT I tX*l. I FO ICT I FTf IJXTtfll J UwVAl tIR I RHVY I HKUYU 
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CT257 hypotnetical protein 

CNCKTKSALfV ICVN t XC r/L0CTY3»*<IMAC/3FKR VRLOYYLTKOKKKARY rNTLI RR 
PYRLFGTVMUT-WALCVGSXSSRJOrRALGrT^ 

PEKLALWGA P X L YY5KY X FY PL IQLIGSLTESLYYLLN I RJCEXIViSTl^RDEFOKALETH 
HSC0OFNT I AWire^TCADOWPLZOVnfLPSSAKVKDrCRT nWTtJINTTPVYHX 
ARIQJVtGXAHPKDFVNXALDZPLXWLHSPWFrrAXSKLI^ 

cEPXGiL3LNArpKrLfw^iAMUPifrtsvrptTriosRimjKni)iQrp0Ym 

TL.v.y.7: vv rr." •^n:KT. , .-::K?K- , „: 

CPnJ>396 444J5* 44JW41 

yhfo-Nifs-related protein 
YSMYlJNNAIfrPPERGIiETfcOICTTLXECTAJIPSS^ 
STOGRVtYTSGATESI/JLAXASLPICDShVlTSGSW 

VLTXEQ X ERAVTPKTSA I X I/JWVNSrTCSAKAO IAA lAHFAQD^LOr HCiATANVCSICEIl I 

YXncn^OOaSOEXLIH^FCXAIICARXPtMfZKCADOPIUU^^ 

WIAl*XEC/ACGTGSACSSGATAJ»FXSLVSie3VDE& 

GXXEJCWERLKNS 

CPIU0397 445124 444361 

PP2C phosphatase family 

EHFVWWTCtSOlGRVRARNtDrWOVNLHSOWAIAD^^ 
rDEOOSPOiCYGDIXYKrTUaiU^VfCW 

FWW3DSR X YR IRIXIEIJUU.TEX>HSLE*OLKNRYGLP KQSDKVYSYRH t LTNVLGSRPYVM 
PO XRNLPCEKBJLYCLCSOCLTIWVroXD IRD XLNO PATLEERQ4ALISLANITOGICNA 
TWLVRIQ 

CPn_039B 44SS1S 44)700 

no robust homolog present in GenebanJc/EKBL as of 11/7/9B 

IlELiMOXENSSILrAEVVMXWirmSAI^ATU^XTLIPICIX^ 
P 

CPn.0399 445759 446523 

CT253 hypothetical protein 
YXLHHVUOCSUJCES t ttK SKMTO^ 

EKWLXK)«ASPOTOOIYAPXAimi*rrXXEOn.PAEPXVAT^^ 

ASVRVRVFD X RKKX IAL rYOEI X ECSQPLTTLVNDYHRYOiQGKHFnSTPHG LMKSRLFR 

EWARVBCYVCANYS 

CPh.0400 446527 447306 

CT254 hypothetical protein 

SKEHSKF ILLLSLCVAAIASXNFT XWPAPSGKTPLICIWlOVlJOTkliWSSLVAI-SVSSO 

TA£LI£11ITOISLAFAn^YWLPXDrrWUX^E^ 

TOLXGIIC4S1Cn.TLVLFTOEXhT0EVTgETO 

GitmnnmnRiAMvzssiirsFtKiBBijsswTvvvirnst^ 

5 ? IALTCLfNLTSLLTLC IK 

CPh.0401 447884 447495 

CT255 hypothetical protein 

KRX«A7SKLIC7TVRAM\A^XKCPWSIi»SLVS«VEKILC!rO^ 
AOJVLTLVLXirriXIJUXmASEX^ANEAKXaCX^ 



CPh.0402 449012 447888 

mutY- Adenine Glycosylaee 
NPIQlFCMIKIAFSQCAiQ^PVEAlJaCWFTJMCAS 

AXSlA0XRGVGPYlVHAXUFAnaUUUUV\nX5KVlJ(VURZFUCrSZOL£ 
A4AIXI>HKSI>E\ftA£ALXELCACXCX!CVP^ 
IFU^VAJVLYDGSLVVEKR*PICEHMAGLYEFm 
TLEF^GNXJCEQRHATITIHlO/HIjCPIXro 
LXYLG0VR5RESIGV 

CPn.0403 449009 449710 

ycec -predicted pseudouridine synthetase family 

NnCLSNDXRAALOYFHEOTSWlAT^ 

IERTESYKVOPCORVSLSLIPSTK00PSXLWE0OYSI IYEKPPHLTTEOMAHMTUFPTVK 
Rl<OXGTSGCLLMGXSKQMTnJOaLJTCQRXIH 

CCAVIFGAAGPSOGEP XKS AYKWDCWVI LLSD4STT0UWSIJ>RSSALSSMLTP 
CPn_0404 450962 449871 

No rofiusc homolog present in GenebanJe/EMBL as of 11/7/98 
ELEALEQKYG KAVLL I ALSELG I OTHS LLSGKRLECFPP I AEVMAACORCSMDFCE XLKS 
QSMOLWAOAASCVDGLtOOPFWSTAI ASG IAKSSLQETEFEC ESKVMVLSSWCEXJGAOVC 
5PFNLER I CMS F PSLKVF SLX KNGCENMG XQL5ASCMNLLMS X FFVATNGG ST P I WITKE 
rOULVAl.VLSHYOCYP/PATCDPORGNILOiPEVNA XLARGMGKRVXJLERKRGGESSSS 
R Y LELAARC FOJSLTKTSLLSDANN^ERCWCLLONSTSLMHT ACLN LQRP PVPT PSGVT 
AHPOPOPDPVVTSOPSLLGAJtERSPVSSRCRFPWLPLSVtSPRS^ 
EVKF 

CPn_040S 451814 450966 

CT105 hypothetical protein 
wrOTSHSR\rtAXKFSRtmRTYRSLCFTtmXX^ 
POAVSSAVOGFl^rrcCAASSTATTTTASCASAU^POOVOAlX 
^ACTSGAJSSSASMQQOLLOLX LPKTTCSGGSSVSSEQ LOOLLSLV SQ HT T SCCGSCCTQ 
AGQ AASVL LNLLSATGSAAAN PLCTAASLAQ 1 I YAAVTS PCAKICTSEFCYNYCGETCOGN 
^PTCGCPOCOCCXa^GRFFCCVWICNCCGICEG.TOEPAX PL 

rrriJMOh 4Mr*«»o 4S28** 

i.ibt«Enoyl'Acyt -Carrier Croroirt R*tfuceas« 

tttFHLK I DLT* JKVAFVAt S (COOQGYGW* I AKLLAEACAT X IVCTWVr IYKI F30SWELCK 
mRMtUMTVUZl AK I YrMQAGFttJrEOVPEC I AENXRYKG tTtTFT 1 .1EVAB0VKKDF 

• an n r lvw;i an: :i»k i <ac iLUrmniccYi au^vu:;y:;fv5:ll:;hfoj i hhr* hst islt 

VI A'»OtAV^M»::lAKAALEnnTKTLAWEAC:t(PWr3 IRVNT r:^;rLA?;RAJ^CAIOF t 
tMXWWjFMWlVFMWAJXWM/sAFW (GPEMFPK 

u: 

i'l1t_04il7 4M7TI J«,2H^H 

IIAU mipiTi.mL Ly tiyhn Lino/tjlwuptut .i:>», 



irfYrtUHEXLL'.TO r CV7T HL0WC.-/ER LYAi*C*;WKLFFLT^PYYirYAAPLFS3 
FDAI^UX0NGA^VW,-.7T;^NLLY3IWLP35LL»: ; LCX^EnATALF^VESGAPYCTHY 

"rswiAQOLairypraig™^ 

Qjr rr rcnrxL T tf A ' l W IU <»^PWXtt^TTCgV^KCKAL W WEI PTVMA 
SGOQAN0LCS.IEKGOFK tVK3SAPE£H)A/KADFLAPPAX)ltt4GlIJAWEACVRYTtXRJlSL 

CPn.0408 «<0?C 454581 

CT102 hypothec i»:Jt protein 

.Tpujerrv : :::.\rr:: r : *\-uk' -?rvv r.rrr: 

»■'• .*VWAK^' ,; " "• "~ " u " f " : " "'" ~y?r " " v ' ,: : ".".7I>"~*.', - * ' ■T : *r>'rTI^.'T 
EFPPOTUlWLWUA^wwJJJAT^bKwKA i^ii i .KK % \ri7A#t 

CPH.0409 ««45 45512? 

CT260 hypothetical protein 

KntTTLNOflNLTXFLICSSOEEPFLERESCLTYXNXOANCNXLPLrFVIRS^ 
L^YQLMESKICASTARLLKL^^iOXPGFQ 

CPh-0410 455087 455833 

dnao-ONA Pol XXX Epsilon Cham 

DVMJKSNXIOJVKSSQllDVLZrmeT rCTCXERDRXXEXAAYNSVrrESrLTYVKPEX 
PI POfy^XHGXTTDAVLSAPKFPEAYECFRRFCCEOS ILVAHNNDGFDFTT rfiCTTWH 
SLEPLTWrriOSUCIAOICYIVX&PXKNLOYUlQ^ 

GDU>P00V^l^SYHPPCVF1CMPFGKYKO0PLVOX PKSYTEWJ3^GAU»CPO0CCIKA 
AXALLK0PT 

CPtV_04U 455794 456609 

CT262 hypothetical protein 

WOSRYSSXTSTWILTAAFSPCIWirLrwriJCDTOFRPlJ^C^IADIE^ 
Wtf^UOO<SAALFPLVSOYYNlJC*TaTO 

LClCI,YYPKAia»XPHPYmXLSAXL0GICVXX3GAX,IHtX^ 



CPh-0412 456515 457246 

CT263 hypothetical protein 
EPX^TiaCPFtfrlJCUaCJXYXCSGRPWUVNT^ICXIL^ 

LYSYRCroYKIJ3LYIVH^^*3STAVl>ULOSYCOAYTIT!fDLfc^X NPOFVGACSPEX PLO0CY 

TXtXIAKLTTOTPPVLSEXJPPYXFCALPIWIJItSSLVTSPVLYHYG 

XASOAAEIWIPCSWITSt^mmSOCTFSIU^EVSOJa. 

LPCP 

CPh.0413 459209 457227 

msbA-Transport ATP Binding Protein 

GKI^«E2^KDIlJ2*OMSKI«rrLTVSDATTYIAEHGKCT 

VSRntGXAXFUCVAIFICAVTLrrORFLGOVVAXRVSROX^D^^ 

XGNLSNIWimSASIAXAVNSLJCXllYXOAPXTFXLTLGVCLSXSW^ 

P XWXAAXX XKLAXRX0XS0OS PSSVLYDFlAGVMTVXVTItTEXFAITICYCSOBBKXSAL 

EfltSAAYGlXPRK-LKTIASI^ArVV^GIYICFAIPPErU 

ENTSXMtCaUUURFYVVLNHPOLHSCttBUXEr^^ 

STTliO03EALCXVCPTGSGTaTLVXliPRLYEV5<X»ILIDSI^rT^ 

QQOMJOAJUtfAPtASIUXJEATSAIJAISEWYIKNX TGTLIQ00CT0XXXAWCLTTUH 

VDRVLYIX3«30KIAEGTiaCElX0TCPEFLI04WSXSCT^ 

T 

CPn.0414 460203 459172 

aecA-AeCoA Carboxylase /Transferase Alpha 
LCXAXVCXJTCLFXRGEHXlJfELLFHEXCWEYZX^ 
RI^JKLXEXIYSDLTPWCTVOICRHPSRPRTWlf XEGMCEEFVEXGORT^^ 
VKIOTRTVLICOEMSCOTASRLHRNrcjKLCPECFRXAIJU^ 

AYPGLTAKERGQGWAIAKNLFELSRLATPVX XWXCECCSCCALGMAVGX3SVAKLEHSYY 

SVISPECCASXLraPIQWSEAASMlJQTCEN^ 

VRETI IQEWLRIJCDLAX EELLOCRY EKFRS XGLYETTSESGPEA 

CPn.0415 461522 460221 

CT266 hypothetical protein 

SQTGFLPGLTLIFVI I IVVOiArLlKLCVIMGLOSRLOKCIEVSONSNrOSOVKOFIYAC 
QDXTtJl0SVlJCiniYHPU.KIHDIARAVYLlJ«AIXECEOLC^FLW 
GCFPVOtGLPYPAEHAEFCLLLLOXAEFYEESOAWSWWHrOOAl^DKOGSVFPSLWSOE 
t&RJXKEKrrLSOSFLTOLGKOXHPr/SLED^ 

SaOVt^IAYCPCSCDISIXnrYFCCCCIAKEFVCOlCSHOTTEISrLTSTW 
YLROSYVHLPIRCXITISOKOYRVHAALAEATSAffrTS I FCKCWCOWDCPRLRSCSLD 
KVT^E^RFFT WN^^^ N ^ VS ASP YME X FALQGKEKFVftjADFLXN X PYKEEGVKL X FEK 

CPn_0416 461871 461557 

hinO/ihfA-Xntenration Host Factor Alpha 

EAUNMATKTKKXLIST ISOOHK IHPIfHVRTVIONFLOKXrDW.VKGDRLf3TlOrGVW 
VERKPKVCRNPKNAAVPXH X PARRAVKFTPCKRHKRLX ETPNXHS 

CPn.0417 463047 462244 

amiA-N-Acety louramoy 1 Alanine Aaidasa 

REXGMXLTKYLNTKOLRSM I SR LFVRY SLFMSKQLS FFALCVLGSH P I FAOTFNPPORVR 
R^EVIFIOroHanCD0CTASK£L«YEEXSLTLriJkLTVOT/LXR^^ 
LnKRVALSNROOfXVF IS IHOWSSNAAAFCTEWFYNnr/GSPTRNRMSEVLCKNILAA 
MEXNCI LK.sRCLXT ANFW I ROTSMPAVLVETCFLSH5P ERAALOOARYRKH VAXG I AEG 
VHNFLSnPSFQICPKON I AX IRKPOXOAN 

•rPli_04lH 4i>440t 4ii2*»51 

(wirii u A«;«»ry iwiit.imuy Ut.iny l^luta«yl OAf« Liu>i»i; 

K1JI *KLI JttTVUAK I Yi :KVfl PLEVRNLTRD3RCV5VCDIF fAJI WYjP YUVO^AVnAUNS 

At AIA;::U.YNI«KUWy f fTrNIJJttLEAEUAKYYEYPEwKLHT I'r/t^JTVSrrTVTGL I 

KAiaic.YOKi-ja ;i j* rr r »i ru^a.v rrtrFTTPTPALL^r/L/\Ttf/iffjNRr>A-/ , /KEVrL'; i 

• Jl A:* 9tVAY1NFirrAVI.TN ITIiJIIIX>FllCTFrr/VAAKAFLFiLVI»MV^W ft/TOSPYA 
'IFJIAKAI-V ITY»:i l-.^\,\bYPATOtOUUi:^TKYTL , /"/GOOK lATl^nPiraCYNWHL 
I AA f l.TVIIA: H4fl 'I >J 4-3 4 4 j-jc I f \U? 4 PI'l » :RLOPVLMGP':» i »/-n UYAIrtTllAI J JWI.THL 

iiKijjTr jraiLi vw i \»a h«i A:;ri<rLMA^ , WERYCFA-/ , /r:;»iM'::Kprw/t VNP.tt.-ty; 

Vt: :kN*/K t K 1 1 * K^iA ITVAL' : I A' *M VI. I AGKGHEA70 r FKI *srtAV\A*'jr* *KVLA 



91 



WOQ027 994 [ file:/A\dcwas03\rirmdata\ l p\FoleyPat\PatentDocuments\WO,0Q2 7994,cpc] 



Paae 9- of 330 



WO 00/27994 



PCT/US99/26923 



CPn_04l'» 444977 4*4p7« 

pop J* tr-inoalycn Use/ transpeptidase 

AiXXWErCVPDPrRRCTrFANTTVWCMWOPFAVDITrmrAO 

ITDYORSYPFOKLLCO^rrUlEIKDErraCAnT^ 
NRLt7TNWtKLP«rzmrYLTtNWariAIZEI£F^^ 

: ,v; v :■ wfui -r.-TKVKEBrE- r ; 3 rrr 7.7751% 1 » r Mr r : :r:r- : .•^zxtfxxiur.zz 
yv \ ft teeth ' ^ttjt :w s s:; :.rL : *::.^:ra«K\iv fc, . , . , -. , .v"/Ao:-»rn : :.^r-" 

VAWOOKIAALCFCRKTGIIXJSEASGLVPtfMRFHXi^^ 

cxoKWYAiukNOcrrAVRprLmrvsASGezyKUTicmRLrs 

TLPOGSGFRASPKHHSSAGKTCTTEXNIH^^ 
VSIDDPEYCUUDCTTCNYICCRCAAPXISRVA^^ 



CPn.0420 467120 466824 

CT27i hypothetical protein 
KsnwNKSRnjuccarrcGsxjYrYxi^^ 

LI OKItRPOHLKEXAALPEYOVLEYPSCES I5LLSYELP 

CPnJ>421 468007 467108 

yabC-»PBP2B family aethyltrenseoraie 

£ I LMSERAH X PVLVSCLAXJ , AORPPOTrROVTU2AOCSKAYAfLCAYPSLTGftXSSOroL 

OALAIAEXPXTTFODRVSFSHASFEDIANOPTPtlLYtX^^ 

OEKEEUMRJfflOTOEI^ASDVUSlJ^^ 

SIOCVKEALLCVrPHYRrHRXI HPLTLIRJAtJWfWSIXWLJCSLLTSAISWLAPOCW. 
VI ISPCSSEDRPVKWPnCEAEAJKaX^rrxiW 

CFn.0422 468233 468784 

CT273 hypothetical protein 

GLAKVEI?WSTSIYEO*ASIeWIVSDFJ«£ICKffiISII^^ 

R&0XTPFLSYXDX * - 1 ™EDP- I ' t 'OPWOOGXVLLF^r^^^VXSTO^IDmSRIFO 

CPm.0423 468788 469216 

CT274 hypothetical protein 

OfZAlEWXAXLCUSODaXEXJlXSCYSnJtOGHYSlCAXX^rEA^ 

LYLO XCSKSOAXAVUX2ALX140GOKLPTLX2KTXA£«FCIX3 RX EEATAI ATYLSSCP IP AX 

ANDAfiAXiMSYSKATiaOiAALVR 

CPn.0424 469S28 470961 

dnaA-Rep licat ion Initiation Factor 

SRCNEIFSPSlJt3WVDCXVESrXMC£SraTa<&CTTm 

lOVUEETOEJttBI-EVPNIPVCNYIIJWYKRia^^ 

ASOKfSNEOISENTEEITOFEXJOJC^YRFIWreEGF^ 

FXHOXMSLSKIIilAHAVOHYVEgHH^^ 

LDLLLVDOIQTLONRQNFEE Ei Q/rF 1 T L INLSKO IVirSOKPPSOLJCLSER I XARMEM3 
LVAHVGIIULETKVAILOHWEOIGUJPNE*^ 

FOcffiTEWKETiJCEiFRSPTTOKXsvETix^ 



.*H0PSt;^APTXJ«R.:L 

K rKTTLncyVPHSWP^YrUCRYSSVLlHSOCUSYR IFAJCVLAXLLTSLAVAFAV? 
LFSCCCJOLRLC ALY I C I ALAX CVLL7 T V.'YC I AJK I A? ACXK PPS I SR I C I V 

CPn 04*2 474817 47*514 

\ " : •„-.: i* • in- « 

? •>tnr.v ; - v'*:v'v.*.* tk:. • v :«w ?: vsv: ".rAwunw 

JLFLWiAPLwLJ IN lAA^.IT wKXV.VAOtfkVK IJNALtK rKVAAkUS 

CPn.0433 477327 476929 

qcsH-clycine CUaveqe System H Protein 
RTmLYGTL7RTCSWCVMWYSDYHVW!UW^ 

SXCKBCtVLVILCSSXSA I EVLSPVSGEVID INXXLVDNPOK INEAPECEGWLAWRLDQ 



CPn.0434 479471 477276 

CT283 hypothetical protein 
pjWRIYOQOLFCPXCRDPAWFFSIXSrTUnra 

CV9CICVT>CRPSRXCS AESTSRRTVTTT ASC KRYVtKDFMAKKKFTAH tAY PfTCMPRA 

teVXNZCLLTOYPXTTRWCXLPlJCVYMICCXXFSXEX^ 

AFQXJXILXVFOOIENPISKEGFIJUUKLFI^ERJIFPH^^ 

TTPFVOGKOSXQVCFrRX^KCrsncnCOCXJCAFCNnrXJ^ 

KAIQgPXVSRRfSIOYKRVAl^DLAATVPtfW^^ 

DFQJOJCPAlJraXSlJTMCgllJUMFERILOSI^ 

outtVLLMVLttYSOtwEiTnrrxiv^ 

HfWj r? ^ 1 1 * PCEglASIiCRRIJWKVVEWKRIjCRKLECS FSNSLORSLXTTSKSOKZL 

POCFTKIFS>OWCDYSSVl^PNIEPCYYOCl^KIXYI» 

YMERFIBOCWR 

CPIU0435 480908 47947$ 

Phosphollpeee D aupertenily (uncleavable leader peptide) 

GVHKSPJJtFPdJUkI£IFFILLVPNSVSAJtt™^ 

AKFYVEL CP LXrUO R T L KOfVDHLEARHDLVPELCSY 1 1 tQ PTF Tn A r P QXI t JCMJ3XH 

Hm mv rro c>P S TS ii*giiviD«iiw 

ACPPLTLEOA E&rV F W F Ol OgDLVLVKSKIRXV^ 
SVXLAKMYFI PRD£LIJiAtVt7VS10«CVHl^LITT*3CKt^ 

YOTnrPUflucwt^jttJariKvsryEFAivrTO 

OYESTWIESPEVAAJCAWCVFtODIGX^XPVSHOT 

CPn.0436 481433 490902 

lplA-Llpoate Protein Li gate-Like Protein „ rir t .. 

FLLSWf ACLCLOAAVR PTOQGFVFHJCGOYAFSVIJtS ATH PSYS SSVUKYKTYNSFVIXV 
LEJCVFRXOGKLAPTOOSS SRDSGNFCMAJCTS JCYDVLFCDXX IOGAM3RXVQOQFU82CS 
IJLSGSSSmORFIJCPrVIfEXXGQIOXKAFFPLCIXAADEVLOm 



CPn.0425 470965 471564 

CT276 hypothetical proteina 

FRCCPlCFRRTCKCPFC)VQTLYE£grSSPSSYSPYSRSE^ETPPSLFDWP 
HNLTCESSLPQySSTPRTESLLPLESPerTtjSEUVrP AU£LAF£RLLRXDCTFESXLVSK 
OCIIICPKCVVICACIOLOEAXXB^ramSCKVllJlOCAI 
ILSYLAXAGXTOKSEREROL 

CPn.0426 472111 471536 

CT277 elmilarity 

MN^SLlJTKLCYCCOAPCAYrCSNCLOCIIiVEDRBCRCLKC FRYLGSSETPXCSQCSPS 
SQlOAFSLYLPSOYALSVYARACEXaQtPALOFFSXS IAFELASLOCTPSC IAYITSTIS R 
KIVVEVAHXIOI*IPI>IPWLPIUCROIEXLPK^ 
VSISLFLS0N0Q 

CPn_0427 472153 473715 

nqr2-NADH (Ubiquinone) Dehydrogenase 

AVCYVFERVCA5TFUXTMLXXFXNSLWXLC0OOKYORFTPIVOAXDTFCYCPICT 
PFIPJ>SVDVI«WKJCVVIALFPATTVAIWNSG^ 

FVYKEIHIVPILWBCLKIFXPLLTISYVVOCICCVLFAVVRGHKIAEIXLVrCXLYPLTL 
P PTI PYWHAAlfi IATC IVV5KtLF0CTGKWXLNPAX«30RATl < FPTrPAI0tSGDVWVCSWP 
CVIKDSLMKKNSSTGXVLIDCFSOSTCU^^ 
VIHSOFSLWIWPGWLDWTLTOL0TTOTAW 

KFSAGNLFWCNI ICSLCETSTFACUXUIFLIVTCIASWRTKAAFCICAFLTGWLFKFIS 
IVXRLINPAYPGSVMLAXLLCaiVFAPLXDYFAVRKYRKRGV 

CPn.0428 473719 474681 

"nqr3-NAOH {Ubiquinone) Oxidoreductaee. Gaima* 

NMSKC3SKHTVR I MOTWXVSFILCLSLFACVU^IYYVUPIQEOW 

AH I L3FKCH FO 1 0O(KWPATFI3KTOLlXVATKICVSgV5YPEL ELYAERFVRPLLTDA 

PP l3GrCLWDP (KCYXCVKNDCX7TVIjCTA]^f00QETS*GtjCAN ITNPBWOBOFVCPCKI 
FLOOSCCTTNFATTtXCLEVVKCSVRTTLCOS PKAL5 A I DC XSCATLYCNCVTCAYVOSL 
ACYROLLINFSNLTHEKRTCE 

::Pn.04*»V 474i»6*» 4753 H 

t» ir 4 'NAHM tuhiiiuinone) Rtftluctase 4 

KFNPP#1TI^(K^VK^FFO^UC^NWILlAlLCIC.^LAVTTT , V0TA 177*? IAV.1 1 VTTJC13 
: SFFV'Jt .LB K FTPnSVRM f TQL 1 1 1 :J[ J^' I V I DOFLKAFPFD liJKTU TVFVCL I ITNlTlVM 
• HCIftLARIIVTr I TAFLir,FA3GUiViVVLLVXC/IPELFCFr7riJ«;FR 1 1 POFV/AJET 

urr/iYyrii^u K/uxrrjsKFLu; in iwxu i Ra^cKPrn 

■'Iii.'l4lll 47S12i 4'.'t,0*> 1 

tm\t'* flAlJII HftiiipiiiBtue) HisUi.rc Jfw S 

»w /;Avivfi /ww : i lluaaf r on i llanflocl-yi AL':.Tnvr7rAM :u*c;valvltvt 

•U\ll»f»^VIIAKIT».TKAI.TWl::p.'nA:.-VNUUFLELI rFIWIAAPTOII.KLLLEKV!;nNLY 

i ;i n j -l i avh:a i ii j ivlfc rrRrnrpF r i f:xj w rx mki a c v t latikeklay 
riii irriu.i m;i : ;f irra. i aha^tclta* i o i ^kk; ak iyj<APUTrewFJTrmpLKE^ 



CPn.0437 481810 484350 

clpC-ClpC Protease 

VFifFIXrTTO^GVXXLAIOCEAORXJJHNYLGTEH X LLCliKLOOCVAVWXJUIUJIIJFIJT 
AROEVERLXCYC PEIOVYCDPALTCRVKKSFESANEEAS LiEHNYVCTEHLLLC XLMOSO 

IXKDCLSALJCAYCYrtTEWFISXXJPVXGRSSEV^ 

TAJVBtnJSOXXXXJ18yPQAXJU0XRLXTI£UUJ(IA 

LLnOEUfrXVCACAAXXSAXQASNILKPALARCEXOCICATTXO^^ 

OKXVVHPPSVIOTIEXlJtCZJOaCYEEKIO^XTCCAXJCA^ 

LDEAGARVRVNTH»PTOOOX1XA£X12TOATO 

SMKOKVfDWKECKOVPVDEEAVAOVVSLOTCI PSARLTEAESFTf TJa.lTfrtJUUCVIOOjN 

MVTSXCRAXRJtSmiXDPNRPTCISIlJ'I^PTQ^XEL^^ 

SEYtfEXFAATmtSPPGYVGHIXXSKLTEOVRfWPYC^^ 

GOCRLTKrCRJCVDFRKAXXIKrSNLCAX^XRXSCEXCFCUCSH]^^ 

HXJCPEFXNRLDESVXFRPLEXESUEIIKLEXNKZ^RLXNYOH^ 

KSPEMSARFLRRVXEQYl^PIAEUJJCZSCWEAJUCLM 

PSPHLES 

CPn.04 39 485455 484334 

ycbF-Pf— loop super family ATPate 

NLTL«WPJ0VREII4D(nVXVAMSaWSSWAYLFKKrrNY7WICLI^^ 

SSTTOJYEDVERVajOLOlPYYTVSFAXEYRERVFARFLKEYSLCrrrPWro 

LLOUCVCELOGOYlATOfYCRUnTX^ETOUJUKOPO 

CEMNirrr/PJVIAAOAALPTAEXIUSTGXCFIGKRPFKEFLXXFLPfa^ 

GQHCCAK-nrriC^RRCLOLOCSDCFOrVVCnCNIEIWSIY 

WFTPPK^HCSAKVRYR5PO£ACriDYSSCPEVKVRFSOPVKAVTPC30TIAFYOQDTCL 
03CVIC/PMIPSEC 

CPnJ>4i> 485523 486077 

No rot-'jsc homoloq present In Cenebenk/EMBL as of U/7/98 
I ISSNTIP.VLFVSSTLNCVFPSSLPEESADLF XTNRCXVALCEKCNVFLTHS I PMH IAAXT 
I LVl VALACXAX ICLCCYSC5ILL IAVCIVLT I LTLLCLQALVCF I KFtROLPOOLKTTV 
OF IRE* IRPESS WL\miAORKTTQimJU.YEELCD^ 
* KTNQN 

^.044*3 4flt;740 

m ro&'i=r. honolo.1 rr.rewir. in fUmotMnH/EMCL >ir. ot n.*7/9« 
LAT lP r 3TfflMAT:?VAP:"rVPF.;J"PLSJ IATEVU1LPWAY [TO PI) P t PAAPWETTRSKLSTKM 
714 'FALTLLLTti OT I . WrfAt iYTYXW ( HVltW.l IVI.TI.I LALLLAIPLKNKOTrmCL 

iw: I zj tc:a jfvor yi:lmf.tt i k vi ili elttowektp i lne i eakkk ionlel 
k rTPJf.v:FiA0KOPRHK:ivK.';F:4p:: i Kin.'iKNPV ilftji: 

';|«iiJM4J 4»l».i<7'i 4H7HtH 

'TIHTI f.'/p.W )Ktf i I |i| r,r . t k „ 

KWK rU^34FKLLRI I AAFAiaiVl-TTr thM^I < A<;f;iliK^A/J'r!r«PH<PK:yWAVyiJWNO 

AFHtKL;«tr/rit;YrtOYO rr fi/ttl I* itm i: j ;u iy i'*JUy|ywKr»:;U"l::CTOPli3u; 
wATFOLr^FYNYVLL:;u^YTL:;ma*>f:: i iijy u.vritiaif Fit «ra.vyi WUXaCYOAT 
KKir^v r fv/rNcn;uiOFj(Awi'i.v»^::YK atijui.ti jf: i v i-/rirr: t r<.Y«:T: v ^cuxay 
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HLTHPnitKw»wiMLis:»r.irr/v:Nr.:sA»r/K.. n-LxrinnrrMr. fwotsxADDH 
knhxtj] rmrr. :AFTTjrr.AV>«iF 

-Pn_0442 4OTTj2 4«*52B 

CTOOn nyoornctic.il pron^in 

N ILFOTDMCFKII ICKQCSQLYLNC I FPER Z LAJUCUUCAJCSYPRTALTIEV^VSSVLGA*, 
KV TLX FCACTYAALTLPLRALFl IA IICTK3C0H LA3YAMAWLLH I LT I AV I »CLVFSLVT I 

pppwftsltxlmsvttt/tlfovhi^ 

i.TOUb^rvpotnBCicul protein ..„, fB ^«-y«Mff 

WSHSOPPXWLOOPOVPAAASPSGOPSVVXRLICTSST^W:^ 

CriAIWUXAXLSXIATASQISlJtt^AIAPAi^ 

IVWtIVrAXJU«3LIJWVASSt7nO.NTANPHPV^ 

ujwnaat3rrA£PavTPSAi3DMacwvauucDi^^ 

A0GSPTYSSRNPDCTRSFGSL5RF1CTICPSOAASTRPASKP 
GSGASSAJTPXMPASSRSPl*FST0nVli<PEPVY«C0CIO»SXPRVS^ 
QOOONQDEE0KOOSKJCK5CKSN0SLKT PPPOCXSTANLS PSNPrfiDCYDERDCWtHWNK 

CPn_0444 490266 494507 

pmp_6-Polyn»rphic Outer Membrane Protein 

KAFPOTHMX YSLPWU.TSSALVF5LH PU4AANT01JSS0MYDCSSCSAAFTAICETCTAS 
(7TTYTLTSVVSlTWS*Z79WteCfTNTGClAl£FVZADH&Vl& IALTHOGAAWTN 
•TAJ^PfiCFSSaiOSAPATOrSXI^ICVTTnXCCTATrrDNASW 
SAYS irjlWUCTTTAAIXDQW'grWCCSAXXSTAlfTTVQOCCTVT f SSOTA TO 

GGAICCYLATATDKTGLA I SQNQ EMS FTStfTTTANGGA XYATTCCTX-DCTITT^TlTJOfr^^J* 

AGOTUXYTCrTDFSUCSTOVrFSWAICTaC^^ 

SNSSAWEXSOTAItJinDSCSVSOTOliXAllNOTCLTS^ 

fCSLTTIXOTAGTSGGAJYTETEDFTLTGSTCT^ 

LLFSO«ATCPSNSSAl«£CCrXAILSFLESA^^ 

AIY AWCAUOITTX.TFtXSffA£TACCAIYTETn5rTLTCSTUIVTI S 1 WT!AJCTAGAL^T 
KSWTSFTIttnCALVrSCHSATATA I 1 1 ILOaXGO AIIVCHXSMUT^^TOgSW 
INNrAKRSOGGIYAJ>ICC\rtK^XNrTGOTAXTS^ 
KKSAXYIJWSGELSLEMKffltTTSGNRA^ 
YDPXTMEAPAaCTIEXaArtWWKAmPPQPlOWXAS^^ 

GiajsoDASXPANm:t*oiaNiw*flc»vvLJc^ 

TTTtWTDGS X DUOILSVNLOALDCSiaOIITXAVNirrSOClJtl SC3^ 

KANLNLPFIJLSSTSCTVNIifflniPXPSSMAAPOYCW 

AIOTPKPEXJUUtVP*SLHNAYVNIHSXC«EIATAMSW 

KEKAGTRLXSRGYXVGGSHTTOEYTFAVAPSQLrc^ 

WIPIiaSUlWfVLSKVLPELPCETPLVU^VSYCRNHM^^ 

PAmOGSLPVDUmiYtTSYSPYVICIXWSVNOXCrOEVAWre 

LTn0<ESAJCPPSAIJXTI^AVOAYRDHPHCLTSLTTCTSWSTrATNI^R0AFFAEASCT 

IJOIJCGIXCFASGSCEIJ1SSSP3YNANCGTRYSF 

CPn.0445 494739 497579 

pmp_7-Polymorphic Outer Membrane Protein 

PNFtVSKXCUaMKSSVSVLFFSSX PLFSS WIVAAEVTUSSSNNSYtfiSMCnTPTVPSTT 

OAAACTTYSLLSDVS FQNACALG X PWSGCFLEACCDLTFOGNOHAiJCFAriKAGSSACT 

VASTSAAI»NIXF?rorSRI^IISCPSliLSPTGOCAXJCSVCM^ 

rOOWinWUTXSCTSOrASFSRMOATOKOK^WATCmiQB 

OCALYSTCNCS ITONFOVIFPGMSAHEAAQAQGGAIlw: I ILKTVTLTCaOCN LSTTWT A 

LTYWXSCaJCVSISAIglTWSWISCSSACQGQrtr J klNXASA^ 

NQVTNBSTSTRNAXNI t DTAKVTS I RAATGOS XVFYPPIWPCTAAJI CTIW <UPANS 

EI EYGCAIVrSCEXLS PTEXAI AANVTSTI RQ PAVLARGBLVUID3VTVTFKDL7QSPGS 

RXUSCCTTX^AJCEANLSUCIAVMUSLOGTtflCA^ 

FYEKKNlJtSASTYPIiELTTAOANCTITLCAI^TLTWEPETHYCYOG 

KIGSIIttnTrrxrnPSPEIttSNLFL?gL»Cre^ 

NFFYTOSMPTWOTUUSOCTfAMITA^ 

ASLYFHKTEOLrOIANFLWCKATRAPWVLSEISOI IPLS FDAJCFSYUTOTOOTYYTDN 
5II10KWI«©AjauTLGASLPFVISVl*IAJO^TO 
KSELXNVEI PICVTFEimSKSEKCTIfW.TlWILnAYPJWPK^ 
LAWPSyRAANHFOWPHHEIFWAJ^^ 

CPn.0446 497602 500415 

pmp_8- Polymorphic Outer Membrane Protein 
LIEPKHI^MKIPLKKlilSSTX^PItJ^IATYG^ 

ANGTTffVLSGNVnflNDACKCTALlU.Li' l fc .1 l UJ LTFTCKgrSFSFWrVDAGSNACAAAS 
rrADJUkLTTrGFSNLSFIAAPGTTVASGXm^SA^ 
SAXTTKTt^ISCNTSSITFTSNSAlCKi/SCAIYSSAAASISGMroOLVPWNIffi 
CFEASSSITCWSSLFFSGNTATDAAGKGGAIYCEXTGETm 

CCAICA>CLDLSAACPTLFS>WRCCOTAACKCGAIAIADSCSLSLSAhOCOITFU3ffLT 
5TSAP7STRNAIYLGSSAK ITTCAAA0C0SrYFYDPIASN7TCASDVLTINOPDSNSPLO 
YOTr/FSCEKLSAfiEAKAADNFTS ILKOPUO^SGTUlLICCNVElJ^FTCrmSTU. 
MOPGTKLKACrTEAISLTKLWDLSALEXWKSVSICT^^ 

5KT INOAFTCPL WFTAAT AASD I Y X DALLTS PVGTPEPKYCYCGHWEATWAOTSTAKSG 
TMTWrTGYNPNPERIU«n/VPDSLiaSFTOIRT/W 

KKDitSG7TJ0AFP.HKSYCYI VGCSAEDFSEN IFSVAFCOLFCKDKDLF X VDfTSHNY LAS L 
YL0HRAFLOCLPMP3FGS tTOHLKOI PLX LfWQLSYSYTKMrJKDTRYTSYPEAQGSWTNN 
3GALELCC3 LALYLPKEAPFFO^FPFLKFGAVYSRQGNFKESCAEARAFIJCGDLVNCS I 
PVSXRLEKISEDEKNNFElSLAYtMWRKNPRSRTSLMVSCASWTSLCIO^ 
AC3HLTU5PHVELSCEAAYELRCSAH I YNVDCGLR Y3F 

CPn_»'447 500541 503351 

rwp_.f -Polymorphic oucer Membrane Protein 

rVKPPlALVMKS5UIWFLinSSLALPUlLNF3AFAAV^i:iLGrTNSFSGPGTYTPPA0T 
TTUCrrr fYNLTfinvS I TKAi wptaltascfketthiilcfochcyofllon I OACAKCTFT 
NT/'WfFLUJFyCFL'YUXl vXTNATTCTCA IKSTTiACGlQCNYSCYPnONFSNXJNGGALQ 

^ ; . : r j.u i pnltkak w atvkg^.aly.tttx: r t r rn rr ln ^at, f .1 ent,\akhgga : yt eas 

::F KBITA l.:F I Wfc JVTAT: WTii'A [YCSCTF.APK WLTLT,CNr.ELNF tCHTAlT.»OCA I 
VT»JL7l^:Ji a im JKNNT.AIDTAAPLOCAIA I AL:/SLJLCALt7:DlTFran\n/KCAS 

:^ vrrn»n:UN i< jwtnak 1 volra: w/t 1 Ypyny ittz :taaludai ^ufjpouacnpa 
/'/it rvrri :cKua'AFAAKACNLK;rr i< a; r- lt i a* 7 /, l ; l k» .vr 1 . v ak f ^a* : PC .TT L L 
wia* rrri j-.tai* : it 1 nni.vi/ivtvlkctkkati JGvrvALVrvn.-v ^jlvophtwyed 

V: MWII V/F:A.'LTt.TA! WAN I It ITDLAADPLKKNP 1 1 W.'l*/ *WA U =WO EDTATK5 KAA 
T! .TWTFT r/NFNI HP Rt ?n .VANTJ J* :: JFVDVR : : t C^LVATFVRV: VKTRO rWCETi 1 3NF F 
IIKIX7TF tf nci *KII IVVV JATTTl A JON L lTAAh v XiLt r IKOMUIIF I NXNRASAYAASL 

iKjyiiLATi^^it^a^YUi s:rt;EOPVLKnAot:ri' 1 v:;wnrirrY\*TxjAPi(CErL*vyNDnc 
At ,ki a;:: a j 1 r rAi ^ ai» :i *i iay yv? 1 kvka- : y 1 i iol: ;f k wrrrrLVK : :f dwol 1 nvcvp 

I'll TFKP F: :»'WKJ* A' ; YFATV I VVAI iVY« KM! •! » TTAliL I f ff TT. WCTT* TTNLSROAU t nHA 
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cpiu0448 ,r;r- w 7 *« qWM* - * 1 * ^ ^. 
1^y?Skwswpfiemfef^^ 

PT^tlol^WKCXYAACCP^lXX^ 

:v:;::v!;;a^ -v^ v-:,^^: 

aJFA^ECOMRMTEIECWKK lAFWEIAKfcl'wJ 

CPn_0449 507231 505330 

COO 10-PMP 10 iFWihitt iritn 0451) 
gS^ermt^SKWXVOOTACNOSAXSILAA^ 

TTWOK WWSTAKITIttJ^AI SCHS I FTYDP XTANTAAfi8Tl7Tti8M(ApAfiNSTOYSCS 
IvSt^EDEAK/AXWLTSTlJWPVrLTACNLV^^ 

fSjSsTTSvrtT«iPvcsw 

<-w*wt«iii wt cat r?r AlTTPirp A VFTVATrl HYCYQCT\OHVVOOTAS I PKTCTATLA 
W^5!raS^VI^L|4MWIOAX^ 

ncnrjuCYIUOCSOrMXOCAAOTCSEKL I SF AFCOLFCSDKDFLVAXKKTCTYACAFTI 0 
ujyw^^7 j CCLLCXLPCSWSH KPLVLECOLAY SHVS>fDtJCT1CYTAYPIVK0SI4O04ArN 
MttSSHSYI^YlJCFtTrYAPYIlO^TYXRCM 

k fE5»o cmdf5«ti^yvpouiwdpicct 

YATSPMFEVWr/FtVRCSSRIYNNaJLOCKFOF 

CPIU0450 508121 S07180 

dto 10 -Polymorphic Outer Membrane Protein 

SCnbxSOrSVWLSSTtACITSCSTWM^ 

TXTCOXTlXSMOSAAXTltOCrSITrTESLS^ 

SLTGPSSLTFlAAKSVITTPSCICCAVircOCTLTTX**CTIt^RODYCEIl*X^ 
SLBimSXSrEOaiSSATCRlcaCMXCATSTVDITtO^A 
CTXTOTOVFSIMSVrATAOJKattJCOADVTIS^^ 
LAfiaOOBVSPrtTX 

CPIV.04S1 508156 511058 

oao_10-PMP 10 (Frome-thict with 0451) 

l^^tfKIKILPSCFVXfKLXYXFCr^XOAWSSIJOJKSXTMXrSXPW^VSy ^ 

SLAKmi^PI^n*3NIDSOTminSATTYSLTGPVI- F Y tPO RCT P LSOSCngTTW 

LTFTJDOTSLTFOTIDAGTHAX»AA5TrA*0OILTFSCre 

5^CVKLl3nwaVVA£WSTAlX2<^ICr»S^ 

RIA»nxrrVRFl^ASTSXAIDO£SrrSILS2«10XYTK 

ELIISWm^FAWAXrSOttAXKAJOQjaJSOCFTEfTJtfW^^ 

EIJLSAETCMITFVlWT Lin ' lUS 1 Ul P W fttKICSNOUTEXJUkAiWTIFTYOPmE 

mSOVXJCXMCSAfiALIfPYOCrXLFSSCTLTAXmJCV^^ 

rrrmryr ^ mT?r , tn jfr}V* crrrrT gTTAfrCTTrTWT/TTMVPgLHtJm>VgLTAKCA&M 

KVXV8Cn^DIIZ9ITYXS}fl<rSXD0I^LtJC 

YOTO^OWNVWWTDTATVrltEATATWITTCFVPS PEJUCSALVCTmiCVrroiWLOOLV 
EIQATGMEH KOOrVAfSSMTNFLWCTCD CNRACFRKTSOOYVI QC SAKTPKHJLFTTAFCH 
LFAW^FIAJMCRTYCQTLF1THSHTLQP<3NYI^ 

VQSTCFSHSONRMETOrTSLPESEGSWSNECXAGGXGLOLFF^ XFQNXVC 

MVYVSONSFmSStXaCP^ICJUIi^XIVEMtfVOCDXCSSYTrT^^ 

POmTXVKSPrJS^K»&5RWXX»»NNYVY^ 

VCTXLRF 

CPIU0452 511304 512860 

pKp 12 -Polymorphic Outer Membrane Protein (truncated) 

FNE£TKTIU*NrLTCSAIJ-LALPAAA<^^ 

rLTJITVIUSNVWuXEDACVFIKRSONl^FKjNRCNr^ 

LSKFSYIV^FTSAPIJJWGWYSLO^IEKSBEVT^^ 

SRTSVNI^GNRYL\TR£t^OGYmXST)INLTXTTRCPSCFDmHAY^ 

APOISISIS\nCSCCLXnon , AS003WrXMNSIHL0SCA0FlWlJUVSESC^ 

SESHKITDLVXKA > ILRL T Y EC TISFSClJCLPrJHEVCAENLTSTILOtTmAOCTIJLSO 

CVT1OLHSFK0EASSTLTMSPGTTLLCSCOARV0NLHILX EtTTWTrVPVRIRAIXJKOALV 

SLEKLKVAFEAYWSVYOF PQFX EAFT X PLIXLLCPSFDSLLXjCETTLERTWTWDAVR 

GFKSLSWErrPPSUraRRITPTKKTVFLTWPEXTSTP 

CPn-0453 513156 516152 

pmp_13 -Polymorphic Outer Membrane Protein 

NCVXXYLFFYSLSLICRirWFHLYVOMKTSXRKFLISTTLAPCFASTATrVEVIMPSDP- 

D3SSGKXFPYTTLSDPRCTLCI FSCrjLYIA^Of^ISRTSSSCFSNRAflALOILaCCCVT 

SFLNIRSSAO^ISSVrTCNPEirPLSFSCFSOMII^DNCESLTStTrSASWIPHASArr 

ATTPKLFTNNnS ILFOYHRSAOFOAAXRCTS XTIDn7«SLLF>O«aiSN00ALTQSAA 

INLIK^APVIFrrNATCIYOCAIYLTOOSIC,TSCNLSCVLFVWCSP^CG^ 

FS>WDLTF0I^A»PONSLPAPTPPPTPPAVTPLLCYCCAIFCTPPATPPPTOVSLTIS 

CENSVPFI*ENIASECOC^«WKKXSI0SKKSTIFLCKTACICCCAIAIPESCELSLSAM0G 

DILF7<KNLS ITSCTTTRNSIHFCKQAKFATLCATOGYTLYFYDPrTSDDLSAASAAATVV 

VNPKA3 AOCAY SCT ZVTSC ETLTAT EAATPAJ IATSTLN0 KLE LB30TLALRNCATLMVMN 

FTOOEKSWIMX3AJ7TLATTNCANNTOOA X TLfOXV XNLDSLTJCTKAAWNV0ST1CALT 

I SCTXiS LVKN50XV J3NKCMFNKDL00VP I LELKATSlflVTTTOFSLCTNCYOOSPTCYO 

CTWEFTIDTTTHT\TCNWICTCYLPHPERLAPLIPWSLWANVID^ 

CK0LSITCITNFFVA*OmXV«SYRHMCCCY L INTYTR XTPOAALSLCFOOLrrKSICDYL 

WCI«HSrArfFA7AT^XT1CSLFC3SRFTSCCTSirm , SRSNEI^ 

NCHLCELECNLP Xr-SSRILKLKOI X PFVICAEVAYATHGCIOENTPECR I FCHCHLLNVA 

vpvcvrfckn3hnrf0fyt i xvayapovyrmnpocdttlpincatwts icnnltrstllv 
ca2^3vnovle:fckcccoxrrtsrqytloxc3xxaf 

CPn.04 54 516179 SI9U'. 

pmp.14 •PolymciThi': Oiicwr Membrane Protein 

GM r-L3F K.1"wFCLL.\J LC.1 A>C A FAETRI/3Q lf"/P P ITT JOCEE I LLT:IT FVT : "9i FLCA^ F 
nsr.F ltrSS.1llT.lL.\K'A JLTFTCC0Af>m^r/ALI^AAEIXTTKNF5:: INPTt^lO^L 
•CL frjKO t VFOS tXCL I FTTNRVAY3PA5'/rT3ATPA ITTVTTr^VUVOlTttW.TVDJ I 
•.HZ I FFPCNl ANFvVA I ^J>.*FTAWKF I WIT ATMS F3HHFTSS0r7W I WWM .LFOtJS 

ir 1 1 ftan.'X'vw:-: iw/rr^YTr/ALC'rr.Ai':! PT<rrFEiJOftcnKiTF:^Nirrit<OAC 

A X YAET.*N I V^VALLUfcWAARNfJ AKVLNIOCRGP t EFflRNRA KM ( F IIP 
: lyravr'AKOTJJTLT ! * A: JR.IP I AFCF «NMl JfrFf! I RNA IT/EACJUE ( WLfiAV IKM.VFY 

Of mi.iLprmi'tfKKD i t i hm* > :: wrr.';r;LS.7rELLLPAinTr r lu rrvx i ato 

|J< lTUIAWfr/li*.FA7'/2kVI.TI/r:3 yrru:LATPTnAPAAVOFTl«TKLAPrjrr:TFLFRtJ 

i-vriAjr/riAi rrr»ATLTt wuA'iiiuuArn N.Yr jt/t.lo:;pva i p i avfk» »ATVTn\ sfi-l^e 
r ATr'rjr/rnrv;KW:t\Tw:;R ru. i pam > t \yy» psat itlyavwn: si»t lvr: n^' i i wt 

NY' ;K r*/T3« JLW I ;:FU 4 FMFlUJ l UJiA/UAtAli^LSi ITAKALCAYVH ITPlV a Itt ^7/ Hi 
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yrs IYVAAL. JC rrTDMTTLT. L. J F' JOLYCKTNAf J F C3E0MYLL3FP30PP tVTOKSCA 

w i.jwxAAY'r^sNHLfrrrYLRPOKAPKSOOOw^ 
woLjnr tnA^uoGWQSKFTrT^DLORSFSRCiccYW^ 

K t AYK PO I YRVNP* W ( '/TW^NO CSTS tSGANLRRHGLFVO I HOWDLTECTOAFLMYTF 
XKNCFTNHPV:mLK.7Tr 

■pn_0455 520 )*3 519458 

no ronu-c nmnlno present tn Genebenlc/EMBL J6 of 11/7/98 

• v "/* *-tsjt.w"" ?;!^:RMir:.*irrr.v.^:L:c«rj:r£.T»*Ti*i;f.".' 

• • : -:..:;A^;K?r-/:r::-^J i7^irnt^«^i-v:p»:r.*: ?>EKEAADAV 

AJU.-KWAT DDEHL WV^H I AY A£E I R EKQEOTHOCi LTTOL^U-Om'STEXNLAf 
AL3AV I KQSVW FRKP DLFAYER1AL£XSVT0ALVSVVSNLI)H X PYTSSOG XVXECSSIV 
RTSgBHTLXVtCAAFDlO^ItPt£PSDVlJ>ISGXDP^ 
KT 

CPrvJ)45ti 521568 520327 

No rooust homo log present in Genebenk/EMBL «■ of 11/7/98 

IPCTFESKRKFUmCDeWTSWWWrVOAT^ 

MCVraXSHCFERGVSHPCmDlAfllJCVEXIAGJraiSRIEW^ 

V>COYOENPYAOHASSE\njajaCGVHVS£LG^ 

rrXVSPEEODOATUANEVIOLYPICSrrTLYXLXDFOT 

'/CCUVUPQGAECeOTKHPCtJfAXYCKOSYLRrc^ 

SRYRSRl^LPINTEKDJntl,YXEXSRTHHQUfrLSMSLAA^ 

HCHSYLAI3LTHEEIJ(IIXFSAFVD\KNISraCELRi^ 

CPn.0457 523886 522120 

No rooust nonolog present in Genebenk/EMBL ss of 11/7/98 
VTLPSRVHASCLSAWFSrVREHFYRATOFSLPFCARITEFVLGVXm 
LVSRYt£SF\nTPTFVSEWSUJCTOCVAGR^ 
PVHPFtXIQPVEVLTLYPEVCDATlXnAFSKlRNRVROAY^ 

NQCTW1COATCI»<MVYHCEN^^ 



EGHYCHSY^IJLEITOLQmiJUknjW^ 
KSRSDranAAnArCIJSWVCTOWKEtPSFO 

CRESNLWNRIOICIJOUD/RJCOKOCAAI^DCSCXYVO^LTAOQNFACS 
CPTV_0458 526344 524236 

No robust nonolog present in Genebenk/EMBL es of 11/7/98 

YFLCCYLKLFVSNF IFFWKPI PYISSWSTVROKTVnCA^WSRPPCSRVTOrALCVtKA 
IPrvmiVMGMEV^VSSCVACirrRSSrTSDVVOrVlCTOCALa^ 
:TPENQDKVHGXFPVCPFGRIJtSE£TIJCLiTO 
IRTI5IVSSXLrrP0DPS0FVSIANCXQIUJ(PEAl.VCL 

LKNSCLOSRIOCKDSKSDORCS PENPEXJfXGYYSREQOHNIXXSQV IOOCtCXSRXtf XPWX 
HVTEDTKDFYYP PNFT5YSHTRQSTDPTSP PRLPESEGDXX&LY^LSRSYHHEYMLGLG 
LKPl3AGlAKDPDRIYAPLOTHYCMSYLAI)IIlJ^^ 
PNIARLPLEUJSLFFRLVAGQQEGRNIVTIJWG™ 

NX FSYKSRJWIVKEROFFCt^EXJKSFTLIlJ'njPI SAADFRCIj01AA£Gl<VAJfflLPSW 
DICASGCSCXOFSEKOSPOAXEY*OWEARVEDEACE£AR£JVIYM 
PSLDAVVRQAJWRFRSKGLLTHERXALGEEFLTA IFSYUJSQERNENH3KRTTEEXEW1 
SFZZLDRXVQVLPACVPADSGNDI>TRI>VPNPO5NP0550NIXSS 

CPn.0459 527062 526619 

No robust homo log present in Genebenk/EMBL as of 11/7/98 
STKXOMHPGUWWRTSTOXLftEEGSVSFRRFRA 
SOQOJlLFYVESQJUCREXlCVStZEYIOSMra 
SS DCOEOATATST A7GATA5 PTDHHEDE 

CPn.0460 527840 526992 

No robust nonolog present in Genebenk/EMBL as of 11/7/98 

VXOHLLNFAXXCTPSXSVOYOEOEXI^PCDHSPEXGiaat^^ 

YKLNLGXONSX£GWLLDPYRVOTLSSPYS^ 

TFRSVS INPGNSSFGORHSETLSRVLMOEKEkKVAVVCN^ 

LV5SSLDAAQfU^SCCKVLWENPNASMAOEX2247TKOVE^ 

CPn.0461 528647 527844 

no roousc homolog present in Genebank/EMBL es of 11/7/9B 

rsrVACPSISSWFTVVROHFVNAFOFTHPVCSRITWrALGI IKAI PVLCH IVJCIEWLIS 

^PWmmffiMFTSl^SAIICVECTRGKNCLAPLEAYI^ 

?r/DrrPTEtVQLLPOEELSTVDSAIjOGVRSRLTYAYRSVEKmiOOL^ 

LI NFVP.LAhlGVQNKYPHTKVKLYLAKNIACVWDCEX SE£EKGOLRALCLDPKX ES ISLTS 

AGLPSVPEVATVDFMITCYGKDOEVODP 

CPnJ>462 531124 529037 

No robust nonolog present in Genebenk/EMBL as of 11/7/98 

L! FYLFLNLY IACVRFHFOCWF DPMACYIS IWI S7VK0HF I RAFDFTRPLGSRITNFALC 
VIKAIPILOCWtGVSWLVSTCSARRPGXPAFTSOVASIVKIEKTRCYNPLAWVBOYUlO 
LRVRLP ECDLCK r HCKVSR DYVC DRT POENLNMVPKQ YLC ELCRATYG I RNR VTKAYQRV 
TPLEVPCLTL^FTHLOPEISVNFVRI^NGIQTOYPC^ 
O(B3QLRSLCU)AKIKCVSAPALLLQlCYL0S£N^ 

NLS5EH I PAISVTYRPOOPFYSYYPTPGSGGGTAPPORIP%ISEOE«LCTYTTLSNPI«XR 
yAVHLGMEDFASGVFLDPLRVSAPLSGgYSCPSYf .1 JX JT SEELRCFLLSAF XDPNHSGQG 
NPRPHSXNFGNSPLGORWSEFLSRVLKDETEXHVAVVr^ 
EXOT£YLNIVSVSOERTCVXERRILSSDPSGR£^ 

'/"SALDAADACASECK tLEYEDPOOEWAOOYASFYRNXORAGOLOROGXPGEPLGVSAST 
RWLEKO IVFNLNAV 1 0OAMNKFKKR0LFAVESOALGXX3MRRALECY IGSSLLVECTIOP 
'WAflMNr, FATLDEAVCAACDSAOOAPSEENNTOO 

'I-I.JJ4'.) * 12490 511 191 

tut robust homiloo proient in Onebank/EMDL utt at tl/7/M 
:.:::pYEKTa>txrrrpNCirrpRvNi:rrvr::p:oETaiArvo3Mi«or;v^ 

• HHr/QI 1 -I >^F::i.KEEU;FLFDEig^LCAPLSEDirr:MSYLVOLVCOllLKOL I LSMFLDPO 
m:\NXt.lXVM fM3n:F:*.PUWnFL:WUlDEr;riAWVFK(WLSLPAT0VCKLVEE 
I STAY irtlTlX M\ F:nir:D>*::iCI J-KRKOXCTSCRYFTV KALYU «DTOMR;*LQLASER I H 
V.*M«{Plrt.VriAYAAki:si XX (OKTteiRPrTTFSWIADFAWVOV^FNSREFKLlTWNOC 
I U-:>;i:U-Ll-.;KTI-VFX:FI JVPCOnVTOTRnFtPHLDAA tKOAVWT)IKHrcLlC*£CEAL0 

i yvj:w.:\ v::Yi.ir/vTN::HicKT:*.Krjf'PtgKE 1 i act." p lk ealptc^oEDv pst.1 EDPs 



CPn 04*4 "» M.M*- 

no rooutc nowiloa or^ncot tn %>n«»tMnc txn> ir. •'. ,„ : "?9 

UETWRFTCrC^^WSLKB^ECT^^ 

HR rAXVYICVLDSI3»rLEPLtSYHSCrT3E30NrLRPrKyCIS^ew 

[ rccfFjEDavpesep rcur : rvrmwscpuPTXKRs.tjwtLC^TLPEs rYPOSiru-H 

RPRML5 

CPA.04SS "227? *. 2 «7! 

.. ■ •- •. -r iv!' . - 

•s*^"". 1 "' 1 ":" ...... .. V :.. ., j hr .»>v.«.«K'. >.«■;*-«•.- -.rx:.?KFALrv: 

ALLi ii J^LWJU*LJJit IKXiKKt J^U. iVhV CJ »-.-.«tLiikJhVEOIUIlORLJUr 
RDECKVHCCLPSAPFF 

CPn.0466 533718 536537 

pmp_ 15 -Polymorphic Outer Henorene Protein 

TSMRWFCMtiPrrFVlJWTOWLPLETYXTUPEYOAAP^ 

OriXJrWRSItt^TCKNLLISIWIGNWEXMVCPf^^ 

TTNLVSONPTAT AGSLLGGALF A I NCS XTNNl^OCTFVDNLALMGSGALYTTrKLSIXXM 
KGPIIIKONIWiStSLOGSXYSaCL^ 



t IKlMCPVYTl^ATRGGALLNl^AGSGFCSFILSAOCD: :FWWrASKXALNTPYRN 

AIHSTPNMNLQIGARPGYRVU^PIEHELPSSFPIIJ^FEfV^lUlVLF^ 

DEIOTSYUWrSELROC^Vn»GLACY^ 

NNEDPYnSL^KSLtWaYrVWAAOKI 

TT fTWPTSLl^ANTIOOCLLYANWS PLGYRPH P ERRGEF ITNALHQS AYT » f A/TT H 8T ttW 
OEHOGHA AST QG IGZXVHQKOKNSFRCFRSHHTCYSATTEATSSOS PNFSLwFAQFFSKA 
KEHESQKSTSSHifYFSGMCIENTLFItWXRXSVSLAYMrrSEHTHTWYOGf T. TTatVXtXT 
KNMTLACAI^CmPOPHC£SLOrYPFITALAXRGNl>AF06»W 
U^IRASWIOWHRVPLVWLTEISYRmYTODPElJBICtXI 
IIC^NTMSVrPXVTLSUJYSAOXSSSTLSH^ 

CPIU0467 536528 539434 

pnp_16 -Polymorphic Outer Membrane P roc tin 

NtlLTISCONRJUKEPLVSKTTIWFYMIFTAOIFCKrPAVYSU^ 

ETJTfSFPUJSLSTLTGTSPrTTTVGNRKNSSOOI VLSNYXS X0NXLLLWTSAO2AV9CN 

AXAOCDmSOJOGTmVKNSVNNiroaJTJ^ 

RSEim'XSgmPrYnaOCGMNaUIQTSVTVAIWWSGS^ 

OGAirn«LSXDCNPGTXLFNNNYCXlUX»XC7^rLTXniSCHV^^ 

LODSTCLIJAEOGNlAfWNlVrLTrTGirrNAI HCTPSSNLOLGANRGYTTAFTDPI EKO 

HP17NX^XFNPNAie40GTXLnSAYXPEASX7YENf^XSSS10^ 

FYKFTOKGGILXLCHAASXAT/TAKSETPSTSVGSOyXXNNLAXNLPSXtM 

PLOSSAPrrEXIMPTrTLSGPLTLLKEENRIJPYDSICLSEPLONIHL^ 

DNFMPESUlATEHYGYOaiWSPYWVETITTniNAS X ETANTL YRALYAM/TPUTYYVMPE 

YOGOLATTPL^SFKmrSLLRSYraTCOSDIERPFLEIOaiAIXajn^^ 

XOSTCYSLOASSETSLHOXISLCFAQf 1 1 Rl KP GSSWVSAWfrVSSLYVlLfnTOPtf' 

ATSTVLAYGYCDHHLHSLH PSHOEO ABGTCYSHTLAAA IOCS FPWOOXSYLKLSPPVOAX 

AXR\5TOTATEEXGIKPRXrvSOKPFYKLTLPLGX0Cia40SKFKVPTtMTL^ 

ONraXGVTl^SGGSUSXLGMNY\mMALGYICVKNOT 

LQAGSTLXF 

CPIV.046B 539608 540432 

popul 7 -Polymorphic Outer Menbrane Protein 

IY10IJ)NKIJar«;KLYFHIKV»a|p»«piCl^ILSTAirCS 

SAmSPSFlUl<V^PEPLVSSPT(PSNUlCFGHI)XTOOmTGNSXNSVXO^^ 

XLAOQILFX SEMCGNLSFXRNSSHSSOGALYSVRirwiS KNONYSF I SNAASLATTTTSG 

FGGAXHAXCSYXTNNLGEGCFUVMSIQ^RGGAIYVGVSLSXTOI^ 

nSGCIFCRAVNIERKYONXOINDNASGOGVVYrLP 



541460 



Protein iFrane-shifc with 



CPh.0469 540399 
pmp.l 7 -Polymorphic Outer I 
0469) 

CffmraiFSALGVIXSSNKEXXEXSNXSASSINTASCKLYPGGGCXIC^ 

UFWXTAAI^GGAXKTRSFIFONNGPTAFXNl^TSGGALINLSGIGS^ 

GOXLFNNkR'XTSSSPCPGYRMALYAAPGXNLICXXSAROCYKXLFYDPIOKX^ 

YEPKHWTVLFSG INVDSNATNPLNFLSXFSNSSRLERGVLA X EDRAAISCRTLSOTQCX 

LRLGNAALXRTXGPG5SXNTNAXAIMLPSIL0SEASAPKFVIYPTLTGS7^SEXyTSSTXT 

LSGPLTFLNOENQfPYDSLDLSEPRJCOXPPPLPPRCOCKKKRYFESHCRSHELR 

CPA-0470 541357 542532 

pmp 17 -Polymorphic Outsr Membrane Protein (Frame-shift with 
0470) 

ISI^U^ISPLLYUJATAXXIDTSNLIVEAMNLDEKYGYQGIWSP^tl i 1 1 1S «TVP 
EOTNTNHRQ LYVXVTPVCYR PNPERHCEF IANTLWOSAY NALLC I R ILP PONUCZTOLEA 
SL^GLGLLl^QKNREGRKGFRNHTTGYAATrSAKTAARHSFSLCFAOMFSlCTREROSPST 
TSSHNYFACLRFDiLLFROFrSTGLSLCYSYGWWLCHmiLKCSSKAFPVN^ 
LDCTFLPAR ITRTLEL0PFI5AIALRCS0ASFQETCOHIRKFHPKHPLTDLS5PIGF11SC 
WKTSHH I PMLVHTEISYVITLYRKNPEMFTrTXISNGTWrTOATPVSYNSV 
LFSRVTLSU)YSACVSSS*rV-OQYLKAE3HCTF 

CPn.0471 542561 545401 

prep.m-Pvlymorpmc Outer Nemorsne Protein 
TVONNRSUiKSSFFV^ALlLSKTrrLLNATPLSOYFttOAWLTT^ 
HRATLPGVRX)XTrNQPIVU3HCMSXESWFENFSOOOGALSCKSLAITNTXfO 
KRAGAMYVNGNFOLSENHCSI I FSGNLSFI»NASNFAr7PC7GGA\a£SRM\n , Xfi10«OTAY 
F INNKARSSGGA ICAA 1 1 N t KDNTCPCLFFNNAAGGTAGGALPANACR X ENNSQPIYFLN 
NOSGLGCA I R WICEkT I LTKNTC.T/ X FNNNFAMEAD I SANHSSGCA IYCXSCSTKONPGXA 
AF»NTAAROC^lLTOnLT:0OCnPVYrrW*OCTV<X^IM 

NNRHFK DTf.TNl CVA"NCTR NV'.TLTVT^OQ I ATFYDP I LQRYT rONS I QK FNPNPEMLG 
T I LF3:JTY 1 rCTtrrSROOF If HFRiei ICLYNrTTLALEtpRAENin/YKFDOFOCTLRLCSRA 
VF.'ITmi'EO: i.' A\ XJV I N I KNUV I NLPH f it WR VAPK tM I R PTGnu'APYnSXMIP I INL 
:7tMj:iXPOEM/)nirrADl^'t'IAEVPU«YlJ£\n , AKHtNT^ 

vwriPYw r rr iTT^rT^nKtmvnjiRvi.YasvT wiYrvNpENxnnr au:apwqufmnlf 

ATLPYVtW V I Al TA.'F SKATR I ,VI\ Vjttl'MM lAKrtFHMEATRY.Tfi rm^TAtM HIPVTVN 

nnt*:atM& HrnvvA:urrrr/Mjj t nni wi uer p.ttjasi j\Y::Y:9a»t t *a:riy:xu< 
k I'l.YMLTVi u; i vtfAWErxFK i.f tywn i w jiYyt wwjhk inv: ;i.rs.:r.r,:*ii u »nr 

l«AHNA I AFKi 3<f JO f K C FI-KI.: VH4^Y»/ ZiVlU «JTPnr^LHACrTFKK 



i.1ti_W47.» 



'^1 ' M4 



'.4%'.HI 
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no n»m»:W imaolog pr^nnc in Oenet /OfflL is •>* 11/7/M 
JiSFCJEWEJO^^AASSPCFravWHa/DPAU^ 

jclgrlvctvror iv^pscapptosej^lyeuttju^suloclsoiosndoltpeocae 
atvt iqol i q i tefoccyheat03svslaear f kl» v f j3 dei nslcsc.t3pelqelhs0 
-dcujkuxctadoleaalsktri^fslookptpionnptlxsoeepiyee: 

TRO«STRU^IR&VLV31LSMXlSIl^IUWLRIARHAAA£AV^ 

tr!." ■ *!■::. r.rt-vvMST/Mirr." v-?! n*::vv;r :.-:rwrAPSr.:2!r/ 

rCWbVPtrJACPSKDXNIYKrPIUATPAXYOL^SRI^XS^PSJDfMUUjPfSUU 
VPU>PVP5PANSEBCSIYE0MSCASCACCJ0yEt^RSP3PRC0L0EPrYMI^ 

0 RNI W IL0E»SOCASASPVEPlYOUPWIHCRPPATLPRPDm,TNWUlVSPCrCPEV 
RAALLSESVSAVMVEAES rvppTEPCTCESCYLEPl^LVXTTKIIXOXCWRCISKA 

CPn_0473 54960: 546070 

No robust homolog present in Genebank/EXBL as oC 11/7/98 
SSXMAVCGVGGSRSPSPX PPNRJ^EDCKVSPW»LCEKrvSSSDSStASOCPTIED^ 
OLOCTMIPIJSVKEPCDSOTSGRSCVLORIWCGV^ 

GO RLPGLEEFRD RIQKRS ENPEADU3«WtSYSDGO U2RVCHDSNEDST EDSRSEZJCE PS 
SX5S5FL5GV1^VSXNAS3AI£OIKSlCrOR5ASCDOLrr0CEDSM 
SKSSSFIJGVRGATSTVOCALSOAKIICVSAFX^^ 
WNXAAKHUWAUJ<IJ3CVAPEQVSPEV^ 

VC50SVEYAS IVP0 DCSOA PA£TTA£APritXr/ECSAAOCSAWKAIJU)FWS I FQAVASFFR 

OSlJOU^SCNVTOPVXOOCLClARSrAPBCO 

CPn.0474 SS1600 549807 

CT365 hypothetical protein 

LK 1 1 IS 1 5 FMSTS PI SNDPR YUSt^NATECTSLLANSRSLS PVPNSLVPSNTEXJTGLRXS 

1 FTHSVTIJACXWLLVAVSVVVVAZ.TVUPCVPOAILI.CIA ISGVGIGGFS IHX5LVYH 
VRDYttSPWCESSRIKSALAVCTCFTVMC^VHKV^^ 

TTLASFSHY rVTKFFRSEKVAKCEXLTEAETIKEAKKLHYITLS IAT IGVCLAVLG I LLA 

XACTVLLOOAPATIAI ILAPPLXSXCXTTVLQTILHSSXGXURAFXXTOEiaCDLJV^ 

KOIIUJCn<PPSEVCCSrrS0SVXEVPOSECXAETRISA£ErDTIUJSLTT^ 

LLLAS X AAF IVTCFOC LTVMOVLLVASVC SAVASVTLPKVSSCT SYVAY0 LKARLN I SXL 

RWKEAKNJOCRVRQFLIESCVlASDREFtCMWKTVYTO 

SALVGGILLGWICI»DXALVPAFAPIVPGIXJUX»STLCIA^ 

LYEWWNRRELLYGPESKMRS lATXVVEALAASKXMTJtttlXH^WIIJVIWIlXWA 

CPn.0475 553850 551685 

glgB-Glucan Branching Entyme 
PSHVDKLIHPWDXJSLLVSOTQKDPHXLLCII^Ere 

AVAYRSCLFFLSVPKG XGKGOYRVYHOtCLLAKDPYAFP PLMGEXOSFLFKROTKYRIYE 
WWI»MEVOCISCMJVl>^HAOIWSVVCTPKFWHC^ 
EGIRYKMEIVTOSCHVIVKnJPYOtSFWPPOCTAR^ 
PVTIYtVHUKV^WOECRPLSYSEMAHRIJ^SYOTX^^ 

t ^*^SR*CTWEIOTFVI7n^{K£I'l IG I ILDWVPCHFPVDAFALASFDGEP LYEYTGHS 

OALKPHWVl'JTf UVSKHEVTTTLLCSALTWLO»<H XDCUtVOAVASMLYRCYCROXawr 

PNIYOKENLESIXOTJQIXJISVIHKErSCVLTFAEESTA 

OafflinTWfFMrDPHYWCYHOKDLT^SIJWATO 

WWAOWRVII^ICLlCKiajJWC^^ 

LNALYIHQPYIifliQESSGECFHWVDFHDIENJA/IAYYW 

SYVLRCZXSVKHCELLLNTX)DE5Ft9GSC^ 

FF 

CPn.0476 554877 553958 

CT865 hypothetical protein 

LOVQOTLKRFAVFOOLHI«5CXAVTSEJlYKYYI4JSCECro 

HKKADWQKVRCRR0UCEILPIJrfFTlFAAMAPKOSYRDIXrTAIC^ 

IAPArJ , S T *LAG FSEC FLPRSYDEZFOG ILFODCOPEOCVPFELLSYSFGKIOD X FLRHQ 

COLVEILPALPPEFPCCRI*IHVAIJNLCniSIVtmacriROVELIUW 

CSAIUJlEMSEJUtLSCSKRLSLCETLEXIUCTT^^ 

CPn_0477 556112 554844 

•ygsv^Bs Hypothetical Protein 

RYHTVAEVKCTTOVCUXHIVNOYEVW PADLC I INTCAVTA 

SJ^SGRHAVTOLCRONPTAHIVVTC^ 

YDTTFPEFK IHSFEXaCSRAFIKVODCCNSFCSYC 1 1 PYUIGRSVSRPAEKILAEIAGWD 
O^rWIACIhT^YCZXIERSI^LIESMJRIPCIERIRISSIDPDOITEDLHRAITSS 
RHTCPSSHLVLQSCSNS IUauemJCYSRCOFUXr^FRASDPRYAnTOVIVCFPCESD 

2S^i:^ E ^ rr ^ SrprMWW ^ l ^J e ^^»KXYLAEVAKRVC0K 
EMMKRLCETTEVLVEKVT^ 

CPn.047« 557640 556210 

htlX-GTP Binding Procein 

SSS^^^iiC^t^ 7 "- 1 ^ IFSSRALTACAN IOVOl-AOAR YXiPRLJOUJWGHUS RQK 
SOCGSOGn^GEW 

?:i^^i^ PL ^ I3 ^ KI0NLLSLKrEl IOEXSLHVTLNF PYTEYCKFTELC 
DAflWASSRYQEOFLVVEAYlJKEtOKKFRPFISWFPEtXrCDDECRGPVLESSFCO 

•:pnj>47«) 558434 557616 

pitnp -Met* I Dependent Hydro Uoe 

Si'iSSSiSSfSJSSI? i£?S WBC 1 wrc ^ Y WTCXKRLR5SVLIO*^flCTLVI 

™^?*™^&MtrriuimmEEFVi t ^tyv^ycksdivtcfr frnl 

l.( mit;UIf:i^ERDOIIPCVTFAYCXa4EVUm. 

•:iii„t)4if'i ■•VI175 1SN<5U 

•.Tim «iyrn»rh<?r pronein 

: :ij«h i :.>r; i : *:h w;ravj:icKBoa.-MU^;i .pcyp* »a< it i ( eeyknry fvcu lcagw-Ipyw 
i v i wnvy y^pnt ; t ujvi icckohkfv jlhvho p (tclwalcpvtucimi^lesamyelcs 
'.•vhniiji*.':; rvanrnsnu: i pacl t vtjvmveapl t atjujawv i pi: r lurvrjt i lclfa i l 

HAYUI* WflfcMXJILijllEY rWKCROtOAII«>A\T/m^PA^%\riVJ-tTKLPNCSRR 



CPn_h4Si 



ii L W"1 » ASAJ .. , _ — » — » 

No roouic nwlWpni»*fitHafc2x>nebuiwJ*taL .ii^ir* ft* "Cll3 
3CLR I EC ILiiATSmTSJTT^CEAMSSNER FTCRT5RMYYAALVLSALJSCL IFIAItrv: 
FPOVGX>MWl£FALCCLLLGLA I VFAVXLV^KTLEPSRLfcTPPEIVACKEVrTXrv 
LCNEYWRSELXJLTJCa-HE^l rVCSKDRSLT X DC.7LON 7 UCLETLJTTLJVLSXPCVH 
I N I X LMLVROWNLLCVSUPr/TAHA FFt If FLISX?YY3FS:UC.:ilYCIUL0ATSPLI< 

■ — - — v • . • • .*-.* ftrn»^.rr;r: r* •.-.t* 1 

r i>-.j| M .: . — ".%;rY^\. . — • . • ".'Wro-Ar 

VEJAJK-MLLWU-' iAi J J^VrAJ-IKnLi'LKV J./.VKVwt.Hy,' ik^lL^TEILENEJJT^ 
LY EYPLSYL Z CWA'.ILDCVRCTE I SLEXO AITtT*^lOCLCSKLS0 r ASRLOSGOJCV"LNPR 
DVl^tOAA\^VKCLAAOCV3 FOClJCAiifYLTAV^R>arfLGALPlJXSFPVTTWOtErLC 
ESLCO 

CPn.0482 561764 560961 

srtJ-Arginine Peripiasaic Binding Protsm 

w^yraottt<ikqicrffrafifimplsltscesk:c*^iwi\utnatypffeyvd*oc 

EVVOTIDUUCAISOO^KOLEVREFArDALILNlJCXHRI 
PYYCDIVOELICWSIWSLrrr/LPLTOYSSVAVCTCTFOEl^ 
WEVirrCXSPVAVlXPSVCRVVl^rpNLVATOZLPPECWVl>OCC^ 
QOAITOLKECVX3SLTOCVCL5EVAYE 

CPIU0483 561830 564964 

No robust homolog prevent in Genebank/OOL as ot 11/7/98 
X IUWRAI FWMrPXPPPICPPWIKNNFYHLTTXT^ 

UilHYYWOOnAnUCIiaJTPPTIJTCCPEPKT I EIAWPPlOXSnaoCIWPlCPCTPPPII) 

TPPPPPKAPSPASPKVPROPAOI0XPTPPPt>PPPPVnVATPMPLJ(P$$0GY)^^ 

MVLWUPLPLPA^40WPILCBFtlPHrVASYPNRimEP^ 

R1JU3L5LEDALYUBMYYLVNVPC3XSNCFYJIA 

tt^AMSPAI**NLCAlX*£lLOLCSTYCSF:a 

mXAA^SNECrAJULrXSXMSDOLLPSVWLAANRPYSELFONLtOH^ 
[jpLLUKLPALFLTtAELOXMS PX^XMUUCQYEREJ REATAJO^PJUAOSGWirrEltFNAX 
vnKLWIRCOYSRFXATXENIWSCDlJ^PALSFFA 

POX^rLtaaiiANXiJOwxsTSFmpx^poxxsm.v»nou.VAi^ 

ARSLBLTROBSSCX £Q yOPLL,>ffOXVPLL FT T LQHPQVWiB Pir^fnAmfTT jrvr j J .p>t 

YAFIOXVCNEJOXL010U^XLC5LVXJCPPARYPSTPSiraiCEm 

DOS^FUUTFPNYOLriXAIlXEXEIEST^ 

OOFSJCSrXXFCFLWYPlO^OXKTVLAAJUJDAFOR^ 

ATXN0YSRARX»UCNXaJOVTVTAStX7t3lSCmSLXCrnj4SLSSN^ 

EA|IX>VAAIfrrVPLOPFAVCLlKSDRDTVSEEN X ENFVAMHCFLNTI SPtRDARITLIJtrP 

NHYGCXXPRNPRTEDONSKPDSSNP 

CPQ.0484 564931 565824 

aroC*Deoxyheptonate Aldolase 
ftSELXrCOLKSLVLHZVULTFTYPLPRTL^ 
. TLCYCHTVSSALTVXX^SAOVFRCSXIUCPirrSPF^ 
TEVtXWI«](VEITAEHVDILRICAia«ig^PLW 
ttLASSPSCPGVILCEJCirrFEHSTRYTtltt^ 

LPLASACI^VGArJCLKIEVHAHPEKALCDAKCJO ITPEIl^ AWCHFCPSrSJUSHAIS 

CPB.0485 563993 566229 

CT382.1 hypothetical protein 

SSL tDLLYOLKOLPAPTNE 

CPQ.0486 56T799 566405 

hypothetical proline pernsase 

AOHRSLLKGNX FHLCCCVLYFHNrSLFLFFLI A IOC ICLYVCRJ^SKKVinRZSYFLACR 
5LKIFPLV4WTTXATQXCSCVI^jAAEEAF^YCYGGXLYPLCVALCLXF^ 
SLTTWSXFTVTK:SK)UJUCXAFLL5ACSLrFILVA0VXAL0RI^ 
VLASTTSTCCFRCVVJmjVXOACFXXIAVLVCCVSVWLS^ 

HIFMPMlJMLVE<^MV^RCVAASSPIWLOWAAVCACLVU^ PLFLC5LCAKAGLKA 
CCPI^IAYFCNTSl^VMAAAICVAILSTADSLMNAVSOL XAEEYPTLKAPYYKYLVL 
(^VAAPLVAXCFTNXVDVLXI^SI^VCCl^VPVCrYI^JU>K^^ 
LCYCWQ IVSLCltFCELLAWVCSLVAFSFVCFX QTWKNKVKTQT 

CPn.0487 569833 568112 

CT384 hypothetical protein 

IUm»ISLTYSSFRWASFRCYSLIFFCrCCSLFCSESLRYOLLI0DFAKVSEECICLLES 

KEYStiOAKLVLIUI^NSSFtJDWPRSFKKCOISYPELAHDRDVLEEPC 

SVTVMVSVLAICIJWOFRLVPLLMSCNDDSAI^ 

Nt^XHVRITAYOWAIXOIE£LLPFLRERADnu<\roWERREAWKAClXLSSOFtXIW 

AXDD XOQALFTCEVLRNGMLP ETTE I PTELLSVEHP EVQESLLLSALAWS HOLQNMKEFL 

SXVRHVMCT5 PFAXVRFO AAALLHLKCDPLOR DSLVECLRS POPLVCEAASAALCSLC X H 

GVPl^KEHLESLSSRXAAANI^ILLLVSREDIERACDVIARYLGNPEMCWAIEYnjrfD^ 

WNLJCOTFPLYSDMIKREIuRJCLXRLl^VARYSOAKAVTATFLSGOOAOGWSFFSCMFWE 

EC0VKT , 3EDLVT0ACFAAXLECAiA3LC0KKD0ASW 

SENLDAVPFLLOCCHHEAPf LRSAAACALF5XFK 

CPn.048R 570147 569767 

httA-HlT Fdmily Hydrolase 

RJCLPTCFAVNVTIlSIUJKimTKOIIOCLIXEr/FENENFlAIKDRFI^W 

PI PRFQO X PCDEH X LXAEAUK I VQELAAE FC I ADC YR W ZNNCAECCOAVFHLHX KLLCC 

RPLGAXA 

cpn.04y# r rii}\i vnwu: 

CT187 nyporhiic i.r.jt putntn 

R tVFAI ENYFnLVTMEDWIJ;R ,"/TMO rPRil t',THDC£FMADGVTACAI.LI XFDLVOOIKX 

YMK1MITLVIS.VDfXHW;RFK.:Kr;FO::F::DI I Y. lYNM>KEF.ETN;;DADr.a:AUIFTtOF 
U.-HU*KrFUYW<Vi-ki : J VnF,\MLTFXMt;i.YH,KI'LAK«NFFKIi Y IPKN PAAFVPFPJjrD 

i u<i t pini jx«PW •'/f'vrKPFJiww 'A At ;ke;."kvv; r mAVR:i iKf ;i s\ i rwnwir 

(.HiALKLTIOHfll 1 

t:Pt»„04'itl •. v/llii 

t.-PixV »«yr«»ri#. i ruir.Mii. 

:;t*juiw*vttmiyj mmkvxu u r/uit w i atamlelij:i* -«->vi .ki^aai » *hi . 
vh:;i«.-v inHNFTirnifrrstii -.i-p:rKi.wiF in JF. t tMORLVVFU'i j.m rr irywrr I f*\ 
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iswPCFPFucAKETDHrrsocvi^RYTPSAi^^ 

*ijxL^t::::^TNAYviA^ 

•;.ir: « •.^"*Mrvrrri:r/*v 

:Pn.04'it 574 W5 V/JJJ* 

TTJ89 hypothetical protein ncwvp 

Il^LYTVTITWTA/HSCYSWFCVU^FL^^ 
CHl^W**X.TGYVOS^^ 

PFXCAVEICEJWECITRSSAERPLLPKOCrX^IFCGf^^ 

CPnJ>492 574843 574804 

No robust homo loo present in Genebank/Bffll. as o£_l*/L 
t^SLIPPICTOWSQQTYICHLHVESACni^PUCIHWSSPTGrPPITfWU^ 

CPn.0493 575142 574855 

no robust hoaolog present m Genebenk/EMBl. as of 11/7/ 98 

RKOED^SflETPKCIIJOtKa^PICCXHVKWlCOS 
CPn 0494 575370 575146 

No robust homo loo present in Gwebank/EMBL as of 11/7/98^ 

VT«RVNW<WYiraWPSPEIXaOB**^^ 

SLS30CVKQXSEAKFK 

CPn.0495 575507 578793 

aepC-Aspartate Aminotransferase — 

RRXJQXNOKIIAIOICAG^^ 
EKPaSVIOl^IGtnTQPt^rrOAIXl^SO 

YENMSPttiFisnsMcroiiwsiroim^ 

WC€TCrXPnJNQ0SU3Il£I£VPNNinCTVLTTOX^ 
VSDPSLPKSXFEIPEAXYCAI EXMSFSKSLgTPOUAWapXZLTYBMN^ ^ 
IJATTTtCASXJJCEAGYYCiXDLXPTPPAX ^]f55*52^H2?^5SSM?iif^S»fr« 
WVEU»EGISC£EAJWTUiOYHttWPQ^ 
SLKETHVU 

CPnj049« 578751 577812 

CT391 hypothetical protein 

pr^Ywro^scKriLW&soYxrFfsi^ 

PI4«SKSCIETlJOft2<tPEIVVrJttEDSIVKWU^ 

VMSHX ETOKIVI YAAVPORESLTLPKWHWIYCVNDTLDDCVCT 

KPSfWPSDlOTIVmilAMIEViriSITSST^ 

ECTAMEIl^PIITDDTSUSIOACIACSVIJrTOMTO 

RJU IAOM^PTTTTlin)IIK«fiIKXJOeriBW 

CPnJ)497 578107 577820 

CT3B8 hypothetical protein _ 

ITORVVLDDSWIlXVm , PKWtI2«IW 
LPKRXJ^rrLIACETSRKKKFLIJNRVOOIirSLHXDV 

CPnJ>498 579082 578085 

No robust homolog present in Genebank/EMBL "of 11/7/98 

YCMJUUVPF»mj«ARlWALPA^^ 

PEAEWVEaiESSSAWVOTEXWFSSILVSICOVra 

DCAWFOSPr^ALRORUJFLVREWRCM^ 

PTEENWESOiJTVTPSSEEEJAKrWHOK^ 

OTlAl^FRSYOEAFFGRVLFVSSOmGIJJCRVGOFFKG^^ 

RXCLKTIABUCETNOCUUSTCCFG 

CPn_0499 580404 579205 

no robust homoloo present in cenebanJc/EKBL as of 11/7/98 

fJVYLLXFYFaJCSTMSSVNOSSGTWPEEVTSPESTra 

TOLSLPECVCTSSEETASNFRVDEIVAEVSSSRAVW 

TSroSEUa*TLPAWKSSTIUUXrK»GD^ 

SKLTOVHHXU^LSREDISIAFDNND^^ 

DSMLVOXK*Vl^PTVEEIJtTtj0(nTO 

CYtrovrwiwivRRifiiiPGLGCLPFw^ 

EMXVRAXSESIHPESPKCOCRNOFSRGDKQIJSDS EEETEL 

CPn_0500 580647 582382 

pro3-Proly 1 tRNA Synthetase 
OPHSMn-SOLr/KTSKNAliCSAAVl^NEUXXJ^CY^ 
:REELNAICCOELLU>LLrOWEtJWQHT^ 

CSFVAOWl^SKROLPLHLYOIATXFRDEIRPRFGLIRSREiUJ<^^ 
EKLRSAYSKIFORLCLAWt^XIXZaiaCCKaEErOVLCSIXSXTriCVSCSYCAWIEAA 
V.* I PPO*IAYOREPLPVEEVATPCITTIEAtANFPS IPLHK1UCTLWKLSYSMEE1CPXAI 
JMRf5OR0VNLVKVA5KLNA»IAtJ^0EEIERVlCTTC 
Mr/r:«>IAKOKHYVWNWt«Xt.PPOVCOFUJkEffi 

I. rTT'/Tr^Fr^lFODEHr^TCOCWCTYa rCVCRTLAACVEOUWJDRC tVWPKALAPFS I 

•rtAFtry37r\r«EUVETIYKEWiWEPUJOR0ERLCn(LK» 

C::^irFEIC:R»EKWSPE^FP7WCONi(LA 

i'It.JjMH SR2424 **ViSO 

UlCA imi Ti inr.ctifjr.iiin.il Rispicnom 

t LLTTOttFP I MUVT I VL\^AFXMt:KVSKftD;?K I tJD I LFATTELYLKTOC/P'^ISKTLK 
tt'.K'tf'L: .TAT I HNYP Aia-EAU JFLKKHinTTS ;R I ITDLALRJ IYVT* SJEBCPEAE X^AP ( 

rc«r i:*jU:;£:i<m ikw/^ateu^eild^ 

: UTEP *J I ►TOTLWL1 •KACDTL;; I KRIEKFLQNY I PFLPTNEELi;KKEEHL."JMwX.YNEV 
VWYI .TflYl W.KaXSflJP K ^UJCYEAFKOPBVlALrXr^EWRCML'.KUJI IOW1KC 

kataf I' ;kku :u i I * a*:»H« x-jv it i f*yyknr;ipu;al« i mr I nu •ykfawi.lklfan 

K U»^|,TV;JFYItFKU < :n'W«,T:^*:KL:a« , .P tLRTEYS-HKliLPJIKETI, 



TPKEMCE 

CPn.0501 



t»ASJI^VTWFIXTLJ , JTKRf 
KAMLIO£ltKETA£AYI£rTVTE^ 

LTWFKLAASLXI3ttK$PCIICAiW 

CPNB^/HPDEWArGAAIOCXT/UXEVKiyvUiXrVI PLSICXETlCCVKrrLVERM » * I P 
I)SiaPRAll(AXRITYKSOXPETtVmraXlNV»ttIJ©W 

CfJTCJOJA^AAA^W^^P^TK^Tr 11 ""crgnrPP^-^ *m IBEAPVtl IW 
DDK 

CPn.0504 ^ 5BB514 

S^PDWreEYP FD X F^ARXgJttAL.riGnHV XVSVLPY PRDCQXLKCTI SEVtARQHTLTCT 
CTaV?PWAlJkYTSMSCS0SLXPVllI4>CRT^ 

LEF KM ITHAKAOFOA IOATYWAEETPP EVI EEASLTSOCT TTWlHSnmJBUtr: 

D0WSHPWKXl*ia*ATl3I0^SDIRE£«JXIiV^ 

5cunFMlXW«V^mSH0GWLPF»SHEPW«^^ 

SSoYtXOTSACKPXJXWiSOFVRSJOff 

t to wptj J WPLgTPQTHLEI XVRACSTTElWSAXAaWFBiLiaCWFTJlK^ ^ 

lUYirTjSKKXSFVVTOara 

SVWiTQRIVWSIATrTEX»CP10XXRXTPiiu6swI»J^ 

CPn.0505 588471 589108 

•3-aethyUdenine 0NA glycosylate .._ 

WOOlJUQaWOrXPRJIVWtXFfT^EjWT 

SSScMAYNYR^ 

DQOXXaiCIORJU?WROICPPMLL7ICPCXVC0A^ 
ATARlGIDYAOEYRJ>VPWRFLLSPEX>SCKVLS 

CPh.0508 589055 589840 

CT421 hypothetical protein 
C^KPIMWtHCSFIlJW^ 

vnmtm.TOPnrex>sxaTEKFsrrTiwwaj^^ 

FKSIi^IYXaUIJWIPIIKIVYWaOOVKrTXroSK^ 

UTPttACPSSPLPOEtHOOOOS 

CPOJB507 589898 590122 

CT421.1 hyDothetical protein 
ST1»YPOFPl!^IK^IEIJ7fnWSTOARiUlAl^ 

NAtSTSDSZFlPXXC 

CPO.0508 590133 590300 

CT421.2 hypothetical protein 

SRIMSRHI^atSVKC^nTWNVUro^ 

CPn.0509 S90299 590808 

NXF^Y^IR^EXWrMVSN^ 

DlUdJlEIillflCVFAIJPSLTOrXTl^IWPCOPAYPHVl^jEAFI SPOAALRFLENTSPNOED 
IYEnSRYI.VKSIU{HLCYIXrreSEE}WXXRWENOrt^^ 

CPn.0510 590804 591973 

clyC-CBS Oonaint (Hemolysin homoloo* 
OLJ«UaLLAIFCIUJXAFGLTQPSCHCSSKFlJm^ 

TLLCI LYGALCTKLYTIXPPKTAHXDLLFWPLYSLSAX.I AYCFLPPWISTTVPICETTAHL 

RFl>SVFOLC^PLOLLITRRRPW0QVRSSTSFOS0LSEALSAlTJ^ 

ALPEETTWEALVLVSEIX^SRVPVnrKI04LWrroiLLVKD^^ 

KPPPYAPEIKKASSliiOEraOKMRHlAIIVNEYGFTECIATMEOIXEEI ICEXASOOVO 

ENTPYXKIGSSWrVDCRWISOAEEYnn-KIDHEHSYOT^ 

ENFOIEI ITCTERMVCKLKITPRKRKmiS 

CPn.0511 5^>2141 592488 

rsoV'Gigma Reoulatory Factor 

NSOIOKEEHCSTTIFHLHCKU5G ISSFEVQENI SQSLAACSKN I ILXAHLDYHSSACIR 
VLLQJ^fHOVCOMSCXrVLTTVPKTXEOTLYVTCFW 

CPn.0512 5-»2538- 594412 

CT425 hypothetical protein 

SLPLTMRRSVCYVNPS lARAGOISTWKFtYSLATPLPACTKCKrDLACSCKPTWEAPAT 
DL50TnWXYAENPECEIIEATAI PVKWJPVPOFEFTLPYELQrVCETXTIVMOARPWMPO 
VDOaiSjrykOLFAORRKPFYLYIDPTCE'W^ 

R FO rTVRFEDEFCNLTNF?rEETR X ELTY DILP.CILNWOLFI PETOFVX LPMLYPNEPCX 
YRXOUtNLSTOEI F If. AP I KCFAOSAPf fLMtfllLHGZS ERVCSEENIETCMRYFRDCRAi. 
NFYA'inSFQIOENUTrO (WKL INOTVnDPNEEDf FrTLCCFQYSCEPHLICVRHIlJfTICE 
TK SSI I: IKKKCflCI 1 1 rt AK LYK:7TVNI< DH t ( T: ' FTAS K EHTJFDFENFY PE FERWCX YMAW 
< IStTCTTAAU WPFr l& '.KCVEOfftt 7TV I RCLK niLRFCr/VJCLDORC I YK0YFOSPOV9 
Y:SMH.TAt |^7IKYT^FJUA•^•ALKAH»ll:YAT^;r•P IVLJFNITSAPie^SELSTOSKPOUIV 
NAM I :X JHVA^yTALI JCTVK I lMwry^lJrPFFra':itil^-rfrjOKVrLiWTIJCOrt>TKAPF 
VFYYI4«VT0AIJ»WHAH.':.:r WTAM 

» '-NM'. •.•••,7 r .i 

K«« .-i »»x i*lr.f i^Siil.iit** 

v i • 1 1 'i x* wkno i aakk r i ivtko: .tvi jyt n xr i vi to'jvi lOMTrci.tw pktsplys 

I KKKIJjAVFJ'I: :CBTjAI JIW44/IWEf/yin-|JWIiKAIX>VPKOPVCt7rVYYr^ 

• i ipt> :r F»r?;PYAKn an-*™ ,y::ii j i wf n rrr itf/m i vaK:FP!x^*i-uYYCTLF 
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TKtKPTDryHltlUI.TArCYAYU»U»iUJIR&'. JCOACX^IPtt^ILVDJlRW 
FLAPKRU:UDFLNIHiri4AiCLniH5NIT*LCYHjr^ 

F tLUPACENWLGXRLRICJOGQIA I PUCSLMAVAR IFLCKFSNKXALWYU: IEAALDL 
LiCCANDU^.TTU)CtX*/F 1 OKAS.'WEP IKKOAIEHAALl i PCLTNSSHV 

CPTV.0SH 575690 596520 

CT427 rrypor.net » 4 1 protein ■«»...»»•? 
•W,PHKrrROWCnF;L0PC tSLC-CY^Y INSrTULCLXKRNDtRC/t^PPABUJ^X 

•- : »v ■ • ■•*.• ' *.: : ':.r-?..\\wr?m:." . * ■•.•■at:..- p.- 
:.. 'v:-:phi^ -•• — rr.:.. . rr «.--:■:* ^LLiJnAA:.;-: * : ^7i^ 

ASGWYOLTXLPFVFALLLJiJTSWCWPlJTa^^ 
LLC EY/lf ALCOYJIUSEEKYESFWfMVVCTLYOOARL 

CPIU0515 596450 597181 

ubiE-Ubiquincne Methyl transferase 

□QflTKALKNSGNXME PSTNKPDCXK X FDS lASXYDRTOT XLSIGMHKFW^UQ k LGS 

CVSLLDU^CTGKVAKRYIAAMPOASVTLVDrSSA^ 

LEJft^Pi^AAItAYGLIWLSBPHlUUdEXSRVli^ 

RAWPWICJXSVSICDPWYSYX^KSIQOLPKWO 

LEKQ 

CPn_05l6 598904 597255 

No robust homo log prtscnc in GeneoanJc/EMBL as of Xl'Ji.'JL 
RI S ISFRVSVFVK 1 1 LAVLCRAIAXAYYVCKVARCLCOF PTLVPNSRIJIGPFTVPQMTS 
GAKGJCEFAXRNFSI I3CIJ50IlJa£II^RPFAU>OCSVKStJiTOCPAICIRSW 
VSOISPAiCRLCSTLVOWAPIUaEEOLVWI^CTOCR^ 
Wn^I PAASrrVtSS WAPGfTTrOTMPAASCTTOT 

PS FSLRYALWQNVPYPEP PKTPEVKFTT3ESXSIJ LiATRARRKELIJLYNCTlfLADYILSK 
DEIOWtVPOLPEWWRTWRWSERL WT F R T KJlECI ^IflinCEUSWILARg^ATQTO 
ARXKVFNSLVAWU^FW^CTAKPLPTO^ 
LFKCLRVLEPGIEGWYDHPTXJAGEIR^^ 

ELVELLESLVASCEIH3FFESSCEECAFI IONEPSKTAKLKORFKSCVRTiaVGSFADES 
LPRCRFTILV 

CPn_05l7 599637 598795 

No robust homo log present in GenebanX/EHBL m of 11/7/98 
FIMSSUSCXTO CI^IWTCSLICWUWSOW^ 

I PSttAtATSrrVMCLILFVHSLLCXJVAI ISYLTWTWSVTONXRAFElHKPARSVYYt 
GVPJWDLGRSSLGTGEXPIVRTXJSPFTJNHGLNHAI^ 

CLIP4)FllPKVSSLCFVIDC0CSSLRTICE3tn' ICEAFRSX3YDAHFAKVDCYRLIKSKLI I Z 
KMCUCNIDIIPSVMVRIlWSRPGIITmiaiRMYGCW^ 

CPn.0518 600806 599832 

CT429 hypothetical protein 
PMMrYTVPONPIXUUUOifDAFSXSOOERDF^ 

EPimttCU PKLTTYXVKKIMrrrrNEKIYDIDTICEXfTJI LQSKHAREOFL ETIYDHE 
A£LEXhTOFYVEP3UMZEHLPJOfl(PHF^FEEXXU}FTKI^^ 
AROU^NKAKmSNEAU*PRPia«»PPlCOSAJCVET^ 
mPSSXTFSEKFOTEEEFlJtf^LWMW 

CPTU0519 601707 600904 

dapF-Diaminopimelate Epiaerese 

QPTKLRI LVYWMAFYS PSTI SKYTIYSGAGNRFU^ETLPEVEDVPJT^ETSVDCFLYL 
KPSSCADAQLI I FNSOCSRPTKDDJOJICAIAHLASOKCKSIJI SVSTDSCLYSGYFYSWD 

RVLViKriAr*mASvwuiSRPOTU'M^ 

HQTTSPWMJVMFVOILCHCOLRVP^TERCW 
IKTTOEUfnrSONBflRVYLOGSVTRDL 

CPh.0520 602233 601646 

clpP-CLP Protease 
QUfYFMAIXSEVKnJttl IEXEU£AJUlvm 
*VINSPGCSVDAGFAVWrjQIKMLTSPVTTVVTGX^ 

HQP5ICCPXTG0ATCCDXKAREILKTXARX I0VYVEATN0PROI tEXAXDRDHUKTANEA 
KDFGLLOGILFSFNDL 

CPIX.0S21 603803 602241 

glyA -Serine Mydroxymethyl trans f erase 

KSLLKV^EKFIOCFAmiFTKWAWSLLHKFLENASCiaCCOSlACT 

PS ICERI IDEUCSORSHLKMIASENYSSLSVOUVMCr^TDryCECSPFKRJTSCCD^ 

AIEWECVTTAXEIJ'AADCACVOPHSCADAMXAVHAILTHKVOCPAVSKLCYXT^NELTE 

EXTmiJCAEMSSCVCLGPSLNSGGHLTHGWT^^ 

RUUCEYKPKVLIAGYSSYSRRUJFAVLKQtAEXrcSVTJWVra 

:{>YADI V. ] !'l H rTL^GPRGGLVl^TRZ^ESTIJIKACPL>0<^PLPKVI AAXTVAIJ<E1^ 
SVDFKXYAHQVVNNARRLAERFLSHGIJUXTCCT^^ 
CIAVNRNSLPSDAIGKWOTSCIPUirrPALTTL^ 
CSSKKNKCELPEAIAOEAflORVRNLLLRFPLYPEIDLEALV 

CPn_0S22 603825 604655 

CT433 hypothetic j 1 protein 

REPUPEKTSlAFKVKNVNORMXKKNOCKlUUfYFOY I PLKV0KLROPSFYPKRLKTLYLC 
Lf 1QKTARJCYQAHYLP I LTLFPY AKSTPONKRALOFLPQATHV t LTSFSSTHLFL5RKTSL 
LSKATUCTKTYLC ICESTKERLLSFbQQVKYWATC E XAEC X F PLLOALPSSARX LYPH5 
L.TARPV I REFLYNRFTFFSYPKYTVKPRKLKXNXLSr^KKI I FT5PSTVRAFAK I FPRF P 
£TTrt«OCRMTLOEFCK T SSQKQVSLLETLGKS RT3 P 

CP*J052J .504720 605052 

fir* rooucr. homo loo prnsenc in CenebanK/EMBL aa of U/7/eR 
hT4A^ATWFtXrrAFULFPPATRPRYNFKLALFVT I A IALVW t ALt ATTI AICLC IHPLC 
.'.F I FLTA t r LYF ISR V I OSHYARNVY I ALDWrDI t.'.T.LQDMRCMSF I FCDR 

•XnJfiM ft««imrt tiilf.l7«* 

IV. nit*i::r | H «»tK«i pn;6#«nr. in Ometvuik/KMBL .a: ot 

r/s"/FWWM: v.r : ivrrr. tvur.v prokki apkf.oft r ak < :.f lai la;;lalcalvao 

: . :lt i V! / wi*vh ai j . iitai :wtplwi icwrr.y/r^NWOK'yt £«.<nfkpu :tc/ifc«racN 

'/!.« r/flll WjPrtJWILlJl VKICV'A tyTOA; VPTOFTFLT^LRV I KKNQl.Tt J 1 1 FNPVnPTNL 
I UITATTK. rf 1 1 .YlTTI JC1 *:^A^^OKErr:PAKOF//PF.*:rT^VVKLPrl£ALOaTFNL 
IU IZ'JiVrr:: I IJTVI * HIVuS JI^^PXLPIAAfKYYRwALl^YttK.IJCAAIEJaiAA I VALTLF 
Ytr/TNVM'RKI I l*KF> rTF\*WHNi/IVAtVRI'AI4XU I VITAI>YI VRSLLVI LQDPFNTIE 



•J724J 



SimLL^KDRWPIEWRVC^ 
VKAOENSTAKFRRRRAAV 

...... fit, M 

■••t^» • ' " - :, n i: :- «•*»•!• • 

yFNL I LlUtV/CVU^JUJJ ^KKNKHr SVH I JT3V\:l x C: U>UjKtAV3FFFOAF0PXEAM 

OLiSxLCHSCWVFFSGVr^^ 

VC^KSSETQEXJ^DTVimLItt 

pnSTTCTXrSLAiCl^ 

HLOarvSFSt«fTSAYQC0r/CIVrWFRU43IFT^CXa^R5l^^ 
M^ITE^XAXAL0LKES88PVAVLPVUX<X3N^^ 

CPTU0527 609910 608726 

lucfi-Oi hydro UpoMide Succinyltransierase 

RYKIFEFRFPKIGETSSGGS rVJOflJOILCOHVARDEPLI EVSTDKIATELPSPKACRLVR 
n-.ttniEr~^~r " ™ w * wimrmrg PTgrmrgr^cg gccvwrSPAVLSL 

j^iacicaiwiLOKiACTwoawrTOiWYisxsoovsire 

ASSLSXSSCEVPKASLVVtMJVrTtaiOWSCERQRniJT^ 
FPtXNCSLDC^XVHICKSVIIVCVAVNUncn^^ 

unriijpsEvoocsvrwwFtamjA^ 

KWTLTFDHRVt^IYGSETLTSrJWRl^^ 

CPIU0528 611165 609921 

altT-Glutanate Synport 
UflruMirXCIJVCVTLSLVUX^XFm 

Oi«LC3lIC IICSAOJfI^AIAIVICI£FAWirePGNaOFAWSW)SAVW 
AAYTWIIAOVTPSNPVRSFAIX»tWIIIFAIFI£ 
rjUMiYIItSFAPVSVSAAIWISaWCI^^ 
CXKSTSXrWSIOCWSCAVSTASSSATUVrMF^ 

CTAIFOQUAVFIAO^YNCPLST SST II J.VVTATF SAVCSA CVPQa aCTU tfVI^PrSL 
PiqCrAlLACIiyPXRPXVCTPHSILCPAVVATWASr^ 

CPIU0529 612298 611165 

recraRA FXi LTHIU W 

PJSHSTPY7UiRSTVXSVCaiTVNAZSACJCTPTVX>ILA^ 
iQ.TVVDSXVKSASYVCDOLUtAnaj'ECSVWVKra 

!Ehigw^vvnqcopx«^fpic 

SSSS^NYTOTMAWOC^ 
OTLSLLNMIEQtHnSGN 

CPh.0530 613323 612460 ^ 

■DoU~rRNA Hechylase 

ICAlJWCYljCOHVrCSTHlOTrBCEFLYJ^^ 
OKXVi«IKIDFU0RJ0WPFYXXXEOVIXPWWAXLRX 
NVVRSSlJCAVFSIJ»IMISTtISaaLFEOECfcrTVrm 
EJ03aTE»^EtlFSEIAXJMl«SXWl2aAT8VMV 

CPIU0531 614198 613245 

SAN dependent aethyltranst erase 

GPVTLXJtfSSXAVWPKSWEXMSr^OLOYVRECERCAMX^^ 

RTPPGKU7i^EM)C^WPAIJCOAIEXHKER0VT/Il^ATfTCAGS IFAAXCGARVTHVDASQ 

AAVIBIAORNVEXNAFP83UlXFWVIEXnaSFtJCX£XRRlJ^^ 

KXOKDLFPLLSLCSKLIJtfHlASYTLLTSHTP 

CGEGVCALPSGSFVrjl«A 

CPn.0532 614716 614075 

ribC/risA-Rlboflavin Synthase 

ESFCCKDSVVlCWrjOIFSGX IOEUCEVCFFEAOGNGLSLC X KSTPIXVTPLVTCDSVAVDG 
VCLTLTSCNESKIFF^VTPETt^ACTTrjGEXRCStJOVNlXAAIJOCOS 
A£IFLIKEI«YYFI^ICEl^OYlJEXCFUIDGXSL^VSVDSI7rTSW 
GKKRESERVN X EX OMSTKlOVtnVKR I LASSCKD 

CPn.0533 614918 615385 

CT406 hypothetical protein 

EVAPHOCPFCNHGELKVX DSRNAPEANA I KRRPXCLKCSORFTTFTTVELTLOVLXRIX5R 

YENTOES^XHCUMASSHT1lXGOrX}VHAlASNVKSELLCK0N^ 

KADNXAY IRFACVYRRFKrA'CELKEVLLSAT PDMEX 

CPn.0534 611389 615784 

dksA-DnaK Suppressor 

LNFXRSKWPLSOOEXEOFKKRLI<DOtAKLSHTlXCNAOEVKKPNEATGYSOHOAX)OGTO 
TFX)RTISL£\nTKEYElXROINRAI*EKINESSYGICOVSGEEIPLARLIAIPYATMTVKA 
OEOFEXGLLSCN 

CPn.0535 615763 616296 

LspA- Lipoprotein Signal Peptidase 
KOTPIWKLSSHATRFRSTlXVXTWLIOWVrTKLWr^/ICDLO 
F3 tAPVFN£CAAFG^rSWY^FLFLL^^ 

CPti_O r «:ii tflolOO <176«)1 

'J.igA"l* Al.m'.ly i^MtPMa** 

YR:;ictri j» YFFmT^ixaLL3VFODFFw.rryAF i l r r/LGvzr-jutoiwvmrsQ 

rt: KLFP YY:\>NroraETKOi:VHPIJCW 

N 1 1 FC-: I VKYSEWI*: I KFRKLWOCV*/On r ?PH'^FLIKAFICTPVVriV tVA I U/. tYlTVF.t 
■pjmi (TOTtUMk "WNI-nCVYrMLGLLFLVFYA [PflGLOP ICK ICC I VLPFPHC4«Yl*ALEL 
7 T I.VKKKI rTLfl IU-rrv^;:UKr:fX>CAL^FAGCTVATT I KQC I SRAAY* 3i I' J lOFftl t 

c o::r^:;AFDP;nvAv^; r vc i a t chl i r .TL.*XLKVLA3 r isw3 lcltnasc^vei rrLAJ yf 

IftWCFFIJTKWVnmTt |:^PI.Vra'r^AKFLYCffTCAFIYTLY r JtJ*ILI , l.^r'U!0>rr 
ALL IMf^r: > :AU4a* I -Nl Xl A*F t U'K £*/ 1 FPAPAASLTETC U7T E 



r:Pii_ft'. t / 
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.TVM.; nvrmr.nut i»M ; or«ii.*ui 

l { fllpndnv:^ iiLtrrrr.-; i-xrr i fvht c:fuwlnx iucnthrvtt tlnfeajc i 

LAPL«U;KKLU*;WUtKRKNR<;CLJEDtDEU^K^^^ 
F»NKD 

CPn.fl'jJB il« 129 6185 I i 

::T9H nypricmHtoii Dramn 
TKFnJ*V\OHV^.-pCKrr/TKTraRDIVMFRNW^ 

■ i n.- 

CPnJ)539 618678 S2154S 

pap_ 10 -polymorphic mesorane protein 

TTYSUUCIffrVCOTAGHSlffl<PGAATI/<I^^ 

E5NVTriQSLHSLVI£NNESW0CVLTTSCaa5r rNWTWUCWW ISYGPOSALUjCTCSK 
ALTfWMdCTZLTLIOnUWIOOttHWIfGQAVSSXSPCSPt T 

AIYNDQGAITrO WrOn SI F SWKASrt^VYSWfCWLYSQWCDTLfTIOaAAKVOGXrM 

ADYVH IRIXKGS IVFEENSATACGAl AVNAVTP XHAOGPVWF XKNSAEGUCGAIYIOAT 

^SXIAUtANOMIEFCGNirVRSOFHSHIN^^ 

/DP X I^TCWN5LYirmOMJXACCA\arSCyUUJPEHKICSN»ICT 

LSIEOCUILAVRSFYOBCX^JU£PCSia.TTQ^^ 

£ RA7EXA5 XE XSSVPRVYCKTESPYENHCYASKPYTTS I ILSAJCKLVrAPSRPERDXQKL 

t :A£SEY>CYt7fOGSWErswspw7TXE7^ 

SGLNIASNIVNNNYLNNSEVI PL0HlXNmx;PVVDXKB0NPI(O5SNNIJ.VQHACKNVGAA 

IPFSTVTIIJAALTtt^SSSCONVADKSHAOILICT^ 

'^OtHVTPYKCTSRCSWRKYCWSCSVCHSYAYTICCIRYU^ 

YDPRYPsssEimiLSLPicija«RricsRssuxovsTsrxiaaJuwNPOSSAa 

YTWDIOCVPLGKEALNITLNSTXKYK IVTAYH3 1 SST0RBCSNX-SANAKAGLSLS7 

CPn.0540 €21631 626862 

pmp_20 -polymorphic membrane protein 

FIKLIYI^UEFVNISrarSSKKWLPATAVrAAVLPAI.TAFCOPASVEI STSHTGSCDPT 
SDAALTCF PQS3 1 CI DC I T VT XVroiTTSTrrtlIPV^VTPDANDSS5WSSKOCSSSSaA 
TSLtKSSMXSDFDrrKDS^LYKLrFPSASimXPlLLttSSSGGSSSSSSSSSSGSh 
3AVVMPPKQQUtfTSIgANGTLTrrTDSC»PCSL 
KNLTTIQfESOKSCGAAYTESAI.TrQAJVgAVTPTCWrS 

XFEKNTSGQAGGG lYTESTLTI SNXTK5 1 ETISNXASVPAPAPEPTSPAPSStJNSTnD 
TSTLOTRAASAT PAVAPVAAVrPTPISTOETAOCGAI YAKOC I S I STFKDLTFKSIiSAS 
VDATLTVUSST XGESGGAX FAADS 10 lOOCTCTTLTSQ/T ANXSOCC lYAVGQVTX BIX A 
.VUttlNOTOCCEa^YTOCALTIN^ 

FSTOOCTCNYSAP ITKAASKTAPVVSSSTrAASPAVPAAAAAPVTNAAXDaALYSTECLTV 

sGXTsiLsraMECorqoQGAvvricrTOSDSHW 

TCmF0ENSSEXHOGCH£lASGKSLTWSl£^^ 
TFTPNEPAPVQOPVYGEALVTtOTATKSSGStYTniAArSNX^ 
TOXAACXTOCSriYXTWNrnWATGNBKXAaaCA^ 
EDAULE»OTBSVS0>ffATESG0SIYMa)IQL0MJ CSff 

CXYSSGAVTLTNI SU1 FUl l UW SVXtffATSQ^IQOOSmTTSl^IWOQIITILrSNW 
SAATKKTSTTTO I AOGAXTSAAVT I ENNSOP X I FLNNS AKSEATTAATAGMtDSCOOA XA 
ANSVrLTHNPEITFXCNYAXTOGAICC X DLTNGS PPWCVSIACWSVLT00KSALNROOA 
IYGETIDXSRTGATFXarSSKHrcSAXCCSTALTIAPKSOLIFEKNX^ETTATrKASXK 
NLCAAIYGNNETSaVT XSLSAENGS imUNLCTATWWCS IAOlVKFTAIEAfiACKAIS 
FYXAVNOTlCTrrttQEIJajiElCATS^ 

LSVVSTTOSPCTTITKTOSVlJNMKEAfiCIAIWJVX XDFStXVPTXDNATVAPFTLXL 

VSRTNAXJSKDKIDITCTVTtXDPNGNLYONSYLCEDRD ITUKIDKSASGAVTATNVTLO 

(SNLGAlOCGYIXnVNLDPNSSGSM^ 

KOCILCJtOJiNARraPAmaPVASAXCSFlJIKEVSW 

FILQAAFSQVTCHAESEYHLDOTKHKGSGOT 

GYMQHDTTTYYPStE£>»MANWDSIAWIJ^^ 

OEKFTELDYDPRSFSACSYCNLAI PTGFSVDGALAWRE 1 1 LYNKVS AAYLPV X LRNNPKA 

TYEVI^TKEKCWVNVLPTRNAARAIVS^ 

RFVF 

CPn.0541 627137 628003 

Solute binding protein < -yebL-Synechocystii Adhesln Haoolog) 
NNRSSYOTAFVMHKVI VF IFLTLYSLKSYGNDVX DKPHVLVS I APYKFLVEO IAEETCFV 
YAIVTNHYDPHTYEIJPQO XKELRQOTUTOIGEAFE^ 
K PCCNOHTTNYtmnv/WPMCRVOVET 

SILTXTSKAKORHILVSHCAFGYFCRC^YNFSOHTXEKSSKVEPSPKITVARVFP^XEOyKX 
^SVXLLEYSGRRSSAMIAIJRFHMHTVNLDPYAENV^ 

CPn.0542 628000 629737 

abc Transporter ATPase 

FMTIRXIAEGLAFRYCSICGPNX IHOVSFSVYOGDFICI ICPNOOCKSTtTKLILCLLTPT 
FCSLKTFPSHSAGKCTHSMI<»IVP<*FSYOPCFPISVKDV^ 

EAVDHALOUXItJOHHKHCFAHLSCOO IQRVIXARALASYPEILI LDEFTTNIDPDNQQR 

lL5XLKKUiRTCTILM\nT<[X^irrTmTNKVFYMM^ 

FSC5PH 

l!Pn_054J 62B710 ti2?603 

i Metal Transporc Protein I 
KUGIFMl^5LIRDSFPLLXtXPTFlJ*ALCA5V^^ 

CCLTLWIOYKUiLSFFPMYCAIVCAIFtALCICKXHLXYOEREDSLIAMIWSVWArCII 
F [ SRLPTFNCELXNFLFGNILWVTPSOLYS 1X3 IFOLLVLC IWLCHTRFLALCFOERYTA 
U^HCfT^UttTLLLVLTArTIVMLIYVHI^ 

LNt LC5F5C IC I AYCLDF PVCPT ISLLMCLCXYTASLCVKKRYNPSTPSPVSPEXHTW 

-Tn.U',44 niflVM r2')W 

ylit^-'HT' tiimliiu protein 

K..:W I K^FFCLKKOKNV IMP/TV tTLEUWCKfX;NUVVAWRKEKYLP>?3CPYOCN 
• i 1 EATTnVY: FttAYRN t PFLKArC/«JGATTlNRTnR.'?3KDL I VSVPTCTLLRD 

Ahm IE 1 1 JIDFTVmKLLV: V * W nKLWFfTCT^mAPT1CATFra(PCErROVELIlJCl. 
J AI>U a,V» JFI -HAi K:?TLhTm-MrTEVKVwAYPFTTIJ\PSU:LVU.*KDRLYOKPWI IAOI P 
: : I HMjHKt IV :l JTKI -RIM a<TLLLLFV rriV;UCRKPiJSPECTLCTLIHEUICHOPDFtK 

kiw.vai icuij Ji«ystj w:FURKFP*:YTtVLi:xi.Ti :o jvu&yrpftorlav 

•||,_u\4', •• UMI1H ».»U». It 

i I*!/ i 1 1 * k si wa 1 1 |n»ir«jiii 

tkahwhummikp* v :i<t» »nc;rj:KRur/KW w »r^:.T* t : 1 1 .vfcyw rrpvffiPAO«vn 

*NI 4/Tt «FAI «VI I * I V-VMKICrMlfrY I \ 7A! VVXA. 



CPn 0S4« 

ri:T-L2i Pi 



EXL1 



«]219B 



CPn_05<- 

tALKJLK?N&K ; JKVA iT I cL' R WFWKUJALROK iAsIVWLrrTchlTAI 
FCCCDCVOCFCr-r/KEYCO 

CPn.0548 «3234 632191 

cy«J -Sulfite Reductase 

IWYXXJEXFKAOOVPLVLREIXSCSDS XND5DP IYRMVTDSNDTT I S*/TCVCDALCVLPIXS 
KEVSlHVMU^SPmVNVICICTSEXVSAOKFXCCYVtX^ 
YDAJOBWWXPttLTJtfSVFPIAPRi nfStASS^ 
CSSrTTSia^VNI^YIr^PTlOWTLSTaTIXaPtVHICASTCIAPYK^^ 

rcwLLmoucscvNrmOTM^ 

KAYEECCF FFVCCRKVLC X EVKHAIXEX LCJCDTLAS UUCEHRYWDVY 

CPn.0549 633662 633255 

rslO-510 Ribosaael Protein 

POrTVCHOPWNOHSLUmJCKFXJOUiA^KOCKK^ Rt RUCCFQQCCLSKSTAOXVI 

TAWWCARVVCPXPLPTWlEVrrVlJlSPH^ 

KHLALPAGVDIKIJEAA 

CPtv.0550 635668 633580 

cusA-Elongation Factor C 

UIYGINNirjlC5N0EFX}LSAIIWlCXMAHXOAClCr^^ 

DWKAOEOTRGITrrSAATIVFWLCAKXNl I WfWVDFTXEVERSLRVLOCAVAVFOAVS 

CVEPOS ETVWROAZKYCVP RI AFVKRIOVCAXDYFAAVESMKEXLjCANAFPVKCPXOSXS 

r^VOCVIX,XSOKAJLYFU3XmXUmEiniSESLirXRCAELRAN^ 

KRVLEDPCS XTEXXX KOVKRKCVX OnrjNPVl^OOTAFKinQSVQQXXWIVICHX^ PURC 

NXitCIinJCmtXSUPRIWPLAALAnaKrOPYVCRXT^ 

DiaClKISRIIJlOtA NEir f ORXit rrVCDXGACVClJff XEFPOP 
VIOtAIEPKSKCO R EXL A QALSSLS ELU/1 1 RVSTWEETOgT I XSGK3LKLDXLMMCX 

REncvEA w ca w jvst wjriTVSGNSrricrvro 

KXVOCVX PKTYI PAVI ICS X EXSXJnxn/UCYCLVXJVnCVS XVTCSYHEVDSSEIIAFKXOSS 

KAVIDACW(AKPVILEPWKVAVrrPEIWLCr^ 

PLSlMFCYTTSI^LTSCRATSTMEPAFTAXVPOKIOEErVKX 

CPIU0S51 636174 635698 

rs7-S7 Rltesonal Protein 
MYlGIUUISAEJrjUUPODPrKXSVXIArXN^ 

VLEt»tJEAlJ24AXPI UWISRRVOC^TY^ XKHAXSKFCXfla 
VdATEL IOCFWCQQAT t IOQIEI7THRMAXANXAF AHYKW 

CPO.0552 636698 636219 

rel2-S!2 Ribosonel Protein 
XOACYVPSSSOaPLPTiauaiYXSMLVVVRlJCIlEEW 
WKCroKRCVCLCTVTITWIOCFNSAXJUCV 
OGS^LPCVRYHmCTIJCAAVlCMlWSRSRYCAKRPK 

CPn.bSS3 637753 636812 

No robust hoaolog present in Genabenk/EMBL as of 11/7/96 

GCJ4WRVVlJimiriLWVrPIJUtfESFSWOT 

WICTISSTWIAXmilXrYKEFCrpEASISriaUCEPLNI^SHL^ 

EJBWOCBCPAXjeOPICXUJUA/LRCro 

ILYO)SIEXFLIC£TiaU(NNKTLVDLCDSO\^^ 

XPOUWATOIXSKTWlXFlYTT^IHXIEOGItESSFTYNOOLTEPI 



CPn_0554 637606 638141 

CT440 hypothetical protein 

VFSYLLLCI ILVYVRFMYEGJCS RMASPTPQQLHIjOOKVESKAYITY S RSLAHI AT ALLTF t 
VXLIL^CI^l^POVFLPTSOAYTIICSFI^IAJXIIlXINCVrKJCOYLTSS 

CPn.0555 636298 640241 

tsp-Te i 1 -Specific Protease 

MFVMXXLVRljC^aXSLi^NVU^SDLLREIS 

SYIQSFDPHKS^'I^QEVAVFl«SPrriCjaUXRirtXACNFAXYRJ^ 

RNEMVKNPKE1.VLEA5SY0X SKO PHOBIC S IXEVVORORJUiLSYLSLHLAOASSSRYEC 

KEGQLAALC LRQ XENHENVYLS XNOMGVAMOROEEAYQFN XRWKA LAHSXDAWTAYF6K 

D£AIJU4RI0LEKCK?CICVVLKEDia?AnffiEIIPOCPAAXSCDL0U^ 

HLSFRCVLDC-RCCHCSTWLD X HRCESDKTI ALRREK I tXEORRVDVSYEPYCDOVICK 

VH'UfSFYECENC^SEOOLRRAXOC^ElCNUjCLVUXREtmSFl^^ 

WWSRYAOCTMKCYRWSPICXPYIEPUULVSICSSA^ 

TYCKCT XOHCT ITCOA500 tXFKVTVCKYYS PSCKST0U0CVKS D XL X PSLYAEDRLCER 

FLEMPl^AOrCONVUIOPUTDUJTOTRP^KYYLPNl^ 

SENSW0AFL50 IKSaaCTDLSYCSHDLOLEES INILKCMILLQQCRK 

CPnJ)55< O40'i2l 640325 

crpA*l5tcOa Cynt«in«'Rich Protein 
ENGMSSNl^PVCCTCTtiVAAPESVl^IVEEIAA^S^ 

KFrOPXPESKLFOCRATCrTIXVl^XLLWACLACMFIFKSOtXWUFWLIIPAAra^ 
L LVTS L/.TDEAJTS EK LMV FQKWAGVLEXXJLDDQ I LNNT.NK I Ft3 fVKTttimT.RATTPVL 
NTXlRCTPVU'-rLVnK t ARV 

CPlU>S r .'» «.4.W) «.4 M'M 

rjftctt ♦•-<*kr* i -V"' «Mih» •*tir;h (^f* 

E I PR JKL I RKWTVl JVl.TXlHArTCPAiy I HAAVAESLITK I VA^AKTKPAPVl WAKKVR 
I.VRRNKWVEOK.ria W\hVT4CKr/lCEfTW VWFAWEHCYnRLYr^KVNDI* HVF.ICOi? 

vpwrATVi: :r\T t l 1 1 a u jkkw.'/ov/ iTOt.r*:EAEFv^: WETTrr.n* sclvwk i dmi, 
' im 3)»arK rr*WKiM jcu > >taatvt tat prur kyt w T5o pa ic I rob ;rr« ^ rua; w 

t£W»m^U I ARNVTVlWrVt-f/ r/.'3IAflrXH<VI ::f-T<LCOHPr«»»:KVFTVRW:PO(«RiJ 

ViiNVM*m'i > « a a iki :; :Awvrry7!iEt« .vjvti ViOWriYVCicpVfY;: f; im.vlu 

OW KH /TLJV • VTVI ti.V.I'.'* .1 tX*/AVH I K l>» .'P/ am-OFK LWKA^VI* JHIT»e//AV 
TflEiM t .T: V ACTTTt MKf :| ^ATI &* .VI JTWDP I CM lOHVYR I I'VTNRf » lAKTimV:! 

I.I iJttWFXCF IA;L« utk. rr r: Firrr/VKftMJttWttKfeVEFSVTI jo; l aii uakjea i 



98 



WO00279 94 rfSle :/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\WO002799 4.cpc] 



Page 101 of 330 



WO 00/27994 



l. ^ :orr.T: ti v::rrrENTwvy 

om»:A-'>kaj 'Vnt'^ne-PiTh Lipoprotein 

KLKXKAVL I AAM FCT^/^IJ*2CCRI VDCC FEDPCAPSSCNPCEV I RKKER5C3GNACGSY 
V PSCSNFOS3TESN303 PQVKQCTSPDGRCRQ 

•r n _r»«;';i . « <TW 44V»27 

.;i:;i-.T:*::iK;it a»".: v :.. ;; :;r.:»K. v.: :::ir;Trtt.:v;;,Ai/;r<r: .'.tnkam..'LL' 
FrLAWVfTrpcwmL 

CPn_OS60 MS666 644098 

qltx-Glutarayl-tRMA Synthetase 

RNSRFOGMXSLWSKttCRI»ea*ENVRVRVAPSPT^ 

RXCOrORTRSROOrEENZPSAUacSZOWDGSPtMXPYCPYKQSC^ 

TDCAYKC rATPO ELAEMAAVXSTI£YRQGYDRRYRYI«S PEEVASREAAGOPYTIRLJCVPL 

SGECVrEDYSKCRWTFWATJVTOXT/l.VKSDCrPTYVirANVI DDKUCITKVLRGEEWLSS 

TPKMUXYEAnraFPVnJMPUXNPISTra 

HGYSMECDEEVYSLER 1 1 ETFNPRRIC3CSGAVTOI0KIiWKhnO{Yl^CCSPECLlJCli0 
CWlXNDEFrLXXLPUttSRXrrUttFINLTSrFF?^^ 

SYVXYUXTOO^CTr^SKWljyO WWVHHKXAI I PIXYVAXTCKXQGLPLFDS Itl I 
CKPRAJUuUVYAEKLUSOTICXlJUl^^ 

CPO.0S61 646407 643871 

euo-CKLPS Euo protein 
l*ACECHIGCTrELEERtXIEDZJCDOTTKWVSZ7 

TIWSZSZKDLEEYKPJfftYSRXXSLYQCCLVFtMSKCGrS INOVAOXlfiXPVOKVYYATRT 
CTIRGtWCCAAWWHVSEIERYIWEYUKOAAIOaJ^ 

CPn_0562 6480S1 646918 

•CHLPS 43 kDa protein hoaolog.1 

NYIH^IKS IAXARGQYAAILAOiPKPSIAHFSS EOAHTSWCKROAHPYLYRXXCI XWCWX 
FLLGLX FFI PLGLFWVLQKICONF IT rr.AflTWIF7U»ICRnSM.r JQAYAARLFSA5FQDH 

vssviucva^OYOCVFzreLELtuj^^Pinim 

ESOSiaUFIIYPGVMKSOGNZTIWNVVKSVOXC^ 
QAEAi£KEIADGSDSVRWTVVKDP*ykRSTCAV^ 

LHCPEIJI YGKDSQCNLr JSCL f l Ua TCf AAPnJPWttJSCSCXXIPVAQTGLRKDHIL 
SDWZKEVACHZORKPON 

CPn.0563 650113 648393 

recJ'BBCNA Exonueleass 

CrrmXWDrSPKCICCIKFKmsrKASMGLLHAHPro 

:SRCPOTZOCIHKFLYSKLSSLYDPCtPUMSKA\nRZXIJUU^^ 

CT/AIiVmJlDIDVHVSY7Tt£AIU<OKCrrSTLZAJCIJCEIC 

DITP4»IWIITIWHMPraaPHCVATLNPKlJWrYPNR^ 

SRNLVPKSOCSUOCLLDLVTtrTITDVCr^J^ 

CVfXSEVTSTOIVlJUAPKUiSUailXOPAXCVEIJLLTOro 

XEAeVTODVQEILKSNPEILICOAAIVI^STA^ 

ZGKSSAATXGSrPLZCTUCKCSSUJ£YTCHD^ 

KCim^KLEIDAYASPDAIDYOUASMEIJtPPQOa^PZP^ 

ia.YXSQXERNUCVATCLSWUDMJCASW^ S 

SEPRPSD 

CPn_0S64 654359 65014$ 

secusecF-Protein Export Proteins secD/secF (fusion) 
SGAJOXOKVKWAZXXCVFAIALYYVLPTCLYYAKPLD^ 

RJCDVIPRVSAI l^SWLRGHIQOHPAIPOIVSVRFXRCEDAEDFIGNLVHCEPNVPIKSA 
RLHWCYS REHDEKVIOVASS EMT5LVESDFSPVSYS SENEQEKASS XLORVYSACTFPX 
OKOCSCSYPSZWETAFKEQI^AJg^SGFEreSSPXSA F C ra 
SLSNDAMOVEOOXUJCSVYrr^OTACIRSLDCWIECLRU^^ 
RKIFWLHSDUJWRTSI^ICEORWDSIUAVIXQ^^ 

T^GXIXLQGERIXOGZAZHLTALTLHRPAAESCC&X PENFPVFCROPRESEAFOCYIFSP 

NTDCXXFSKGSVYI UJCGLRSIVAKYOQflnCTFT QSFEXBLQNLYKC FS HTEA I SVTIGZ 

00VLElW4PLQ0PLOWCBCrvZGKS0CAFl£VKDX0OIUATVN0IE^ 

YIWAJOCSMDWERLSAPIPYONUT^ENKXI/WRKF^W 

HQCKOLTDKEDZIJCVSDEIXAW^KLCWEXXXRJlEnDYlK^ 

HSFHWNERPSSYSASRYEVORFUJYtJWFTSQAQGCT 

H EAITKLKS ECLAFS PSCCETPSTOLrTTT FSM I A IGKOXEQKANPLV IVFRNYALDCASL 

KDIRPEFAACECYVl^SVKirrSPKXHAEiajPTESFHTWrSAYCCECISCT 

RGWRIWV^DGYWVSSPIUA^LXNHASVS^^ 

TZSSDLCXXQCTQCZ rSACCCIJ^MLIVlJtWYYRF«^ASCAVmCXLIWAALOYU3A 
PLTLSC^AGIVX^HGMAVOANVlA/FERXRESFtXSOSL^^ 

7LASAIXFFL0TCP ZKOFALTLI U3 X FSSKFTALFMTKPTTMLWOCCTOHrrOLWOtNXFV 
GIKHDFLRGCKXLWAVSCSVFLLCCVA^ 

CMRCKVVHKMEACXSSROFRIOTFCSSEKIKZYFSOICAl^YT^ 
AWU^TOLDFSTETl^ETONFWSKVSSIO^KKMP.YOATICLLC^I ILLYVSLRFEN 
OYAFSAVCALXHDUATCAVlTlAHFFLKKXOXOLOAXCAiXIVLCYSUJ^rLIXFDRXR 
EOROANLFTPMHVLVMOAWJCrFSRTVKTTATTLSVLUlLLF IOGSSVFNFAF XMTXCXL 
LCTLS3LV I AP PLLLFHVRKENRSK 

CPnJ>565 655741 654533 

CT44v hypothetical protein 

f(KUrFLIFCFVNISAILFDSSFlXKIKR^KR>OJlSMKFPRISISDLIPTO«vrwWRCC 
ONVHYVPNAONLPKKI LCCVLACPULALU3CAAFAACVCQT Z F PCXGUf 1 LCLVLLC PAY. 
(0YS1ff»aRFERPLPTlETKVFEXPINWLKl^LI3SW^ 

C t VTK I FOKKCDRNTS X FL ( OEKDOI ALRQG X EKS ELSRKTFA XDPSW5 SLLSE IOREE 
VT^U>PICVISW^JEOOAJDRTMPK3AIVVNI3DAACEPOCRCYIDAYTXAFFTVLOOICD 
I ft IVKWIT l YVLTP I l£VPOALPKTOEWJ(LLSOAAFLYSAWARRKREEKODS IRIK 
F I FTOPTtlPT.ILY FilPlfl ISfTPHSVT P XSLSGFVC EQE3YTFA 

•:i*iJ)S»i«. *SnfW«» 

ay ivrrAijriwvoLU ArNNAc:rTP::um^ 

A I YY» MVKVlJtl U JIAV 1 1*C I KVLTLYTF!7rENTY3P PKEE t OE I FN I FYTOLDKOL PYLM 

hjiKn:u« 'undixue* : ictk iwivraofr^FSPXELvi jvvKYnriKDELVftAFKKUivo 

1 1 MV. I AitWUlKa .! ^^YLC^rriTOPOLLIftTT^EMRVlMFLlM)! AYTELY ITDTLM 
J 1 iFTI VI «U-'KA I HVYOOlfcfRW > ac 

.Inji'.'./ ».S«M*i4 t.57Nl7 
•-IrJi ii*k:Hmi yr njy ly i r.iric tor 
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PVRXST lCAL0L5C\flTFj ZXTVCl 8 1 WlKLHVLtlinr effiBCEPYEGKWBAfflnU 0*5?* 
DIFOTFFClCAF*IRT(XAtCl3Pro^^ 

LiPLtsijajc rrcFTCD t : c :fxk»ku«snk^ 

TOSKIPXC 

rpn 054* ^57»0« «R4M 

EEPPFS FTFATG0PLE5FFTCH LiTS CTTUEVANAAi; U-JC LPEVRAFHJOLORRYAOL 

QgCVfXOtMCTCVrVNAOlJC: FIT^ 

AQRAHOPLVI PENG IVIOSjCLT I ROVLEX I LALLFRNEL 

CPn.0569 656398 659099 

plsC-Clycerol-3-P Acyltransterese 

LFtyWXTSSCWSrriSKRAMlFRICXFFTWVAFSUTKU^^ 

NHNSrUP IAUOCVHEC: YKLARASLFKX PWl>flQ0**XFWRODZ»4SAAFKIAJRLrw 

KRJOO.VrrPICARSPt«OLOPOCV«ICMlAAKSRVPIZPW 

t i tl v fu i p w y o dz zohpez wncrrroz nwgnttocz aexjcawesoccdvp 

CPnJ>570 «5»044 660789 

arcS-Arginyl cRNA Trans terese 
TXLPSSKHGWRCA«TSPnJtSTLLSZL^ 

HYQQilVkKKIJUWLiqCAPRAIAEAI VAELPQEPFSL I EZAGAS FINFTF3 PVTLHDQLtW 
n0»L8i477V!S0Pt(XZZZ0FSSPl<ZAXZ>OfVQKIJtCT 

IG0HOTAPCMIJTYME1IPCX3TSI>L£ZILTS LYXXAYV^FTKDEEFTXRSOONWALOAXO 

POAXAZVEX XCrrSEXAXOXIYDILD ZVVTOGISFYNPFLPEI Z EXUXMCLLTVSNDA 

KCVTHEAFSIPFWOXSICGYWATTIUAAKRYRIEEIKADK X X XVTDLDQSLHFOLLEA 

TAIAACYX^PCIFSHVgFCLVLOI'OCXKZJCrRSCZIWK^^ 

LTDWQEMPVIGZNAXKYSXa^SHRTSOYVFSFmzjtFXWA 

RJUfllSOLSlXGPPEIQEPAEEZIALTLUlFPI^ 

FfTOSW0OSPYAJCSRX^U^UAIQVLATa«^^ 

CPIU0571 663179 660749 

murA-ODP-N-Acetylolucosaaxne Transferase 

TFEJCVNW FSDFDAXCERRJC r^qvrrT^txrrr^Kr.M.rfii.A'Trr r v act J AT^iTTt . 

RNVPDZCDVSLTVeirjeSLSAKVSWDXCTEVl^ 

AUi28CPGCmVTTWCDAXGtimimiFECXJCOI^ 

t^PSVWENLZXJUlZKAJQCarmiQWAXXAEILDtVlJ^ 

TCCLCUV0HTXLPO»EAA5na4AAVVSGCI^FVRM^ 

SCIETFXJERPLVatP^XrOVHPGFLTtJWOOPFAVU^OAOC^^ 

L0HWAEC13LFH0CX^TlUtfnrrAI^FP*SA^ 

lAIPOSSXXEimaJJRCYTWWajlSLCA^ 

CPn-0572 662349 664616 

CT4S6 hypothetical protein 

xKAAPiwQpsi i ronqiOT nrn ro sLswsvrrrsscAAAOTsoTvn^ajwaMot 

IA£OTKAVSFSAEHSFSTXPPETCSWTAOSAOSAGLFSIJCRT()RJU3SEISSSSr^ 

SXSRTSSNASSCETSPJk£SSPDUS2IX)SLSGSI3UaSAiS 

ASZlJffWPAVQOIWKaWFVYVDEARSSriFVWtOT 

KissscTTPSAwwoAKvrropivrxwacxjQcii^ 

CPOTTVGATWPWNWUXIXVDU^ 

rrSTCS05TXEEZ7nOFtX)Pa}GEDONAIPCTmTPPPGPPPN15SSR^ 

LOMV10HLIBAYC«iaBV8«Js3PlW^^ 

VTEa^IRNlIOP^TOSTOOSmATPTPOPT 

TPOARTACTSTTSXGTG7XSTSTTS7CT<nGSVST0STCVOTPTn IHSTCTSAl'l TTSS 

astqtpoaplpsgtrhvatxslvrnaacrs zvloqocrsosfpxppsctgtonmsaqcma 
aasqvastcjoovvnoaatagsopssrrssptsprrjc 

CPn.0573 665413 664691 

yebC faaily 

VEIWAGKSKHAfmCHRICERADHKra 

EM«PNlNIERHXJCKATSAEOimrErWELYCKaCTCI rvEAKTCNXNRTASDKRIAIN 

JWOSSLVBZJCSVLYNFAIWCACTVAKSSIDEEVIFSYAXEACAEDIJTO 

SEXASVKEXLISOGATCSEDRl.IYLPLRLVXXn3EXDGEAiaJ^ 

S 

CPn.0574. 665978 665394 

No robust ho&oloa present in Genebank/EMBL as of 11/7/98 

SAEJWFRKPIVMVrrVUttroRYl^KYLYRVFRFPCR^ 

KIYDWEIYEEU^LFCaU-RUJICWFCRKATRXCKSVVl^ 

EZ PRFFMEYLVYKEMVMSWTREYSLSSRS I F KGKXFKEYEQRFPL YDRAVAWEKAHAYL 
LRCYKXRVOOCYCRA 

CPn_0575 pp6524 <65962 

YhhY-Ainino Croup Acetyl Transferase 

SIFCRVWR3FKTAOC0fTOILCliIRYTLPSDATYMLKWLNDPKILRCFPI0TEAEIRET 
VMFWWCFYR YHS jLTAVYM3NVACVATLVl>IFYVKVSHH ALI S I IVCEEFRWXCICTALL 
NNLIKLAJCTRFKLEVLYLEVYECNPALJ4LY0RFCFVEA^R0ffRFYKOEXCYUUnTMEXO 
L 

CPn.0576 re7^43 <6»i494 

prtB*Peptide Chain Release Factor 2 (natural UCA craiM-sAift I 
HOENUMCRIXALRTEZSLAARSL. 

CPn_057fi.l rp7S'^H 

prtB*(n.iturul mca t:am^>shitr. > 

H&ENlDKRtX/\LRTE tSLAAR::L 

CPnJI577 U^ffiL . rnw. •AHIS'i 
JW(0 (YH74) cut8i)U».x pror.«Mii 

EniWIO'KltlAFMirVN [ ;rrM.AV I II-MPRTE f VFKVWKVI KKlU*\KWMIlR«lL 
MiANI>KVKti:-.;tp|- tw^vwir au;ki I ( VK 

y.«<rl -Phr<i^«h»tnyi(tnl.n:.; 

TTttiFIVLIli I : • LATLr I LAF: tWA; IF J Kl t Ml .PTTA I r«Rt.PKW(AllU k IUi I Ay UXUt 
HiKtlVPEXFLNKV::K^(|iMF^;N#l,(VF Y 77l/IJ/;i<ARLFiiKERLPTFL/rr( < EAriiVFAU< 

rtatuYA.*Y(:norrnii>:m-ifEKF::iMM'^Ai lAW/xn^tt^HYORu.TivKninu. 
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ULLKNTTLTUJWTTHVtPWrLNTVCLCOLF'ARw. -«»EQAFWYDPSL?CLLLSHWP°G 

NRCLttJLKR rFFCSPPEICYrrCSYO 

CPft.W.i «i693l0 $69991 

yQOp/y.vat-.'iuodr Nucleotide pnospnorylase 

KEFASAPLT-IT 1AT~MVPMTggTf .TT f /yTTnffyPft^glf TPlCfTyt-Pt>y?TtT.*./TJ<StJCrLSS 

-.r'rs:. .n :&v>k: i-jki ..-..LVKHKiiro::' :. "tet.lt. v. 

CPn_0S80 6699)6 670793 

truA-Pseudouridylete Synthase I 

ASSWLPWISNDCPSPPHTICVALLIAYOT^ 
RTPLIASCRTnAttflUYCOVAHFIttPIWPLFA^ 
FHAAYl^IAXTniYSLSRIJUCPl^RHrCYTPRW 

HGRDYKSTVRT XYTLD IVDKSD5L5 1 XCW^n.t70<VRNLVCAIl^VCKCAYPPOttiO 
I WKKRRECPSAAPAYCI^UOnCYSSPYMNFCCGQCSVSTSNEG 

CPn_058l 6715)3 670745 

Phospnoglycoiate Phosphatase 
ECXJWRSVKSFLROCWIYSJttVSDEFOLCtRSGMrXJXy^^ 

FLQACAEFSL^VHWDFSTYYSHTTLGTEI FSKXF IECYPQAOEYHAXIFAXRL0 XYYK5L 
EHAGPALHPGVEAf I CLVLSLNKTTCWTKS PRX3ATHTXJn7<YP I12JKTI*FWT11ENYAR 
PKPYCDSYDYAYRTFARBOOCVICr EBSVKGLRALSXI PATLVCXMSHAEITPIDYPSJC 

GxerrsYpsruvtTEKCsooxLL 

CPnJ)S82 671305 672177 

CT465 hypothetical procein 

IWPKALUOUQHRLTOOTOKNinyLTlO^ 
SNrVUaCLfllSTVSlATSCSFSXNSRTCr^ 

XSFTOEV1TAYSFYCRGQGHSIITPECVLYI£DCIJ{HSXT1CZ£FRYXHPRLIEVVRIXM 
DKPXVS 1 1 EAFCCPKH FHFLEASG I SLSQLKLOGTAAT FALDPPLPKDCLLAT IRJCLYKX 
NSDPSLSNT IVTEATLT^IPnJlLTQQOlflSHrsrrVEIUMLOWyEAf <SS A 

CPn.0583 672349 672717 

CT466 hypothec leal procein 
ivi^pLamcvTwueJBrrixi^^ 

IFLSCIDRVDI0I1CEFRHAMSELPODIOEEIXEIRI^IRXLDTDIOW^ 
ERP 

CPn_05B4 672659 673798 

atoS/ncr8*2-Componanc sensor 

IRXKATMHRRRH1<LVF)WPOSKMLHPPAtXLX£XJ^ 

SETOHrtrC WSQAREIXCIDDg^L^PTiyVLI^rCLCFStOa 

CXESXEXSVOSXIUQttXSGYUTQXROXSt^^ 

NPl^IVSFASItjaCEISSPIUIORMLSSIISCTRSXJOttVSSKIZYTlW 

OFTSSLIPU^FIVCKFVREaAOPlJRSIDPOWWSVVWNLV^ 

TSGDXSVTNPGTX PSEXHDXIiFTP FFTTOfiSNGLCLAEAOK X XRLHOSO IQLKTSDSAV 

SITIIIPttlAALPKIRAAS 

CPa_0S8S 675880 673865 

•sinilaricy co Cps XncA_2 

ISLSmXAPWirSXCKSSNMATPAOKSPTFODPSrVRIX^NHP^ 
IARVQQCGWHTIVXVSLI IIAI^ILCCCXLVCLLPAVPMF-ICTCLIALCAVI FALALI 
LCLY?S0GlJ>CXlJ>PVTtPOQI0IEDLR«gTRE^ 
CEXUjQfLDiUaARmrLYItSTAKXJQDEERYm^ 

ocwrywEttncraRjJOctisujoaTVQOLRSRiro 



LQEDLRLTGXAi 

I^OKIlANLKKA^VWEEajCKOOQSJYIOTOEIR- _ _ 

QQRYSRLQEEKOVKEX ILEBSHfflFAPLFtXAQKgMAYlOqcr API mAAAPTt XGEPDO 
WVLTOSASLSOIOCXRELVEEWELLKAXAFICSKELTl 
BClJAXJWOtAOAXKXjCEAAORXCCOUESI ,T.TF 
EVERLEQEOFQG 

CPn_0586 675993 67718) 

atoc / nc rC-2 -Ccaponent Regulator 
KWWPSRGENHAlRMLVVDDEPLIJtDFLSEX^TSOGFXFOT^ 
LVISOKSMPDGSGLOLIKX IKQSSPKTPVLVVTAYGSIEHAVEAKHQGAFNYLTKPFSSE 
f^ ^^^^^^^^^^Q^^^^F^XASSKAWroi^XAJUCAASSSANXFXH 
GESOCCKEVWrriHHHSPIUNHPYIKVNCAAXPETU^EtJa«XCArro 
FElAHKCTUiDEXTEVPVNLQAKIXRAIOE^ 

I DDKSFRQDLYY RLNV X PLHI^PLRDRQODILPLAOTFIJBCFOIWHTPLXTLS PKAOEL 
LLNYPWPGN IRELSNVLERWXLENTSLLTEOKLAUA 



CPn_0587 



677378 678124 



•yvyD.Bs conserved hypothetical protein 
SYCEIJILSTIAIMHVTWDKKRPKROT 

I LEKSDH PHET I R VVLTSHKDKLCTEVKWASHCKEX LQTKVKNANPYTA V XNA FXK I 
RTMAN1WSNKRICDRTKKDLCLAAJCEERX A IOEEOEDRI^EWLJ*/ECU»WDSLlCrLGYV 
PASAWCKISKKJCMSIRKIJttDEAXIlQU^^ 
PSLKFGFCI 

CPn_05«R 678033 678626 

CT469 hypothtcical protein 

TSKSIKSNAFI KNMTAT7GLLNLPSS0DSA3EDSTS0S0 I FOP t RN RELVST P EEXVRQR 
LLJFLMHKLNY PKKL X I X EXEUCTLF PLUfRHGTLX PKRRPD I L I ITPFTYTDAQGNTWI 
UJOPKPLLLI ECKALAVN0HALROLLSYNYS XCATC I AMAGKKSOVSALFNPKTQTLDFY 
PGLPEYSOLLNYF I 5LNL 

CTliJWl »7R*14 <79)9S 

<T17fj hypotit»:tic*il protein 

:::>w c^/n^/Jt ;RPu;KNjrrL7TLPTPEnLFTFrAr:of^rux*DYRt-rLvp i clckyt 

t JIIIN^RIJ^LTlK:D[LMAFEAlKv7r/ALUA^K1<tQAU^^>n(EKr^HKLF:iLPLNF 
I JIM I tttMVEFFAA I FVLKU£Y ETC I U)LTI , ACiILCKAi;LPYACYHYv.\ U (K LCKKI KJH K 
OA I ~ t RKEE(X> ( LCA [I I IA KQF5ELLA I AEF P f A I AEKI PY LFO: • LOEOC ERR11F.D F 
MIRir^|j:RWIII«Y 

»'Hi.«VHl •«NUn>; 67'i^lS 

»T47| tiyj»« ih.-f. ural p tot* in 



LFLYnOMNL^FACT/LFFF I . .♦JWiINCLL.TVP* .•WUtLKr^r.THPFTF.-rK JYPOKE 
fettlOAQRKRRVCniLTCEFPKUST U/YJlU FOlLRAKrw IW rvLYALNF£CCCTO<OM 

0FwocwMCTr7^wtiJ:iTjLKi^ig?v^ m jww: 

YHMOLVCF SEVTC? FNV33EFJCT tTTT ' 3 — ^ ^«- — 

CPn.0591 6R0)o-l ei.<l0:i 

yjqE taaily 

r,U rKRCTAYCTASAYNLHVLFHLLKPRYPT tU"REY\XANL05TCASNOt>rFFPFCVAV 

•■v/^:--.rpTrKr.:.;-r-:A. • ••••: •*• . -: rf -crrrmA: 

\:-:.v."/F:.w i r? ;•*" rr.-y '■•■\v.Fr: :: , l.-'l:k.v-.v^ 

HSOILOEPOFFWMP ETQAI YRXA"w^U I JiARINVL I V 



68113: 



661461 



CPn.0592 
yidD family 

lysjoif^fkrfwxpvricu.;:ylyowlispli^ccrftpscshyaeoai*s^ 

UtXIWIICRICICCSPWPaJICWICTALOEVliPYCEIDCCWSHrSE 

CPn.0593 682494 691391 

CT474 hypothetical procein 

VLCMOtAFTOJCTIWaS^^ 

VYUEDFl^IS0ASLICLISLFKDERYKYCRPIia>iAi^A:A5KHIDrr^ 

TCLRCSSViaRXPNXOtKOrWXLDYCACKlCYmSXCLF^ 

COTEFLYLICIXXWXWASSVASLAFWIRCCSERFr^O^E 

KSYUXTE13lAAUJ^VHDSOVV^ErCDEDLEKVIRIJHP<XSW 

LAaCPORVEAPRVQEZ3QOEEYVVCOC05LWLIAXRFCXPKDKI ICKJCLMXHRLFPCJCV . 

LXLPAKOS 

CPQ.0S94 682517 684958 

pheT-phenyUlanyl tRMA synthetase Beta 

fHO«T0V^VKSLV7n^XJU^S«RIPITU>0TYF5EPt^^ 

LYSFASVXTAiaLHTIPHPKADXIJlVATl,TtX3EKEHOWCCAP^^ 

FOSTOAYTIWSnJlCVESOCHCOCAOtXZajJMIOERAU^P^ 

lfrSmSLTPm£KCA5Fl>OJUtEICWrOAllLVXPKE7^FE^ 

FCTWmiSAOFSPXJajOSSLOALKQICPXKArVDXTW 

I^VEXX^PCSLTXLNSETWIJSC^PVVRnnHSr JCT iCCVMGAKAPSrQETTTnVIXA 
AYFXJ^AIAASQKTJ.PIPSESAYRrrRCXDPQNWPALQAAIHY ILEIFPEATXSPXYSS 
GUCRKLKEVAXJUIClTjORtttWFSmLW 
EEXDLVEEXCRT^SMrXCTONPVSCYTPXYICXJCmAOFlJU^^ 
LTWCZXmSlOSSIOrmLRSSXJJCXJJCSAATI^ 

CT0nAXtXTTXX«E51lSWLPfCPSX^FY5LXXAnfERXXYMHHXJ IDALTUCSSALCtFKPY 
QQOVLRIHKQSFATT^WVHPEliAKXAQXKHPVFFAFT NT DT. T dOCJOCTTHLYKPYATYP 
SSFRttTLTVPtDIPANLlJWiaXHECSlCWLESVTI ISI YOOJCSlXTlWOWSLttWOO 
YlRTLSNOOIEEEYCRLVALLNEIXTtrrKCTIKS 

CPn.0595 684943 685926 

CT476 hypothetical protein 
ROVOmOXJJiCYCVrAMSCSAYAS^ 

TKVLIOCATIi<E\nfSCGlliCEXTLTFPinTAIJJSA^ XYOOGRLVSRXTFFVKSLPSOEC 

I^NEDCTFVLTRWPONNI)St7rXTXFYFlETrYQCHVXEC5Y^ 

VFSSHNXLLSEZTFtOXIVIMCyTTFYnOWPESXTKYQ 

W RYUKJlAVrn VP i O^R TSXIAYVRCVICBSX^IXJlYNEOe WAEIVSWKKDTLHOOXIY 
AGCICJOOVYYICRSVSKAKrERLKAAC 

CPn.0596 685930 686457 

ada««ethyl trans (erase 

FAVMAXWTLIPKllQWSLSQACSECXLIJUCYPPWIWt^ 
LCPAAKKAMOEIVLMCSRYANKEHPPrSSHFAJCIiL X PSOYLEI LUCVAC X PFCtO OTT A g 
IAJO<TOTHPR7VOAACK0NPFXXFFPCHRVWSHCERN^ 

CPn.0597 688215 686479 

oppc-Ol ioopepcide Pensease 

lCKKP8FYQRFlSAYYR)aiA5I^WXFFXSVALICXYAPLFASS1U>aVWKCCIF^ 
RYtJTPCYYTW^rtJLFFNVUIWPFTIWFKLTRC^ 
KVODPAUk£l^lOOUa3CVRi24ISKVNSEWKIiPI^ 
KYroaOEASVMYCVAFEEraOSPMPTUWl^HKrai 

*#*RATt3NYR PFLMALTRI ENnJUADYKNUGOPEXAC X AY ANVEXRAEPYKK5LXXIR0V 
LnJYAXXR5AXSFXOOXRX>rXEXISEOLRXLXNPFF55FHWEOOACCSREM41C^^ 
SRVTfUCOLLAALVFCXRXALWACXCIT IALAXCIM IGLV SOY FCGTVPMXLSRrrXXWg 
TOP^ILMLVISIT^KSLLLmVUjCCFSWTCFSRYVRIEVUCOR^ 
SHYYXHVKQXLPfUXVP\nSLVPFAIO(AMXSCEACLTFT£LCEESSASIOQJGUXnrre 
PAESAVLWPPAX XLTKLLXAIALICDGVROALDPRLODS 

CPn.0598 689712 688219 

oppB-Ol igopepc ide Permease 

EG0S5VUCY I LKRUX I PLTLF A X VS X NFVI LNAAPGDVI.ECX SROALGEAC KSDKMRSY 

KCPORYLOFREHYGLTLPXFFNTRPX ITHKKIOTALOELANANNTTPSAXNAAKSLVYWC 

DCAKrVKPALLFEAPnASRODKYRM I AAOLF IROCVtOCFVCPNI^ PCORAQNKEXAISN 

AFXVTOUiEEOUyTKVEAlJCCWrOIJMCCTEVFCYSSKOF 

FXTTlJWDAHirrVISEVIKRLRCSLVI^ILPMIVCFVL^OirCMIMALKRWW 

FL I LFS XPVFVAVPH I LDNFVIHKTI PFTTI PKPYSC LRSPPEVFNELSTLCRX FO WSH 

CFLPFCAVSYGALAAOSRLSRS X FLEVLSQDF XCAAKARCLRWFD I LYKNVCKNAAVS XV 

TSLASSl^lXO^VVETXniXTOFCtlFFYOAIl^PJXINVVljr^LVCSA^ 

DICYVXXOPRVOLECRR I 

CPn.0599 69182) 489682 

oppA-oLiqopeptvde Binding Lipoprotein 

KRRE3GWfyKRC\lXV"ILJ(CIVACSLILLYW3nD/XERDtK5lKCWVRDI0EDlREISRV 
VKOVyT.TOAtPA^rvr^l^PKL , /RDEAFALLFO)PSYrNLLrLOPYICO(7rU'E 
nir.ILOTAirVCIRrENLJPFNCIXVVVCrrOU: C P.?LA:IPIIVr:KY eefstdlavk ieehlv 
PDCrXOKEFH t YUrTA/FWRP ( LPKALf-KHVULDEVFOR P4IPVTAIIO (K FFYOAVMNPYV 
ATMRAVAIJl:»:YEr\V:V:WFJlC^LWr»WKAirr/rNFXf»ECTRVUYWnimJLOPL 
PRFV/OYFANnEK 1 1 EOEWIOTYPTN:: f WACNFTMrfWANW/ 1 Vn X'«AYYKMWOOOCtVF 

:;RNii>rYOPi-viJWRF^TrErrrn.F9r4WiKioi::Yi.ppNORW^/:F^ 
KyvAr 7 :Avnrrv:-.Aui<AYTY .fpv^/vri :a»ka i ui< fj» i i hj.'LDX* wt 

i :a ;i'KA.'»t:p:nrNkVifJ*rii\'.:r't»VM»M < EKw wiener/ : ihkkv in w t vpkrfrlc 
Yvvr;r/rAi(T r atavatai-kki'; mxu/iu *iaul; *j.\n*>rtu>nAu*** :u ; i pi»ed 

f-RAl*ai.7B'JW1KK. VAWV,:FlHtEEAf<r ! irHil.IYKYtMXKPrifrl.YIIRKllKI fHKEAPYA 
Kl.K>tlC«LLYKI n\TCN I IWI«»T» iL f M-^ilTrVNVmVWI J!KK KHIH 'Ll.T:; 

"»^' , «*•^ r :-t.i ii. '-i,.H-mk/K>mt, .»:: lit II //mm 
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HOY MX I K^W5LCOAKRP3Wl.PW*n)PCU?f . -JLKOORLAYCtt I ILL5XY00KT 
F.7S LLJtCCTrSLNRAItOHlXYK ! LAOFTHXOHLMLC UOCCt PKSPC- 

TPnj)6Ql «91092 -572736 

cfpr iHArouNsJiK 

AFCASAAVS I IFTANP WOVTtOOCUU^liP X PLVtGLU IS I IVLLYCXYLPPQQRE 

DscWpurowiiA^ 

VIPAFCLACITLTUWVT1U.YIJSGXGDFFD 

AAMTxuxmocEYSi^EXFKFXPiaraLiiucFScrc 

KWOAI PVDI-SAHVSIJlDAyVALSC^YADPIUnrPCFOMIWI PSCRYSACI0OCmTA« 
RUttTOILNEYAPGNAWVj^ 

CPn.0603 694136 695185 

wKWRLxvLWLvst^LAjao^^ 
u>RVuiw«.rrpiAiaoivpKVLPOTO^ 

PSTKEICTUJaJlTUfTRHVIC: PLFPHrrYS\m;SIVRrFMKHVPEIPISWIPOrC SpS K 
FVSLITCHIRflFLOKWXWlCECCFLFSVHGLP^^ 
SENFl£FOSKroPQCWt5PSTAOLC0NIDTDKPWIW 
LRSRGYRALAI PAX YSS PLWVSTLVD X VKZNSTWAECU KSCKKHTC IR 

CPnJ>604 695981 695196 

fllY-Qlutamine Binding Protein ,„.«~ w 

CKKRQKSEAOUWKIKFSWXVNaiCLUV^ 

TSOTNAFLNDLVSEINYKENLN T^l^U^l^niUJmXT^QfJrSVUmiia^^ 
FSDPILLTCPVLWAOOSPYOS lEDLKCRLICVYKTOSSVLVAONI POAVISLYOHVPIA 
I^ALTSNCYBAUJUVIEVTXLirTAYlCCIU^ 
ACLVKTRRSCXYDAIXORYRLP 

CPn.0605 69673? 696150 

yhhF-Methylase mm 

LWarSSRCCVRILfcCKYTOKSUCTFSOTHIRPTSCLVKEAFFSlCra 
GHGAIGFEALSRGAASWFVDI SIKAIQLIOTNSALLCE0LPW1 FROOAQSAIORLIXO 

KLGXTYLAEYXVEXDP 

CPn_0606 697492 696707 

CT488 hypothetical protein 
SSYSRRWArrKSLOMHIYGLAOLHIW^^ 
EDIVIiiCOISIWCNUEAHXDFAnCDLPGriCYM 

Y11QCFALLTPKIAVVCVRLMDSPTICVKX1NFLTPSTQEO FLRg fiRLKR 

AFAAUTCEVTTVrVKrHYPPISSIXrrOT 

IRGIHY1LVAADYVNFVPQCVM 

CPQ.0607 698910 697573 

glgC-Glucoae-l-P Adenyltransterase 
NWaOKIO©FTCASNFESSHFYW»CVCVIIU3K^^ 

I DIPISHAI SACFSKIFVICOYLTYTIwQQHLFKTYrYHCVLQDO XHUAPEAROGO OIW 
QCTADAI RJOKLLYFXETEI EYFLILSGDQLYNNDFRS IVETA IRTHVDMVLVAQP I POCD 
AYRM5VLDX OSSGXLI DFYntTOEKEVUCRFOIJSn)RllIKKLTEt^DFl<*S!CIYLFR 
RDSr^SLLRZEBMFOCHLIOAOMattWQTI^ 

A£WCUCY1X»»3MIYSKKHHLPCAI XTDSMXSSSLLCBCCVIWrSHVSRSVUSX RSKIC 
EN8WD0SX IMGHARYGSPSMPSLGICXDCEIRKAI IDENCCICNttttLQNIJCGYIXYDS 
POKXLPVRXKI XXVPOGTHIPDNYX F 

CPn.0608 699690 699016 

•uridine 5 ' -Monophosphate Synthase (Ump Synthasei -truncated? 

VSFLYFVKKiRRXMUtKNYCDAXXJUSOAVAlLYOIGAIXFCi« 

I SSPBVLOmTLXWRUlPSFNSSLLCCOTrrALTLATSISlJCYNIPMV^ 

SDAIKVEGLFTPGQTCLVINDKVSSGKSI IETAVAX.EEJKXVVREALVFLDRRXEACOPL 

CPOCIKVSSVFTVPTLIKALIAYGKLSSGDLTLANXISEILEIES 

CPn_0609* 699672 699986 

CT490 hypothetical protein 
0NTXNSUREI*4LIRLFl^IStPKGFPLYLEPPLV^ 
NUffHYTQELLYKAVPCNYKSXYREIPLXIFPEVLICSTPTOSTC 

CPn.0610 701450 700029 

rho-Trantcnptlon Termination Factor 

R XFLRFKGS IMKEERSSEILircVRETXXlWWSNGXXSCVCECAW 

tAKXOR^XEELNXLARCYCVKNI(^LTXSO^A^FEIVKAXSERPOEIXXGEC^^EVLPDC 

FXSFl^PTYNYLPSAEOIYVSPAOIRRFDUaCGCTI IGTIRSPKEKEXYFALLKVDXING 

$TPOKAKERVLFEm.TPLYPNORIVMQ«KOHLAOMXLTAPtCKCORCLXVAPPR^K 

TVIUJSXAMAIA^POIVLIWilDERPEEVTOMIROVRCEWASTFOEOPERHIOVAE 

MVIEKARRLVEHGNDWTLL05 ITRl^RAYNTVOPHSGXXLTGGVDASAIJWPKRFFGAA 

ftNinSCCSLTIlATALIOTCSRMDEVIFEErKCTCNMELVLORRLSORRTYPAIOLIKSG 

TR K EELLrw P S ELER VYLF RQA I ADLT7 I DAMH LLbGR UCKTN SN AEFLLS LK £ 

CPn.0611 702133 701420 

yacE-predicted phosphatase /kinase 
RRNRROAKreEREXCISYDFXRSYSCE\T/flrfKKLCRJ4^^ 

OEl>3AYWSAOEX5HSrLIPHTRlCRRVI0Ll^S0WVDCAF0AOAIAAKVFYNSVLW 
LEAILHPEVCRX IEECYNOSIQOOflfPLFVAEVPtXYEIHYAKWFDWILVMANEDIRRE 
RFMKKTCR3 SECFIX^RCSRFUIN^EKI^OAOVVVQINCTX KEUtOK IEEYFYALKGAL 

«:rri.f»»»u /04KRH mazz 

MtlA-LMA r*,lymtn.iue [ 

kt : J I ff ULLCTVERfflR EY AHKXLFVLDAJJF I FRAYFALPEMKNJIOCOATOAVrCF IRSL 
NKU KEFTPCYM l!.VFa:rt*MKa'JRUA t YADYKSIoV/KK rEO I PPO I ALVKEYCiiL ICLA 
Y1.ErE::VEAl*DV I Atf [ AKK A«EEMYK^VtTAPICDUJUI.VNO*IVVAWNPW«>/3VV'U { SE 
V I Eh Y»: I PP< M I POYLAUA »T.TDN I POLPi VOPXKAAALUCOFCSVEGLLENLIMVKGL 
. \TML::EROETtKL::KKI-\U42^N I P 1 FVPI E5LTFPQHPVDEEKL IHFY lOOTCFKTLVP 
.'.VfJTEAXTVtMj 1 1 KDAtC'LTN I UJEW-iID t AFAVAYTiWILU:! J(LECLALT0C3WF 
F I AIJ££Q7TK tLP U * WFI -HEOI .T PYOVNliK RCCI lALLNAf ! XV tnEtiJYGlALAEHLTN 
- JTX* [rr^JIiWIItn^CT.NiWPAKIW^VXPlChLPEyrWFr.EFVAYLPI ixdail 
KK It&rttt Ml I LIU I FXn.KKVU'.'^ERA WrLOVF.PJ JV TU-IAl .FETELAVLTEE I YDU 



0 tupn ' rrvPrcEIiJ3CRLWEX^g3AKr^r/PlVW:i:^Oa<ACC 
CPn.Oi;t^ 70S^2 

xwTiAvxwwiAisx^ 
iiSSSwsiiTACxoxjawmpwrsHDRE^ 

StWT^GAKXFSPEXAXO 
VASAAASSPLVTCHXKHD XLPL5H0AAYX PPYLAL 

CPIU0614 707435 705793 

adc-ADP/ATP Tranalocaat 

WXWOCVCICEnCSSWPI^l^Yt^ 

TL\m/CSnACATVIPFTJCVVCIVPCAVIVTyVYGWLCSRYPRr^ 

AVIIYPVCDSI^LNSIJUlXLOEiiPQClJ^FIVMWYVSYSIYY 

Qy^ljQI^ITEAfiR^ALXOTCtiWS XCAC£ ISY**4CXCTFVAYS FACOSWKSVUXJJLT 

ItlTCSCLIMI^YWIKHLTIDTSXPffSWWEtCAATAKU 

IOSRYLLClJaXVl^YNLVIKLFBVWXIX7VSCXYSSH>^^ 

V^TWIRJOOmCALVT PLVMLVSCUJTCT I FAAIOU3 1 5 1 FGGVLOMTPLALAAWT 
QSMQMVL SRgrXI If K DQrKPiAFXPL^PEX?KyHCXAAXDCWSRXCXSGCSLXYOOtXV r 
XPSSWAASLNVI ALVLLIIMWWIAWAY XGXEYYSRAADAVATLXOPXtPSSSIVWAO 
ESVEOEEKAVL 

CPIU0615 708149 707634 

Do»A-Clycerol-3-P Phoaphac idyl crana cerate 

XLOTITDKVFVrVCrrVLYJtEGSI^XAM^ICAR 

FWGXIFTWOFI XLI^AOOEXXVTCLVPLVAX^FtYFLatXKrTrXXOFLA 
CPft.0616 708704 710137 

diuB-Replicative ONA Kelicaaa 

TLTtnt&SUHDXSTCNVVPSPVKSKESWTVUSC^ 
XIirRVMDAFKOCaa»XOymXW«EnJaW«ITVIGCPSYLIT^ 
XRSXSXXJUMSTAXEXEX]UaXOPKNVAEAXJ>£AONSFFKISOSTSVSQrri.m 
LTTITDXPYLVOWERCEIJTXJNAan^Frrc 

PAICKrAXALNIAENLCrWRX^XCX FSIJ9mTOUKRMICSRSEVDSKXXSXa0L8OH 
DFQRIVSVI NmOEKTLLI DOO PCUCVSDUlARARRiOCESYI} IOFL 1 1 DTlOLLSCfiCTTL 
WTJSRCjreiStXSRHXJCTLARninPXLCI^LSRJW 

SOLVICFUJUIEYTDPNDXPCTAELX X AXMUCSXGSVPLVFCXSJUlfWfYSArfEXS 

CPn.0617 710481 7U316 

aidA-FAI^-dependent oxidoreductatt 
LKWTHP IAYWXVVGAGKACCZAAYCSAXKSVSVl^tLTSNLOTI^ 

WRTixuLcxxMAEvra>scioniiLwnccpA 

H TtlOftT^ T^ T "'rarc V miVIT A^JL^KF^CXTVVLSSCTFKRCLXH IQDRN Ft OGWL QP P 
SSOCWEXajClCRCFPISRlJCTCrPPRliJ^SXNrSCMEEOPCOl^^ 
QLSCFXTKTHEICTKAI XSANLHRSALYOOCIEOVCPRYCPSITOIWlWlWWVnX 
PCCLKTOOYANCa.STSMI>FWOYT»URSVLSl4KAIITRPA 

SXLIECLTLCOO INCTTCY EEAAAQCL XACIKAVNKVTNRPPF X PSRQE5YICVWLPCLT 

TOIX^PYRHFTG1Ul£KIUXLR0t»UCAJU^HYCYELCU^ 

UjQXTTROmSVVSLAKALSRPEVSYtmREArTNDXRDLCAVl^^ 

PJOXILIOSUXAESLLIPEDLDYXOITAI^IA^ 

VLMIALKXMAKH 

CPn.061B 712300 713010 

lpU-Lipoaca-Protain Ligaae A 

M4MPTTNCXFLDLRCHSXLH0L0X EEALLRVANONFCX XNSGAXDSXVLCISIWLWDVK 
XSRAOADHXPX IRRYSOOCTVFIDSNTIXVSWIWSSEAS AOTOEU-AWTYCIYSP1XPN 
TFSIRENDYVl^XXXOCIiAOYI0R«R^KTTfTJWDIOLOX^ 
SHEEFLTnJUVFPSRJJOFl^IXASWUJTWEEFTI^ 

CPn-0619 713462 713013 

ndk-Nuclaoiide-2-P Kinase 

RJtYVYTMEOTL^IIKPDSVSKAHICEIUIFEOSCLRIAAMKWHl^QTEAECrYTVKRE 
RPFF0ELVX>FMVSGPA/WLVIXCAIMVSR1JRELMSATKPA£AASCTIRAXFCE5XGV^ 
HCSrrrLENAAVEIAYFFSXIEWNASKPLV — 

CPn.0620 714145 713519 

ruvA-Holliday Junction He U case 

OKMYDYIRGTLTYVHTCAIVIECOCIC*/HXAITERWAIECIRAlJ^OFLVrTHVIFRETE 
HLLYCn<SREERECFRXLXSFSCICPKLAWXl^ALPLKVl£SVVRSEDIP^LASVSCIC 
KKTAEXtJ<VELK0XI^DLLrLDSRVETSOTHTTSSCL£ECI0ALAAl£YSKIAAERJ<IAE 
A IKOLPECSSLTDZ LP IALKKNFSGVNXO 

CPn.0621 714707 714144 

ruvC-Cro«cover Junction Cndonucleata 

LSRLGSSFK0NKFKVF0E5 IVSEX.I ICVDPCTI VACYAX I AVEORY0 LRPYSYCAI RL8S 

LIFEYAPNVAKKAWr.KGlAiJKROVUVHVaK I LNVPEVLHPGNEDIADATALAICHTMVA 
RSPLCCVR 

CPn.0fi.12 7H7fil 7147*M 

^T'^Oi nyjx'tnnr. lc.il protein 

k Y.'JTR I.LU X LKLHLF: .UUKSZUlWrfHT* rjtR^UILLCRWXDAO IHEMQ0 ICNILSCV 

':::pk* iW.'AUWftw* t ioemer i hloyrfouialeefyrrreeakiiodleklooewt 

wI^^UvEKt^ tRI^Ttl ( r£ ( K K KUJji rv\M (TE [ .T EX :p RUT YFJ ( K t KOL.EE0L0R YVS 

rjv v&i: 1 1: t eojt^\axm: inplkk^l i duvet d 1 1 1 rrrn:;p. iarlhcxlorocbao 

T: ' : JKVf.'U I EK LTI'VVTOr *M7iKK A I ALU>1> I VKtATVC V» J'CI A IK W^WtPCf/TKUWUC 
• W> WKI^iFVtWVK::ii;:K;:» « ^: 



"l '(ill 



/It. 



■TV14 liviW'fl i«M I 
I P* JTf f IV VYFTRDI V I PTV I f: :i» 1-7 Tf Y, UT/lltfTKI 111 VJM* KWKA 1 1 V I !IU SI I* rPSNCO 

h::pw,vi a» u^p/mkvri tk i i wxkf ;u «<. t at* u<eal t KLTicrrnuv iokxpla 



101 



WQQ027994 f file:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\WO 0Q27994.cpc] 



Page 1CM of 330 



WO 00/27994 



PCT/US99/26923 



INAU-'IADUA (HFP€TE£EPTSA3FEE33AMFFPETSSATECE 

CPn_0624 71801R 717011 

*japA-<Uy:«?raldehvde.J-P Denyrogenaea 

AMXWINaWnRLVLA0tUUW3SVE\rtJ*XNDLVPC^ 

EADHUVTWKK toruERN^jwjwwvtx«\aBCTa.r^ 

Arnra Tprnirr^rrFNpnror/i.'WAsrTnruAP t AKVLuwnmctJfnvHA 

■ • r 77» • *■ . *T; km > * ; ; , «V*/A'.T^:Lr Sir y-fSNAFWVf : 

. ■.•u.r.TUii-.rrrv; *u- ^kah* -..v; j-Ttuc ; : ~;rrvc?.':.:.zvr~~~>'^ ' ^ 

I WJiOR FFKLVAWYDNE7GY ATR IVDLLEYVEXNSK 
CPn_0625 718488 718060 

rll7-L17 RibmoMl Protein ^ 

WOKWWICniVCRTSSHNRCKUWMIJtSLrHr 
LAAimiAICMJfVRYNKLTSXSWAICQCOrSVYWD^ 
R I UtLQNRXGCNAQXCX IEFtAS 

CPn_0626 719670 718495 

rpoA-RNA Polymerate Alpha 

WLPAWCXAQSWWKEKGHSISIAJC^ 

GHGHTLGNALARALLXGLEAPAI ISFA«aVLHEYMAIBCVICDVTMX rUGfCAUJOY 
PHQDSSIXntTTOVLIWSISIDASDljUUtfOQICEVTL QnT . r.q r a raVNPPQ VIFTVTOP 
I0t^VVlJtIAreRGYTPSERIVl4»W?nfKVtJ>A^^ 
L^XVETXCRVTPKEAlJlFSTgiLTIWFSXFTJ^ 
WWnO^VRSTOCLSNANirriQELVIKPCTIUXOrRH^^ 

CPIV.0627 720059 719640 

rell-Sll RlboeomaL Protein 
rLIRSRVLVraWAOAJOSVKIUCOUWIPSCNAmVXAT^ 
VCYS8SPJCSSAfAATWODAAJCTAKNSCLRIVEVC1^0TG*CR 
IRX1ZTPVPWCCRPRKRRRV 

CPn.0628 720461 720063 

rtl3-S13 Ribosamel Protein 
DAYTTXJ*EAORMPRIIGXDIPAIOCKIJCXSLTYrraGSAra 
EEEVCRD6X^SEYTVZ2HaJUWV0SDIKR^^ 
RKGXRKTVAGKXX 

CPn_0629 7 2188 1 720487 

leeY-Translocese 

KIIttJRPWmJ»OrPLITnJWlttrYTrAIiTACRVGVriPW 

SGCWOLEPIISOBAfAjlMVIAESV^ 

RI<3UJRXTIVAlAVIQSUJAXFAUO©n.TIF^^ 

TTCTUXMnGEOISOXQXGMBXSU IALGILSSFPSVLGS rVTOCLNLGSODSSOLCLIS 
XUIIJUA/FVFVLITriLirECNraKIPVQYAM^ 

A5SXjUfFPATX(^FXA5ESSWKXJlXAAUAPG5LVYS ICYVXXI IFFTYPVTATOFKPEO 
X ASEXXXXNATX PCXRQGKPTOHYXXYTWNRVTLLGAiri^IAILPSL^ 
SYFUXT AMLIVVGVVLJTMKOVPAIXIJ<RRYD SVX JK. T UR T KCR H 

CPn.0630 722316 721885 

r!15-L15 Ribotoa&l Protein 
MXtIAXnZStR»aUCXLLBIBBPSSGHSK»ai^ 
RVPTRGFSHOTDKCVEZITTGRJAELFOEG 
EKrPVWOOTAWLSOGVQMLXJSXT 

CPn.0631 722812 722312 
riS-S5 RiboiaMl Procein 

TDA IRKGGEAAXXNLMXI EALEX3GS I PHEVLVMHDCAOLLLKPAXPCTX3 IVACSR I RLI L 
EHACXXDXVAKSFGSNNPKNQVKAATXALTCl^PRKDLLRRSAAXW 

CPn_Q632 723354 722827 

ril8-U8 Riboeonal Proeeln 
KGLISSV^VNl£0VFAPNVt4JILIKVREFV^^ 
KSLKXRRJttLRVRJ^RSSPTKPRISWK^ 
CLTTOO^EVAKVUJTOIAELGKmjOLOinATDRCP 

CPnj>633 723760 723209 

r 16-L6 Ribosomal Protein 

SMSRXAREP I LLPQGVEVS ICDOK 1 1 VKGPKG SLTQKSVKEVE ITLXENS I FVHAAPHW 

DRPSCH0Ca.YWALISNMVOCWUTEKRLm^ 

STXCVSVEKCTLISVKGLSKQLVGEFAASIRAKRPPEPYKCKGIRYE 



CPn_0634 724215 723787 

rsH-SB Riboaonel Protein 

E3SIKRKR I YHGHTSDS I ADLLTR I R14ALMAEHLYVDVEHSXMREA IVK t LKKKCFVAHY 
LVXEOWKRAIWVrLOYSOORKPVIKOLKRVSKPSRRVr/SAAJC I PYVFCWC ISVLSTS 
3CVM&CSLMSXN IOGELGCLW 

CPnj)635 724763 724206 

M5-L5 Ribosomal Protein 

GERKANKSRUWrYTCCIRKSLPEaCFGYANXMO I PVLXK I VLSMCLAEAAKOXNLFOAHL 
EELTM 1 3COX PLVTKARNS lACFKLREOOGICAXVTLRCIRHYOFMDRFCNIVSPRIRDF 
P.r?rSNKroCROCYSVCLDDQQrFPEXtnilRVKRT0CLWITWn*fA 01 UULL rTL LELMSL 
RFKKAO 

<:Pn_l>i» \ii ?25 100 724750 

r 124*124 RiUmnovil Pint-tin 

FT. EKEVKXia* I RVKOKVF I lACNOKCKEDKVLCLTEXJJCVVVGrVNVR I KN I XRSQtHPX 
TRISI EAP II I l:SWRLT IWEPAKliVKVTI^ELWORRFl/TriOLYRLVRGKICC 

'.IftjOii 47 /2 , i47 j l /2'iO'W 

rIM l.U hiUitMiMil l-ror.«<iii 

i k m ioukw J(VAi*my^KKVKi -FKVLLr^RRRYATvnriv rvt^^-RDVErtr^; i xkoov 
irAvtvimuaiiU'KrasTtjcFTTrwarvi foowwKTrniPiPVARKwm^FtKicsL 

AlhVC 



NX KEXVX SliASEPP^^RKVR tCVWARMEKTVVVIIVtR XF.W fgrUVJKMCTMf? 
CPCU0639 ' ^ 7J ^^ in 72,743 

A^i^2A5iSiTOWcSrra^ 
KNiARALryKOCprWMr: 

rllo-Llb R i Dot caul Protein 
XXKLHPKRWRIWCW 
CTRO^RirPD^TKJCPAE^^ 
RRAAAJOJCIKTRr/KRVtRV 

CPn.0641 ^7092 726409 

raJ-SJ Riboeonel Protein 
KWXM3OKOCPXCFRTGVTiatWRSl^SNK0e^ 

WRJOBCKXEVT X0TARPCLVXCXICCACVXXIJCI2XRA1,TC1CEVWLCXAEIR^ 
VA»IAROXmVSFRRAMa(AM}5V>OACAVCVK I0VSCRLACAXXAR5EVYWCRVPL 
HTiJiADIcnf ATACAETTYCI ICIICVWINLCOISSSTTPKNPAAPSAAA 

CPTU0642 '2T440 727096 

rl22«L22 RUsosoimI Protein 
RRHSMFXATAXYXRVQPRJUUU^U<RNX^tA^ 
AElJCIWiaENl^mVRVOK 

CPIU0643 727725 727450 

rtl9-S19 Rlboeonel Protein 
EIRIKaiSlJUCCPFVOTHIXIUCVRAMNIEiaOCTP 
FLTVFVSETWCHKlJWSPTRIFlCSHPVnOG 

CPn.0644 ******* 727722 

rl2-L2 fciboec— 1 Protein 

FXRX lJ&Mf HJCF Rm Kfl'R Q LVU'AFPEtTTRCIlJgTnCSKJlSlJUS^^ 

faMXAnSCRMRaQOAXOLYIIVVOPXMim 

YIIJtfrei08CDWVV8ClgS PP lC P OCOCrLltt 

OVIAXSPGYVTLJaffSGlTFHUfEOCRA^ 

TAN«PVDHPtCXSS&aUOmXPRTPWCXVT1^^ 

CPIU0645 728933 728598 

rl23-U3 Riboeaael Protein 
0M102PYtVXXRKYVTElCAXXL£KLSAi 
EAXYVtWNVKVKSWTXJ 

CPQ.0646 729636 728950 
rU-M Ribotanel Protein 

YRBl[>r/liSKrDFSCNKICEVTVAOSLJ'AI>ECDCl<OL IKUYIVAIRAKKROWSACTRIIR 
SEVSK STRRPF lUjAG TQWWaASPqFROOSIVr G P KPKF NOHVRl^^ 
XAQICXQTOXLT^n/DCTlVPVOALTAPKTOSAlJlFIJCXOIVECRS X LF IDHUKVOWlllIA 
LSXJ«LTAViaXFVYCXNX»CrKASAiefIVX5^ 

CPiuD647 730490 729657 

rX3-U Ritotoael Protein 

YIZmYCXIlUPLI7XP7XFIJtENFLFri4N5XS1CXL5RFVS 
RSKZSVMJXXESMIHXFOXSCSLVACSVIRVCPinVT^Xm 

wfrrricK^KPicuafuucAOCRvrRF^^ 

OXS ttlECr OGVKI tlCrC FROOPCSHQSCFHIUIAasiGMRST^ 

CPn.0648 731636 730605 

CTS29 hypothetical protein 

FFF13CPCKEViq4ATWIRSACSAASIO^PVAKEPAA\«SFAOI0CIT^ 

AKFV»TKSUIKCT1CI^X>VSIX^A«StiEACC^ 

UNCAVPS IVNST0RCYQTrTOA/EXjCSKTRXRJCrPCEYSI04LLTRC0^^ 

GATTYSATFCVLRPLKLX NXLT AX P FLDXATVGNFCTAVAC IKT INMCACVAOAVQCXAL 

EOKXJTRAXXSLYNPtCALPJQQSQLSCDVI LSAERA1JKJHVATLKRWVTT T rXMf f 

WDCVXLI PLP XtVACSAAXSGALTAASACZCL YS XW0KTXSCK 

CPn.0649 732672 731710 

fmt-Hethionyl tRNA Foray 1 crane terete 

LWJCWYTCTPTFAATA^DIX^HKIOITAVVTRVDKPOKRSAOLIPSP^ 

UjOPSXASDPOriEELRAFNAOVTIWAYGArLROr^IPRYCCTNLH 

PIQRCXKEGATgSQfTVXRMOROHITroMAMXTRVPX 

TWXCSOOLOLVSOOAAl^TIAPKLSKfSOOVPWKPAXCAYAHXRCVTPAPaAirrVS 

FSeJ(APXJUJfXRKA5LLAEACRYCUPCTVVVT0R0UAXACSS^ 

SKSFLNCYPAJCKLKIVFTtJW 

CPIU0650 733513 732665 

IpxA-Acy I -Carrier UOP-GlcNAc O-Acyltrentferate 

SRRJ4HA5XHPTA I IEPCAX ICXOWI EPYWXKATVTLCONWVKSYAY XOCWITIGWT 

TIWPSAHICKXPCCLXYOCEKTYVriCENCEIREFA I rrSSTFECTTVSICNMCLXHPWA 

KVAHHCTICMAaX5NHA0WCMV0VC0YAIL0O<^^ 

PPYT ICSCN PYO WC I NKVGLQ RRQ V P F AT ALAL I KAFK X X Y RADCCFTES LEETLHYC 
DXPCVKNFX EFCCS FSXRCX ERS IOXOAIXEESADKEC/LIES 

CPn.0651 733975 733517 

tatiZ -Hyriatoy I *Acy 1 Carrier Oehydratate 

ieejP3VIXLREU^P>mYPFLLV0XVLSYDIEARS ITAQXNVTXNEPFTMCHFPNAP 1 
MPCVL ILEALAOAAwVL ICLVLEMCRNXR f ALFLO IOKAKFROAVRfCDVLT LQAQFEL I 
WrF^WAOAPAT^OI.VTEAELSFALVOX E5 1 

CPh_'*S2 7I4MR0 7J1990 

kkn:;i vttv&:u\*yntfjrrvn-rijtKVj*>t!y:mHU^ tvFOROn 
k y ENvr A.u j^vyttt ni: rrr LSRa-> v r atvd i lmaalrs nn r onl i iom:tErpr 
t wrr>JVKvia.nvAc iceceoxvc; i arltp P'/yyonoo x flaafp??oeu( i.t/tlhypq 
: l'.t rnvvittLV i hee::krok t Af*rRTPALYriELCFU4EXCL iccxyru^wvFxDtt ; i r 

:;r* Mi«FADtIVRHX lUil. UaM4:UV:RI»FVAJ(VUVVn.7Q4SSNI AFtttK II-KAUU. 



HtlJIl.tH 



/JV/71 7JS4'MI 



4-rt 



7 MH*.H 
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inir.E-Apni ipoprotein N-Acetvlt rens t...«se 

OEPVLR IFTFV 13HTL IAFAOPDL3GFVS XLCAAOCYCFFWYSLEPUCICPSLPLRTLFVS 
CFWirrtE^IMrSWKLOTYICICLIYLVWLTLrrtlJVt^SCrSCLLVAIVROWTAn. 

wslpcvwva r muimxpscHsroyuMmrASAYCRorocn^wAcosFAvzAVMtsr 

YCUXKKPHAItKU^TUXPrrrCAIHYCriJCHAFOOO^^ 

spivvwEOLLOLv^prooProLLippEVvvpFacHRovY^ 

L3MSOCATAL50Hr0CPVt ICLCTWWKEW^YWWSACVXSHJCCISVGYIJICRILVFaE 

v r RrtrF^rr. :H>cr.P^Ar/v-KRLPnPR!V7r'7VRGLPRTnTT:T* ill puhlqsyk 

L\i-j M ;.-ii.Tnr<aivi k:* : j Fyurur:MJW- ! - wr* vr< f v:T7/r.vwr-.;uiP.:LK 

; U YUTIttTKAl ;/i.LT4U UWTOVtriCiVWtUArVAV^YaiKUVRUAK 
KEIR 

CPn_0654 7)7051 736503 

vdlO/yciA-acy 1 -CoA mioostsras* 

IUU:DFLSVT3RYYR>CEYPXKILSVEST7<IJa<X^ 

CUitSU^RIjaWAERHTCyrcVTArVTJAL^ 

VKWAD«YKOEWWITSAVrrFVAVWI»CPIPVWjrVP 

ELK 

CPn.0655 737856 737101 

dndO-DKA Pol XXI Epsllon Chain 
XEXHSUXDrmCWIMJCLOViaCORXZEXAAV^ 
QRVHHISNAKU^PKIAEWPCIKAFTraX2JYIVGHSV^ 
mXXTOu^AKEYGDSPWiSl^SlJWHFNVPYI^ 

OLKOVLAKPIIOOCYMPLGXHKGRCFSEX PLAYLOWASKMDfTJSDLLrSIRHEIKXROKST 
GFSQVNfPFMEL 

CPn_06S6 737842 738048 

No robust homolog present in Genebank/EMBL as of 11/7/98 
TWFIXWLSIJDILLTVEOTIXLTLYTASTO^ 



CPn_0657 738476 738051 

yjsE (AT Pass or Kinase) 

TORYIWVSHSSOETIJJCTELGCVLVPCAVLXJJ^^ 
EVASP5FSIU<VYCrcPKRLCHTOLYRIDQKNQeYIFODA^^ 

■ DTINIYITMOTWEREIIIEKR 

CPTU06S8 739180 738455 

CT538 hypothetical protein 

KRVGHDXSGAWQXXXOFttacQKKPEXJAryXJ^ IFKTPEDAV 

RII^SXKIWRETElOISSEXPOVNEWITailYICPrPCXVFAIWW 

PONMEKOOGVRIKRrLVSEOTDVIKEYAVPPKEPI IRTVFASAXTC*LFKSLPPLIJEDFI 

SSYLRiWlXEVONQTKroLESSFl^LLQnALVEOICXAArXE 

EE 

CPn_06S9 739482 739838 

t rxA-Thioredaxin 

WBWRDSNSIFREa&HWIISSDITOFIASGI^ 

ELPHVTICKXNIDEWSKPAETIf EVSSI PTL ILTTCDCNEVARWCtJCCKEFLTNLI NKHA 

CPn.0660 740327 739860 

i poU-rRNA Methylase 

MRVVI^PDXPC1TCNIGRTCVAI*»ELILVRPLGFS^^ 

SXEXAIJCIOTEEOIFCXJTlCGSASYTETSLPSSCTYVFra 

PHCXJDIRSI^TSVCIVLYEVVROKTVAWKNPTV 

CPn_0661 741139 740327 

mip-fXBP-type peptidyl-proiyl c is -crams 1 sane rase 

ELSX^KLSRTFCHXIARQIJUSEXSjIFTO 
OKLVFEKKSKEJOSX^EKriJCEtlSlCKACVVEVOPSK^ 

KCSFINGGWSSSEGNNEPILLPLGOTI PGFALGMQGK1CECETRVLYXKPDLAYCTACQL 
PPHSLLIFEINLIQASADEVAAVPOEGNQGE 

CPn.0662 742938 741172 

asps-Asparcyl crn a Synthetase 

SKGCYMKYRWRCNELTSNKIGENVOLACWVHRYRNHCXrVVriDU^ 

QPELH0Rl£*VRSEWSO^GKVCPRtJGMENPNLATGHIEVEV^ 

ADOH XNVNEEXRLEYR YLDKRRGD I X EXLtCRHOVMXACRNFMXMOGFT EIVTPVLGX5T 

PECAROYLVPCTIYPCKFYALPOSPOIJXOLtWOCLORYrO XATCFRDEDLRADROFEF 

A<J I DX EKSFGXTfQCLLP X I EOLVATXJ , ATQCIEIPLPLAJ<KrYOEAKIJSYCTDKPOU?rD 

LKLKDCRDY AKRSSFS I FLDQLAKQCT I KGFCVPGCATMSRKQLDGYT EFVK RYGAHGLV 

W X KNQECKVASN I AXTMDEEVFKELFAYTDAKDOD ILLL I AAPESV ANOSLOHLRRL IAK 

ERELYSDNOYNFWITDFPLJSLEDCKIVAEMHPFTAPLEEDIPUXrOPIJKVR5SSYIX. 

VIJCYEIASGSORIHNPm^SQIFTILKISPESIOEKFCFFIKAiJFCTPPHLCIALCLO 

RLVH\^TAAESIREVIAFPICTOKASOLMI«APSEIMSSOUCELSIKVAF 

CPn.0«»i>3 744220 742901 

hicJ-Hiccidyt cRNA Synthetass 
KSNHFERRHHVTVTLPKGVFDIFPYlAIlAKQIJfRHTS^ 

PIFntSEVFUfVGEESDSA/KKEWSFLORI0CRSKn.RPEGTAAWRSFLEHSASMRSDNX 
FYY I LPHFRYEROOACR YROHHOFCVEA ICVRH PLROAEVLALLWDFYS RVGLQHMQ 10 L 
NFUOC5ETnFRYDKVLRAYUCr. < ar;n^Af SQQRFSTNVLRILDSKEPEDQEXXROAPPI 
LQYVSDEDLKYFNEILDALRVLEI PYAINPRLVRCL0YYSDLVFEATTTF0EV5YALOCG 
^RYIXa.i:»FCGA^PACCFGVGL£lttIQTlJJkOW 

WJQHLRRI JS I PTEVCWSMKKVKCALKAASTB0VSr/CL ICERELISOOLVXKNKSLRKEF 
KJTKEEVEORLLYEIONTFL 

■ Cti J»AM M4775 744557 

tki f^mi.t hmkilui pirtsent In t*enebank/EMBL .is: ot II/7/ob 
l>*rA>IA«KKLtALlwtFLVVUKr»nT*KEHUAHATVLFAARAKYNLF^VODVFrVHEVIEP 

i;;|'U.lv»yb*iv 

• lltjm.l.*< 744»»'»N MMtfS 
ulifj; MfxaiSUvvplur* -Transport 

KMMVSfTKKPVITKll 1 K F. 1 EOOEWKKKYKYWR t R [ KflTHF f« IY I FYYFTRKSFTFAMPTL 

i ai * t ;fijka«.ii * : i ravn nr. triKFvr/TVMStwipRYPMA n :lm itcltn i ffgkss 

:: I VtJ-VUJ** yM^XAMMWAI<LLT1IW"/AKJEP/rr***^rpJIfNICCAtlPtLTCF 

1 1 1 /Y:> Mt« ;amy vi < > f ur it mslvl inrlrdtfo.'U/ «t»pr I ekykrdphhahi iecxsas e 

• rw: i mkki .: miH i UTYvr.TNfj* wflaaatiff r y rvRMAVKOwr»ALFL i etwiyaavk 



ANrrai^ICCUGm^AiL^CKISltt^^ 
OC?TtLFVICTFLYCP<^ICUVUEWHKIUACT 
*»^FIAl^«Uli^ ' „ ' ; 

CPn-0666 '4637« 75010? 

dnsE-EKA Pol IXI Alpha 

CFFLTWI PLHCHSOYSVLCAMSS IKTFVAKCOEFCI PA1ALTDKGNLYCAVCFYKECT0K 
**!TOPX IGCfY T APWFOTItltEKRflRAAMHL I LLC KNEOOYPNLr ILTSLAFTECFYYF 

rtf;pifc:ijtCY.:r:: : v ""-' --.w-xlu'- -- vtt/:uik 

mjkejiacfkeewu^vv ^rrv/:.^ K:^::irAr?a'::iv::wftsoAH 

EIlXNVTJSCETWIAKOI^tPNPKRICWRSREYYFKSPACMAELTO 

la^OPTFOFSTaWYPlYVTESUCTLNSYTEEDRYOASAVFUtO 

tAJCKFPHRDPIDXVKERJO3KEKAXXIP10CMCX7tXLIVVroiXW<AKA 

SVLIJLLGreEXEPIRFttFFERFINPEflLSYroiOXDICXAG^^ 

AOirmnTIKAJO^VlCOWIRTLDKAI^ 

\OOJCV01XCZJCTLTSXNIAKSAXEXXTGr>SLA^ 
SttMQElJklOILRPtaiJXEIIAK SALYR re 

LATVTFwS^YCFMCSKAAAYCLrTYTTAYI^^ 

LXMWSf^IPXLPPIIItltfSSNWPVATOICIRFAJi^llGISIgaJK E 

SXWXORSOLIOCVSKXSXESLXDAOCrDCPVSNROU^^ 

KTPYTLCMKDRXKEVPXCXPXDXPTR^KXEXaJQXX^^ 

VTA3ETENLPHGSVVRTVFI XOXVTTKXSSXAOiaCFAVIJtVSKXDSYIXPIWPtMYIZO 
QEXJXEtALmXLVLCaatSOSLRXSCR1MCOUIVNE}IX 
TKSTSOCETKAiCCNXPKENCHTQAlAPVrLSt^LK^^ 
V t TODNlR VASMSPDOAYTVCIPXI^EIJOELVTACIJVKVrTV 

CPTU0667 751097 750177 

No robust homolog present In Genebank/EKBL as of 11/7/98 

Nim£iaOiaYFWOaJLYFAAFVASlJrGVFl>ORV|CAOKIIOIL^^ 

RFVWISCFEELOVFtltHVSPEWLAiJ'PEYlu^^ 

HX ISOECEXLHSLVNGEHVIinCTWrCSKnP^ECLUJiAGXOlKRV XOTLATLGCfTSRC 
SlJkOALALKNITlAERVIICEroXiaaiFASCNOICrKFOOro 

RKAAVFPAOYSESRVRHLVXKI FCWTUVRSSMVYVPVYX ISLVS AtKSVRVTYIMAVT 



CPn_0668 751176 752162 

CT547 hypothetical protein 
WRJV^PRUWCFIiYVPUiVLVSTJX^ 

G0E7LKICGNFRXALLCFGX XTOKFPWXLRWAOYLICVCYTTOWPDIJUXArASYLOL 

P0AKSElU0MICYAXA0RFAG<aailCRXCl»I^5F^ 

GAOAX.YSKAAIXXVKNXS^TEATICTLXXLTXjOFPI^ 

QYlJ<FAKIJrtXAHKKOHPNHPIJqVVSANVGAMREHY 

YRTAITWPDTLLVAJCCOKRLXJRISKHTS 

CPn.0669 752140 752775 

CTS48 hypothetical protein 

XtYUXLPKXEIWaOFSt^IYI^FSLJU^SCCGYSXLNSPYHl^S^ 

PXXEXfffG0LCSALTYEL5XRSFAI5C3tS5CACYTLXVEXllCXm 

THRHF IVSNEOU^LSAKVOLINNDTOEVLI D0CVARXSVPFI)FEPClOTANAKEFAL0Q 

FEMI5EAX1CSARRXUXRIAETXAQQVYYDLF 

CPIU0670 752738 753196 

rsbw-slgma regulatory faetor-hlstidine kinase 

X ISYAYOGDJSPGTIAISCISKRGDLEWIKDHGPSFNPLAVSINIOEEU'LEORXlflCL 
CIFXJUCSSVt3EIXYAREDHOIIVMUtKLN«3HS 

CPn_0671 753660 753205 

CT5S0 hypothetical protein 

RXTIWRJCYTH5L0FFEEFYTOSIIinCT5FPECYIlJXA£ILSYPKCTXJ 
NDFX XAESKD&TXJNAOFAIWLVPEXVOGOAVTRGYIAVSOGEG^ 
OSSLILEALOLYIJaJIKDTENALRiFTtFTJWDH 

CPn.0672 753723 755048 

dacF(pbp5) -D-Ala-D-AU Caroxypepc idase 
TXKSTIMiaiPFrrYtCXIFYGSCASULHAGLSFPEVRSATAAVVHAl^^ 
VIYPASMTK IATALFXUCHYPTVL0TLXKVK00AXA5 XTPOAKXOSGYRSPPKWLETtJCS 
TXOLKLREEtl^LFHAIXVCSANDAAin^MACCCSVE^ 

FNNPHGLHHPNHYTTTRDL I S XNRCALKEPPFRGVX 5TTSYX IGATNLHGER X LSPTNXL 
IXPGSTYKYPPALQGICTtnTirrACXNLIHAAEXNNRlXVT I ATGYSGPVSDLYOCVX ALC 
Cnn^EPUJyCELVPPSrcLQLEIANLGKLSCPLPECLYYDFYASEOREPLSVSFIAMAD 
AFPIEOGDLLGHWVFYTOECXKISSOPFYAPCRFERTIKPWKLYIOCRVTTSYRTVHSITM 
LLNYFRIRKHRJCYXNLXMY5KX 

CPn.0673 755242 755463 

CT552 hypothec ical protein 

GKSTECXAYHC FUCQVS I ALNREEVWXWPHHLMF I LHOFOOFSCEOORFGS FLEAT IROR 
VS FLVLOEK X ATLK 

CPn.0674 75W85 755577 

fmu-RNA Methylcransterase 

RGILYVTKVPFROHHAYOLIJCOLHTSAISEAORVSYYFKOMISLGSKDROWIONIIFNIL 
RHRRLLETLI LDSCEOVTPEALVAXVNBCVLENLDSYSA I PWPVR Y5 1500LAHFLVQ0Y 
GEEOAEE IAK IWLTEAPXT I RVTITDK I3VKEL0EKLEYP5SPCELPEALHFSKRHPL0ST 
EAFRRCFFE lOOENSORISCCl CLTOKOXVLOFCACAOCKGLIFAOKAKHWINtJSRXAI 
LQTAKHR LLRAGARflF^LACNJLP.LGo FriW t VXJA PCSGTGVFR RH PEHXWQFSKKLLLNY 
VRVTJKS ( LKOASAYVC PRCRLV/ XTT^LKEENEAHVAYHHSL^WKEVHRKTLPUfVGXC 
DAFFTSHFOK ! 

TPnJ»»;7S 757^)1 7 r .»,7..#l 

f.T»»V*> hypothetical prof*, if, 

VPUXILOFOFS IC;YYLRVMXI-\IIU>rrHI!*V.'M<KW^LiiAWtV>n*IMJTNYOT:nffrri 

FKSLt Lf E^KRPVDK [ VWANK7Fr.rf ;r: ;nF: W-n>FTl [L-/TV::FWrn JU tfJVCRRLAA 

', JAEDFOT 1 1 MS 1 1 :10: '. L:»£VLAt Kl. f ' ityjl i /n i r nCTFVUXOhTTA [ Ai Pf-tUT at I AIT.FL 
AEVI JiK V I VEKK LMVJ K : •.IWTPET^ri f Y : ! 1 1< r^jNPAl MDKH ITMMM4WI J i/Y PROIC 
tAI*KAAt*\-YNW1PflFWROFrJ»L7r*M' 
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Tpn «»H/ii 7^^120 758051 

tiomoionoun CO CT695 

SMTTPrSGNPGPBNTISPPl gSAAtCSDSDLBWVSCSAtW 1 ET r ADTCI PEATPStC 
TTCTUCDLVORVEYEARGSaTTKLAR t (UCAVSOIMOfVKTKRHPKEOCVRSLSOZFCD 
LAJt>TRLPKETAEPPYrfAIXTALASCWFF7 ? HVFt*R£rT^ 

PEAAVAFAL Z UlSCCKWATIUVQESLnX^tECACMYNAFSLEATT^tCVSKIU^nJ. 
YrWR I OCLANVW nK I ITSPYLCACOCVSVVOWJCTIfO^ 

r*.PNFALVMKO I l.Yr.VROOnnr n/ttFLM^EEHASEVOTDWl^LT/NLP ILEEDYR 

H" '.w-.KKCiriv: vkk • v-:..v:m vr 

CPnJ)«77 760410 759256 

No robust homolog present in Genebank/EKBL as of 11/7/98 
R tAKSIOT9GNHSPPDVWVRQ*0CD3SSTQC^^ 

WOTIVREFTI^KXSPOSSOCASCPAIOSPSCPTXRPTHPAPPPPTTOOANAW 
APOPPTAOS SSGSB37TAMSS^MCLVSIXJCZUVHSHACSOXVlJaC^EWriayi LHLN 

C^AlJRVREHHPNPgRZWZAT AiUXCAAVHSHATSWIAWQawniDVWMAMISStt 
UJCHKVlJWAWAmTfTVLICOUrE 

CPn.0678 761329 760682 

no robust homo log present in Genebenk/EXBL es of 11/7/98 
k: XMswps©iSKNTHJmxyuw<Pt*ttESC^^ 

TCFTSRXSrniSGAPEGSOOFSAPSADr/KSP 

IPOVPOOGAOR5SCSTTlJtPTRPAPPPPKTOntCWCRPATHCSKCPAPOP 

ATHGnPAPOPPKCZUCOPGOSCTSCKXRVSMSOED 

CPn_0679 762936 761725 

pgk-Phosphoglycerate Kinase 

GYMDia.TVgMPEnCie\aVRVt^^ 

Hl^RPKTOrOEEYSI^PVVDVLIXW»KVPlAXTOn 

Rni(XEHPEXDFTFAA£USyC0rrVNDAFGTSKSXHA5Vy^ 

cXEFLGRKLLTS PWU>FT AI LGGAXI SSKIGVI EALLNQVPYT f IJtfXMUf I P LQAICICS 

LSNSLVEXSALOUUWVUCIAXSIOrrriW PPKL0C 

FDXGPimXErZRZZNOSATVnWPVGVyEVPPPDSGSIJaAN^^ 

AAAVVW^CCSTICVSHVSTOTGAStZTLBOGrLPGTfiVLS PSK5 

CPn.0680 764254 762971 

ygo4 -Phosphate pcrwue 
YSHLPLZZrVIXCGTrTSWOTaWDVAH^^ 
ORVAffTIESSrVSVmPMIASClNVra 

CTXXvu3ocrriivwjsvciiuswiLSpr>fBCVAyLirari 
pFtwutfAram*rrwzs«arcucvssTWAV8^ zttyyvhtkhcsyzsot 

PWCCSLTVRlJC£RaafVCRrfLWtR I FAYL01 ZVACFMAFAK2SNDVANAZAPVMSVLR 
QAY PASYTSYTLIRXHAFGG 1GLVI GtAIl*SWRVTETVGCXrrn.T PSRGFSVQ43SALT 
IJUJSZLQLPXSTTWVraVLSXaJTOZR^ 
FALRALPH 

CPn.0681 765001 764258 

CT691 h ypoth etical protein 

LRDCRYEEliOiAKLVSDKEYOADC XKNCKRNHLPAGLFMPX SRAG ZLEZZ5 ZOOS ZADT 

A£Z7VXZU.TZWttJarfPSKETLIWUJWliAra.T^^ 

RliWCRVAKSEHESTXVlflREUCirrSDDrZZPEXE^ 

ZtMIUEK 

CPnJ>682 764912 765955 

dppD-ABC ATPase Dipepctde Transport 
TSHGWIWSLFRfWNXPKJtSCXRLMASW 
OTIAIXCESGSCKSVSAHAZUUJJCPPTSVSOOWPOa^ 
ifonpqaslnpvttieqofre: I hthlaltaevaxexkl y ALETTC FHD PRLCLNL YPH o 

tt3WA£TADZ)VLVLYAJ5R)r/ZCAPAVQHPWPSHPY^ PO 
QPPHYTAFPSCCRYHPRCSKXUfRCSAEAPCXYPVRSCKKVRCVILYOD 

CPn_0683 765936 766919 

dppF-ABC ATPase Dipeptide Transport 

GVULMl INrPOPLIOATSLTXHYYKKSFVrOGKTIXSRPVDDVSFSLYSTlRAVCLrGESC 
SSKSTLAl^ACU^TSCrLTnCTP IKIJttlWCRHOUtSQVRI^VrQWPOASLNPIUCr I 
LOS LGHSLLYHKLVPK EXVLATVREYLEL^LS LEY FYRY PHQ LSGCOQQR VS I AAALLG 
VPCLXICOEXVSAIIJLSXOAOXLNMLA£I£Xia^TYl^ 

QIVEXGNTKR IFSOPOHPYTRMLLNAOLPCTPWROSKPirQEYHlttfiEESCSTCCYrYN 
RCPOKQEACKSEX X PNQCDAKKTYRCXH 

CPn_0684 768056 767181 

spoJ /parB -Chromosome Partitioning Protein 

EKSCOXVTEEXSKDTXXEVAXroXRVSPTOPRRVFSNEELOELXASXXAVCLXHPPVVRE 
ICTCDRVLYYELIACWRWRAMOUACATTIPVIUCHVIADGTAAEATLI ENIQRVNLNPl 
EHAEAFKRLIHVTCLTODXVAYlCVqwC^^ 

ILTLEDPXLREXLNEX I XOEKLAVREAELXAKOLLSEECSSI ELKPTPLOKAESSKQHEE 
U»Rl£DI£CrriCVgirTCSKATVSFKU3^ 

•.'Pn.0685 768026 768217 

no robust homolog present in Genebank/EMBL as of 11/7/98 
FPpSOVLLXFXWXLDtOAFEXUMX^TDQRIWI^ 



t.'FnJWft 768 37 J 768176 

No rooust nomolog present in Cenebank/EKBL as ot U/7/98 
AKDSieXPfX;RLFRWOELFFFS5VWCE08RPRKLYPSLOHIJiFPIEItPRFUJX 



«'PnJl687 768501 7«J214 

«T4»u lrypnr.ttutic.it pror«uri 

UK i HKHr.M lAYn FrrrPNtTtt.FMCKLVIlN tWKKFY.TFSSA I A IC IVLAJFL.ILK tVHNTYK 
HWAKMtJ I LLLTRAAEVAVa>SFLPSK&-ALSJLE0A7HUX:E3MK PYAOFUUCFY 1 1 IN 
KI«Uf lAYYAl ll*Ytnr.QAU)Lmr rgKlXKEtSEAOATOLYOVALSKJYQLWTANS^PE 
YtTr.':FI.Tr.U«V I EUCCt4JI' < W^DFAALXJSlH^H0FEPM*^:;iXO^LSKRFTa(>ai 

i'It.JKHH /•.•H7 ft /7l)li7 

T4*i rr/ftffi itw i*Mi protiMii 



PCT/US99/26923 



3LMLIVLATR0'/FF.THSJW3. . JCNYLRUJJCWA ITUriCERT.* ^IX-TFCFASFC 
FYTN I FPFLEE0F I PAWT7VA5RY I rSNAAw OmPSH R LK r i ETUAr^ E I F JNYMPFCC 

KSKOLYUCKQCPKR 

CPn.069« V1407 '70 117 

vfhO-NLf.*W«Ut*1 AjBinrrnn-iferi^ ^ 

? V.TEA YAfV/F £ FY? .Y * " .*?':"'!! ~T 1 .1 - - » : . *V?fT t i.* * r ** * " V! .V^FAI 
MHAWI^EXACRRROSLVKKIRVTO^UCLM^ 

LOOVAELVKRYDAYLAVDCAOCAPKLP ! DVO LWOVD FYVFSSHK IYC FTCXCVLTCKK PL 
UOVPVEOZaMVAXYCMOWEYLPAPHKFEACTPNXACVt^^ 
DrnALTT^riJOCEUJ:iPCVEILCPSIEEPRCALIC>rTIDC^P 
GH0CAQPAKiy*#A/OWt-RVSLCIYNI)EDCX30FILVLCDSL0KIRR 

CPVV.0690 772704 771436 

ASC Transporter Membrane Protein 

LSVUCDirA/LVSIETTCSlASCSP^IUUVEAO^ 

RYlttAT^EUXOMWLHMWHSLAFECX LINSXYEPSL5gLPECVI\rSrPTAI«tt tSF 

|*3CFI7VNWPlJkJlJ^VCSEDRCmYXPEm7rSDP 

VXL00RASA0X0XSTOVDLEMVOSSK?IVlCVTEXXS«ECAO 

TXAT^^ICWfTWIXSCOCramWWIVCKIOCHAXSL 

rMETTVSroNlKSILYSOTiaraTISXSSOGn^ 

ETt)EVKASHGATWPIJPC0IFY><RSRCKTEA£AOEKLIKCFIJC0CL 

GTS 

CPO-0691 T73467 772685 

CT691 hypothetical protein 

RCtCSKlJaiO^X>USCICVKXLTOFNtJ*IOPCTW^ 

SSGElAI£ECNLI£XU>EZRSIUGLrVG^ 

SXDEFinTI^TVXXTYEYtaTTDlJTJRNVNE^ 

OSOLXTVOAl^CRVLSCTlUZinTSSU 

SLMMELtAJtSYQEVTICRVAWR 

CPO.0692 774945 773461 

ABC Transporter 
X0ErCATOJCVI«3ESV1tCTLEEREOT 

X XDjTttJOAYBYMCOXJilJAHARLHYCPI AYDDXVYFSS PKQKKPT fiRt f r*PPCXUffTT 

U£XPLDEOnUXNVENVAVm.VrOSVSXCTTFia^^ 

YlUMVVSHRIHPTAAIiiAAVrSOC^FVYVPK^^ 

ED00YA5YIJOCTAPAYSS>CI«HAAWn.VAH EHAV IRYSTVQNWYA CDOTGIC O CIYNT 
VTXRCLCA&YRSKX SWSPVEVGAAXTW1CFPSC X LJO CT ESVCEFYSVALTSCKlQAPTOTIC 
fttJ<VCJUrn^TVISJ0CISSt)ESKVITR5LVSlCKXAI 
KXVVEH5T^Ei<E«XTSiaPXX«lXYLitS1CLSPEaVWSLVX 
CASXLLLZXUN5VC 

CPit.0693 776292 775240 

TPR Repeats (O-Linked GlcMAc Transferase homolog) 

LRSTHKVLG EX SKEEAAXHLAXE7LCSG INLFLSCEY EOASCRLKETLEXiSfTAAlAYCY 

LCZZAX^rCRVSEATJACSKCnj^SCPGOSYLAYCYCA^ 

0OVGCWSLBSVYKIUjaiX0EAX^CnKXIAJU»W 

EVAVAKNPL YWXAWVKLCFLLS RSXRWDKATEAYEPW^LR PCLSOCKYKLCLCYLTU2X 
TIUALKAFQEAXJlMAIOAZ3JU4rfVCnA^ 

YLHHH UUi l UX ATKELLTL OJCKD Sn APU^ICTVVSDPSSKyERJUJTIS 

CPO.0694 779635 776330 

pbp2 - PBP2 - 1 ransglycolas e/ 1 ranspept idase 

FSCESCAKMIKSMXRPIOOTXYUXAOiaKRX^ 

EAYKPO I RVLPCYVTRATI CCRrCKTl-\VM0LOYtVSVAYC AI FOUTRANRVDOOOO 

LIPVWWYIMCLSELI^OEI^CLDRXAXEIUIHAKASV^ 

SKDWP3LH^VVRJtKYP0ESVA5DIL0YVtIPISl^ 

FXQEKEaAVPEAPCTKl^TLSAZLOAYADAI^LLEYEXTX^ 

XKOQAX IALDPNICEILAMAS SPP^fRWTOFVRAKVA£I)SKAVRSSIYiartJMCIHIAEIY 

OUCVPLIPJRRNPLTGirYEEXLPLTrtxrFLOrLFPEl^IKWLJC^ 

VTRLWtJPYEECTCPCSAXFtlAVFPNEraiUOCVlSW 

KEAUX3VFNIXPANYtWILYTDlLRl.rVDPERFSPVLPSEVH 

SAFSTIlfliAFXEWrKSWRKSEFLQYIAAKRQEEAXiliCORYPTPYVPY 

rCQEHLtnri^YlJ SI t TPtTrffi LCPyfDZLDt^rZWEXJ^ 

LSEKLPALFSTFREFKELORPLLOJCYPXS XVRMCR0TEOD1AASFYPVYGYCTUIPMAYC 
OAATLCSIFKLVSAYSVLSORILWCHNEEPANPLVI IDXNSFCYRSSKPKVCFFXDCTPI 
PTFFRGCSLPC^FKnCFXDt^Al^MSSNPYFSLLVSECLCOPEOUO^ 
TCLCI^EYAORVPHOl^YMRSCLYATAXOOHTLV^PLOTAVWJ^SL^^ 
UJQTOEW^YU'SKKKRTIFMPOAVVEAajCTOfR^ 

ICKTSTAES XKRVCLOREYCTMXMKOI WF AAVCFSOOOLSLPT t W I VYLRLCEFCRDAA 
PMAVXMXOMWEK ICORESFtRC 

CPn.0693 7RH201 781382 

homologous to CT695 

SLEVSHKlCUJtSALLSAAFACSVCSU)Al,PVCNTSDPSU.IDCTI 

CDA X SLRAC PYCrYVFDR 1 UCVW PKTFSK^KPTCSAAAfTnT AVOIUWA YNXHLltDA 

D«fFTNAGFIAlJ^IWDRFDVFCTLCASNCYIRCNSTAF7iILVGLFCVKCT 

SNCTWELYTOT^F^AMRUALWECCCATiXAE^ 

PKCYWWFPLFTtWTVATATCTKSATINYHE*^ 

AONIR I AOPKLFTA^IUTAWPSLUaWTALSTTOSFSDFHO I VSCO iNTCFItSRJCACCV 
TVnATLVQADKW^LTAEAftL INERAAJ IVXQFRF 

CPn.06^h 7R1703 7R25 , »'» 

CT»i'Je rtypnt tvr t 1 ptiir.»*iii 

NT>TFYVRHPLLTY:9IFE tfWilLFSQ'.i lCLTIR0LM.'W;AIIF»3IOTniAWr»mKLYXFEE 
KNfa.Y I !MLMCTUX'l-W»Al.l tl IhKV lODNKTVLrvt JTXFOAW.*V I RlV^AI EACEFFlAE 
RWI/X1KLTNMTT rRJCJ IKTUJK TFJ(f)l ^JRNOA Yr.TF.K tAALLAKHIK'KLUOItiXKiYMK 

kak :u.vwor:TOc i A7A^\Kr i / : r i vi jvr.vim* :w7t- 1 rn rvi 1 1 >iijfje :i.K::rRLUN 

V I FENII EAKIIKU: I EfTrtlVKJa JCVt 1 »I4^K| SUI 4ilJ lUEtKKKK1 1.1 .AKKfTtiKAN 
l.'PB.OA"!' 'M.r«,itV /»M417 

rt:t -eiu/kmc u'n K.M.roi rr: 

K*TiF:«rrn J(TLK5.f V;t.TF« Tf.AJ.l'Ai t V Jtf,KKAW/l 4tKJ* ;iJV;ybWKFJtR 
ETFWJl 1 AAlCTrVW 7TAL I KITWTI >I-V A WAVV'UKl-V: '411X111)1 1 .KV KVI/TVKAI •.* />AA 

: :: vjupm » r/0FLttA\rrH7TM :»i i m i ■ vav ki r. at?c rr a*, r v : a r m iktvai .tmj «•» : 
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TAOSUUCOrAHHWAAOPOrLSKESVPAEAtAKk :A30tOCKP9r/tEKIVTCKUC 
FFOEACLLECPFtKNADW lQaXDOF?WTrrc3SVA£ECFXUnCIGA 



783443 784201 



epJJc^cttr^^ 

I LRCLAEOKEW I WWrJACJOMCMLATLINGMAVAOALICACS X ^^^Zh£25rwlwn7 

ecKntEALKrje iu^^ 

vi i -.- rvi^r/ : r,^.i-u*:;,.i, :/?*-••.:/ : ?tt::H:s'fiwr. , K. rf i-:-L*amJTa 

•:»*.. •/ir.*':. , i Mi 

GPTU0699 794177 794721 

rrf-Ribosone Releaeing Factor 

TOSVLOOTBa04AA^ 

LR0LVISPY1»WASALWCGIIAANU<W 

EEAXINVTWIRREA/OOJCXDSALTEITW^ 

CPnJ)70Q 785094 785609 

S^pt£ocy!^wtxotei 

CPn_0701 . 7«584 796672 

INFOOHLVLHGIDFOCINVOTLW^^ 

E5F 

CPn.0702 789700 786929 

yscC/g«pD-Yop C/Gen Secretion Protein D ,, ,, nnfiniT 

Liao^pvicrviiiiicmLOGiKxiaciaiciLscLrrxj 

RDDttAACPWJSAASt^AKKSHTXXTrPCSIPSKVTSXFW 
XUCQ^ERXKPRPEWlTTADVKRSPRnJ^EVEXPVP 

AVNMNX^IKKOX-EEQTSTVTEIOWPCTOATPHASXIWVAS P^P t 5i5S^!^lSM^ 

OtJKSEZaCVKERLTKRELTCIiajCXlCrTWFEDISIX*^^ 

FNVTIVSHDPTSVDDLSTILI0VXJOOTtJC\^ 

TCEAWVTWTRX.YSVSFSAAVNIIOPI45HI^ 

AAXXXnXTrSVIWITYEVKYANPAALVSYCQrnrcj^^ 

LODIGYNLYVrTAKDEDFimXNS tOWLEVWJSIVI IGWQOIVORVICLIj K^lJP KgV 
YI EVLIlXrrSLEKSWDR^OWVALmEOSKVAYASCIX^ 
IPLPTTOLTCrSIMlKSSAFCLCIICWVt^HK^FLTLa^^ 
RIMAOtnWWPTVCOTVPYOTTNTI XQETg!VrQNICTn?IC^ ^VVTS TVAP>»^L 
OI1OTISELHS ASGSLTPVTDKTYAATRLOI PDOCTLVMSGHIRDICrilCVWCVPtX^I 
St i^rrf rePTTr^anypMTMMTTKPlCVISSrEE?rR\n7^KECYRYNW£AJEGSIWV^^ 
APECQGPPSLOAESOFKZZEXCAO 

CPH.0703 791205 789685 

pknS-S/T Protein Kineee 

PJtICFMXX*GGIPLPEPQVIGGYH\nCKIL£IO<IJlSR^ 

TS RSVYNFLKEAQSUiQXTH PN IVKFH RYGKWQOC LY X AMEYI EG I St-REVI LAQ F" I S X- P 

OAlDIIFDIAQAI-EHI^RNIUiKOIKPENILITPO^XKLIDFGLAWt7re 

ICTFYYMSPEOJlQGESHSPA^IYAlJGtAAYEMLGHWLfiR^ 

QPSI»WWWSSTREriQOIHHYIWSCX^EPlJtIiaaHTVALYroL0TO 

FISCVLYH0GYPLYPHATTJTXAE6IWFM*aOTSPIS^ 

DRVCEINECLIRMaPIDEHGISILCXXISXZNKEX^IACGX^^ 

FSPGLGKITSUJIRTTICVAWEICDEAWCTLIZXI^ 

ESIHGGIQSRQHGSNSPSTLXSLKRXR 

CPn_0704 792330 791209 

ntN- Flagellar Motor Switch Oomain/YtcO family 

RYFMAVAADSSASWUtSRNWLSSLCKTEEOVAM>EFPKnjCX}HXIREXF^ 

FRGSITAVEATXEFCVHLLIOPMVVQFVfEVENIiFLTSEEXJM 

EKXJKXXCFHYYTVAEACKLFEEIOWVPSLSAKVOCSDAI FTATSLOGSFOWD ISLRLOCK 
h^CRLLLPEDTFOSCOKFFSCLHDESOLHNIDOTQQ ISLSVEVCYSOLTOEEVHOWPG 
SFIMLOSCLYDPETEESGAXXTWKHOFFGCRFLTPSSCEFXtTSYPNLTHEDPPLPENP 
OASAAPU^YSRLWEVARYSWVSEFIKLNWSILSl^rWPAYCVOIIUCAKW 
XALSDVLQIRVLEV 

CPn_07OS 793176 792334 

CT671 hypothetical protein 

FMEIJtKTAESLYSAKTONHTXOfONSPEPRKRDVKVFSLEXWOTRCEKTTSSlOrrRTES 
RKFADEEXRVDDEIAEVCS KECEOESQEFCLAENAFACMSLX3 XAAACSAEAWEVAP I A 
V3SI0T0WI ENI IL3TVESKVI SEIN5E0LVELVLDASSSVPEAFVGANLTLVOSCOOLS 

WfSSr ^^^jg^^^^^^^ Y ^^^^ CTSV ^ U ' V0LWCI EEV0T 

CPn 070* 79J689 793180 

CT670 hypothetical protein 

YAVAKY PLEPVtA I KK DRVX)RAEKVVKEKRRX«L£I C0EXX>REKEAERinCVKNHYHOKI00 
LRDLIXECTTSD AVLQ 1 *S YIKWAVQLS EEEEKVNJCOK EWLAAS KELEKAEVNLAKRR 
KEEEKTRLil K EEWMKEAIJvEEARAEEXEOOEMWLUTOt^OKK K RESCCS 

•:Pn.07O7 /OSOIV 7*13704 

y:;cN*Yoo N (Flauttldi -Type ATPuae) 

'/NMDQLTTOf T/Tl MJOUiDVKLTTWCR I TEWCML r KA WPNVRV^EVR LVKRNHMEPL 

yTEVwrruTAFi^puiELJCvjPSSLViPTCLr^iRAar 

r< iri/jf r/i vrri • r fkawdpuiraklro lum vrc [ dumltvaju vjk ic t fagacvcks 

. ; LU SN I AUNAKKAUVNV (ALU SEROREVREF t O iOL'ZRE* «MFR : A" f %V: 7niCX£ UOL-RLN 
AAYVrrTA lAKYPfcUp IKTWf Di JVTtt F ARALR VP 1LAACEPI ' AK Ai :YTf jv pst LPn l 
r .ep:/ia::i>»« rr tTAFVTvt.vAt snrwErvAOEVK'.: r ;ii t vi i :nalauayi iypa iovla 
. : t ::uu-ta ! vr«K^RR i u :kar evlak ykanwl i ft v mvwk f.intEi i»fa t dm i dkuir 

FLKUtif IIRKTMYPXAAJOI J«A C FR 



CPn.0707 



rrr»iiJ7 nYpocney.^ 

":.r 



7-^203 *"»VT42 



Q V IANCK X ESTPJU^^W*0T UVAjC*i PLJ 

CPh.0710 ^ ^ "J25i« ^" l0 
"^S^TAV^ITKARAVKCS 

cpn.0711 7W791 796486 

^SSlSe^dkkfcoocsuj^ 

rpti 07X2 7 '»3 15 796781 

wi rfMBiin- homology to adenylate cyclase) 

S^R^ECPX^PVLTOISHSIGWS 

^PRA^^LJ^S^LNPW2WAJ(DSPICC1^ 

rSrmiAIZAf^EXEn^TmOCE^ 
^fSSwtl^rOTXrTAXlDOIAKIWISV^ 

wlSSptoiwndlsvs^ 

S55SaSrWXUOHKAJ»Ai)TXVW 
Tl3lGSFILTIJ^GCXjaLFCIC7rA^ 

TOeSpgJG^Xl^yVRfEEOAACLVDYLNIHFNYI^LLXJI^^ 

nmac 

CPIU.0713 799817 799332 

{S^SSAOON^IVt^U^PPSAITTA^ 

CPh.0714 601125 800091 

SfS^PEXAOAAIJ^ELTSOOIRPYIWW^Crn^ 
T^S^Sol^AAYLYOHCWrrFCSR^^ 

S£SA50FSDLSC£5tJtS X 1&lVTtfftW9RTtJilKEt1ttTVTLDIVFltt&1QIMXACT 

KsovmAJautcAAiaouexYciacsOT 

CTh.0715 601636 803462 

^X»04AA^E^XLstASL^X 

FDO^tJCXSASDXOXSXCOOCRCIPlCW-IDCVSKr^^ 

[SSS^FSVIlSVRiaKTllIATTHRtWESK^ 

rwmF^IROYTYUSCIXIRFWEVFISmCIJ^LFDA^ 

XAIKXJ^PXFESQTimiClXBnCIRSSLXiaMCCAXVQAL 

SlOF^DlJCSKOKKVHYICI PKLRDCKFKYNDRSLYC EASS I FLXECESASASILASAN 
PLTOAVFS LRCXPMNVFS LETTKMYKKDELrflATALG ITONEIOHXJOrWCVI IATOADV 
DC^IRNLiITFFlJCTXXPL\TNNHLFILETPI^KVRNyrrTIX 
^LEmFKCl^ISPWTA^ICPEIW.TPVTITSt^ISSlWFY^ 
IKWLITDf 

CPn.0716 803466 804902 

cry rA -DMA Cyraie Subunit A 

mROVSEIJRWFt«YASYVILEiUIPHILIX:iJCPVORRliWF^ 
im^PHCOAPIVEAXVVl^ICCYLIDTOCNFCNP 
FWT^AFHDSYDCPXJCEPOXLPAJCLPVLliKCViniAN^^ 
LMDKKFTVFPDFPSOOUOPSEYODCUaiTlJU^ 

SIBlAAKRGTXKIt^IOOFSTXWPHIEIKLPKCSRAKEMLPLlJEHTECOVIUYSKPWI 
YENKPVECS I3EIUaJfrTAL0CYLEXELLUJ0EOLTL0H*/«RTl£Y IFIKHKLYOSVRE 
Vl>IM(KISADOt^A\ai4ALEPWU<EWTtnn7tOI7rSOLASLTI 
LAIEM(OAAIOKDLJRIKE\TVKYLKGliERHa<l^EP.KTOITt»FKTAKW 

CPn.0717 8a4«»68 HQS 10ft 

CT5?6 hypothec tcdl protein 

lP.IFFICTTITIVVRHEPRHIYIRKPETPKAPDVEKPr7VTEYMT><ANTrTFECP^ 
RRALIEORCAEEC0KHYDNFl0CILI3TFCLV>IKDMOPAOKASKR11RSVYKE0 

CPn_07l» RP930O 80562<i 

Cr*.?7 hyporhoc imI protein 

RA'/MT.rTYFLALr\TRLi10ERFlXSPKRWAPF1NSPLYLTLIADHDTrYWW<[i)KFPLP 
VEVWEXTVIJr/r^LLKJ IFUrnDLSSUtLLAcTK FE ILTUt&LYl.'AON I 

C:fTtJ>rt\* Mt»»i»«77 HfloH'iO 

:-.frJt ( ivrtiftk mi i. lino : , .ynrh.n:««) 

FDI KC.VRKV I V. fcmTVT.JFTVrrKWWSnfJjKYLTC/llf KY^RAFYgWI I LiyJLVO IWSQ 

iim*7A'niur»:P! vr iniOKKRaj£LLrKA ti-uArrttzr im i i.v inkcrdhwmpapc 
»»h k .ti .vuai j jiiu, :i :m *KKFfttnnrt«n ; »viii<i J DKr/r.? ;t. r ( taktk«akkvf::elfs 
rr r»i j»r.:^i -\v» : ii :ki if .tt i irni i ::iu wnkmk wfr/r:: ;k kavtih \vi aw jklsfv 
aj ^;M.n» :trnM.MvnMK»i[*rrr ti/au v/<; t Pi«wr:*.Y':LriK«,«i jiAY:,viii.'YiirrpROF 
i ::irA. :u<KiM<::i.i.t rmtNhrrr I ikkw.i.e:: t ijteo 



is*. Mii ( .<t*ii 



'.'Hi.'iY'tM 



/••V/IM V-»S0J4 



i."T«.'."i liv'|«*l Ifl |*t»»l ••III 



105 



WO0027994 r file:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\WO0027994.c pc1 



Page 108 of 330 



WO 00/27994 



LR rTHKEFI-V/: I KM. VDR PEZVP I K EV^CTKT : . . -^JVAKPO fTT I I^XECTTI KAI R 
TLLVT?VA;.TWWRV3LEIKED( 

'ipnjn:: W7^?t 9084*9 

cdsA-KDO Jvnr.n«ta«»! 

KRKVHPNNKK I L : AGPCVI EC22 m.E J ACKL03 I UAPYSDR lOWFnCXfDXANRSSLN 
JPRCP^LTECLR t^AKVKCm^ ILTSVHTPCIJAYAAAEVTNI ICVPAFI^RCTCliVA 

t a eta t vw kfcc-f: „*; pwnMEnr t t ir /r .rrrif nr : :j.twww » z.vrtCMRS r pvl 

.!•:/-.... ~n l • ■*. : •.*:.«?« s* . .t .7r;r F :a*c." 

LvVw.-;<:.:Li;i; .v»u.; twi/. r :. :: •*:. my. Yt 

CPnJ)722 908477 908974 

CT654 Hypothetical procein 

YCLSKTI(n.YCCaJYSL5lXVLAroTKVAI I0VD0ICDV3O00CHT0t3PPrUCIWCWV 
SKQXCS PECPFFHCKI0K5CMELKFP0S5YSCXCYLTRI SCH X LTQNFEKQMQFRGNSGL 
tWOt^U<VYI3CRrOVDPVPCYGSPDKSDSSSGCMXTLyLSLFRN 

CPn_0723 808978 809703 

yhtoG-ABC Transporter ATPase 

ASMPIt^VCWLVKlCYNlOCPVTNDVSFW 

KI IFXNVWTXXTHDHRAMflXGYI^EFTIF^ 

TLVDXX£L3SCU<iaWGTLSGCERR* 

IKIlASRCICILrTDHKAXr r SIADRCYLIXDCKirrECSSSOMISKPMVKOHYLSDSr 

SY 

CPn.0724 810802 809706 

No ro&utc homoloa present in Genebank/ZMBL at of 11/7/96 

RTSTW^WRSO:il^KlLPFPeX*ia«^^ 

NIPYITKATRRGUIMJCTIJVYXJ^SUCI^ 

FFYCCSNIEDILEEMRRPHRILUjCFSYCOKPKACPDCRFWDACRYIJPSHPTCASCSICT 
MHRIliARRYTTVI IPT FIDX AKHLHT/IJOCRYPGYQ X LTAVTACELSLKMFCOTASVMNLK 
CVCIRlTGRianTWIOAERGVKFCVTILE^ 

CPn.0725 810629 810587 

CT552.1 hypothetical procein 

SCGDVQfFF APLLYESUlRCUWPTSJOOOQIARlXriNDQLTTEI^HVNELlJCSt^rPE 
5LTT XKA XAEEVLSCDEPLLO 

CPn.0726 613364 810880 

CT620 hypothetical protein 

ADIDKI¥CTSISTrYXXl^LVS5MHSFAQR>tRZSLEHIANYEKTT 

IUiSt3IY!^VnClHKYEVERATVJ«SIPVMIHa^^ 

YYNAVKOKMAQOLXVEXlfrVMTTlMASVNSlO^PAIlXflVPOl^ 

PQTLYMPPEEXrf AIQHA PTf TOGHK fOf 1 NOLAGKYO^ATLTOTFADCRVECFXDILT 
AVQGVLT PE0rTIFAEIATElOAIADH\CNn)EACL0RI EISACEKIA\vn^DLTRNDX 
. IHFCOHITDLYSDQVAAIfiSFDTVt^ASIYVNOHOOTMF^NLSSF^ 
S(XnXSSAAX^CAX^ARCLNSRFNELTA£0OKLX^XlCStVTTI^^ 
STWALN1TATWDHVICAMLE£AKEIJ^ 
QYTIYSEKNGICVEIWXU-NYGSTGFLPEITKI^AJCTNAECT 
NKIEDLQSQLttFTtMXTElXKaU^QASELIU^^ 

YKKLYYSNLGS5 XGNS X IDAISOYVMfflVTYTNrA^YVGOOPAVCAOGANAFPGSOESAOA 
KUJQERTOAAiYtflETWULTVlEEOIWRVUm 
SLVLLQNYWPXJIAflGSVACTTEVICECOEC^^ 
STIOSICQSFAI*OWFOU3IJ»MLTSH^ 

CPnJ>727 613559 818192 

CT619 hypothetical protein 

KYYLFSMSTFSIONRLRT ISGESTRI XXL0KKVSCPDPR5VPAINLEELNSCXYALRKLM 

NAWSENTNVAAUiO>NNTIPPTTSWTDYlCHSRPOASSPTlAPSSOT 

VXDQCLA£LVASVTEZ0LCAL5TXSTVWLKA5Yt^L7TLTAE0EX\n^ EKNL 

LEHVKOEKAAEXOAXOEEXXAVUJUCGVS^ 

RAXVCAPXOEMNENAIOliPTPPAITPDNVNEVICSHNT^ 

EI ITIL0TLVPLVDKTTFT1CAETDLIYTAT0LPOT 

SIQNX^ETKRVVENro*SKt£TOLSMF^ 

DmCLU:LSYMYERLAI»EXAIFD)CSVNE^ 

LCTAVTSQDQI KAYT^RGNEFKATRHFFHNI G DOKYQF ANETW 

OCF IREAMTNVCTVEADWSNAORILWEFVrAATAHVLQLOLO IAILQKKADDLDPGKAS 

FTE^FAVAAWITSESLGDALISMILNSOl^KQEAFIJCPLXtt^ 

ITTIEFSTTSVYYSUSYLVOSKTOONtJAGDYYE^ 

NCLLQK INSLPCATSAOKOE«IJ«TTYYOYSLSVTLNOLTVlXSLtWNCLI^ 
YDKSVFK IESFDDWI PTLAALESFLTSGFPNX SATCCU3PLFTOVQSD00TYTS0C0T0O 
LNLQNQMTT IQOEWTLVSTSMQVLNSI LSOLACA X YSN 

CPn.0726 618493 816525 

CHLPN 76kDa Homo log (CT622) 

VFMVNP ICPCP IDETERTPPADLSAOCLEASAAN KSAEAOR I ACAEAK P KESKTDSVERW 
5 ItASAVNALMSLAOKLC I AS5NS5S STSRSAUVDSTTATA PTPP F PTTDUYKTQAGTAY 
OTIFTCTCLADIQAALVSLCDAVTOIKOTAATDECTAI 
KYASt»l0AIU»U^TSFOtXQAAUj0SVAIWKAAELU(EM0DNPWFCCT 
VD0TDATAT0 1 EKDGNA I RDAYFACQMASG AVENAX5NNS I SN X05AKAA I ATAKTQ t AE 
AOKKFPDSPIWEAEOMVIOAEKDLKNIKPAWSOVPNPGTn/OGSKOOCSSIGSIRVSM 
LLDOAENETAS I lJ<SGFT<WIHMPVTEWDSOAAOOElJVAOARAAKAACDDSAAAALADA 
OKALEAALCXACQQCCILNAOCOI ASAAWSAGVPPAAASS ICSSVKQLYXT3KSTCSDY 
KTOISACYDAYKSTNI^YCRAR^^TRWIN^^/STPALTRSVPRARTEARCPEKTDOALA 
KyiOCNSRTLCDWSOVSAUJSVMOIIOSNPOAWEEIROKLTCAVTKPPOFXJYPYVOLS 
NOJSTOK F I AKLESLFAECS RTAAE I K ALS P ETNSLF X QOVLVN I CSL YSGYLQ 

a-njW 81*905 R18592 

CKLPN 7i.kD.1 Homohxi fCTWJI 

PAW"^JTLN lO*rKDTMKK0VY0WLA5VVLLALT IltCYAELI'L'iEOKVKiiHTYTTLDEVK 
WL.':KRf;FVETWKOOCVLR I ACDVRARWLYFREO lKlir::DKDKYNrLF\TmYK5EFYt,Y r 
■r/RAFJNWt^;;KW*/TA I AUIENTAAfiVO I NRAFLGYRFYMfpfcTRTDPFWE ICR^GLCO 

u'Escvgi^ttFttiLHiYV^Ei^KOYr^^ 
vFFwrsvvi%OTFVP::CT:?rriacAATN^ 

• JAFI .AKATKTH JF tKKNLAMK til rTLf Jf JLR r. AM*; MTVHYKYVEALSVrc ( DVS ! 
I< IRrlNI J>:fVKAOA t AANYPrKP.AU JFTWYW !F: lALYHYt ; ITU lUtFIWYttA YL-KPANDK 

iii';rjFTKi»KFiJii:ir::AF 

'.niji/m Hjinu, sivh.i 

mviN ltiiiHii.il Mitnht.uitf t't.^.Mfi 

: S >-K« > 3 M K :RKI jNW: :t am:: r FN it^ffj-i v;h f V : I FKR t anatyiy:aop [ vaaiv 



PCT/US99/26923 



LJ JPRTVPFLPK : U7;L : L^.. . . PMF BFLRACiSIXRAA FF FRR F. »R L I 1 :FTLLXE 

A«/Lwwurr/EEcr/SN : L'.tm : urnzt FLnm-NVNRAU.H - ow""^^^ 1 
w i Fpvr AARH30pflarrci^vAwr«p*rEvc rrwcsi^a&OSfCldWtoii 

APL3UC r IFCUCUD r CLARTVHETC PLYtKYJUCX YCLTTRCI " J L* V M VU.PA 
r2RCVOREWERCLXL?CKFVir.TMSV7<r t^ACU^lALPCVWVLYEICLrPOSAVYA: 
VRVUC YTA3 1 r FMALAPLVrVLTl AOROY AVPLF : J* IGTAiAN rV^LVLCRWVLKWS 
1 1 .T/ AT j TTAW/C LYryrfYYSSKRLPKY J KLLWEJ X RRSIKV>CTTMLAO<ITLCLSILT 
?TTYVfFLNFl.TPf AWPLTT.'T PTAO* I AF:^F?C t Ft AFimFAKLLRVEOL I NIASFCW 



CPn_U7J' ^il4J4 <ZU*0 

No roPwst nomoloq present :r. CeneoanH-OtSL as of 11/7/98 
VAIAXSWI PV 1 RlXr/PWX LKlERAJCrr^FUIKPFSPPPUCCtmJOISPYTSSt 
IT XCCSYFXUfXASLCSoTLRLRS X S X X S 

CPn_07J2 822092 822976 

n to -Endonuc lease XV 

NF>IKVLPPPSIPLLCyk>PrSTACCIJ<^IYECRDICAST^ 
DFWUkIJCOTl^YIKSHAGYLINP<*PDm 

A1JCSSKIIX3OOCIVS5FS0SAPUT3SSP PLWLLTTTAOOCTL IGSNTEEUTaVONU^ 

0IPXCVCVT7rCHIFAAGYI}rTSP0GWCOVU«FO£Y\^^ 

KAPLCECY ICKESFKFLKTDERTRXl PKYLETPGCPE>**0KEIOEiiKFSlOWt)S 

CPn.0733 923739 823101 

M4-S4 Ribotcaal Protein 
CXJCYIIAJlYOCPiaWVARRFX^XFCRSRNPTXXXPHPP^ 

LXACYCMXMZTOLVKAF7CEV1HKQCWAOMFLERFECRL0NMVV11HC X FAAQQLVA 
HCHX LVICRRVDRRSFFLRPCMO XSUCEKSWU^SVKDALESKDESSUSr I SLDKTGPX 
CEIiVSPEODGIEAOLPU»INISVVCEFLSHRT 

CPn.0734 823863 924915 

yceA 

OOTKDfPSSNWIIWNYFOOYVRWIMXaa^ 

OVSCRIYXSE0GINWFSCYEPh^ELY>«WUCEWWSXIKf^ 

KELAAIXCEVDLSKOAXHISPOEWHEXMEN^ 

REFPEYAEXIAQECPPETTPVMMYCTOIRCELYSPVU JJ^riLl V^ 

GTCTCWLGXLFVFOT^UJklPIDESDPr^APIAECCKOT 

CIHQ«QGCCCEirSOSPRVPJCFDSSRCNXPFRRAi<IXEISENSISASC^ 

CPn.0735 625680 625003 

•Uridine Kinase (Uridine Monophosphokinase) ipyrimidine 
Ribonucleotide Kinase) . 
Cra»<Lf0CJ«X XCXTCHGACXTTLTONXra 
LIW»POAPIKDLLXSDXKRXJQ^XVOAPVFC»VrXK3l5X^ 

FENQEUIDLWDI R I FVDTDADER I LRRMVRDVQEQGDSVDC I HSR YUSHVKPMHEJCFXE P 

CPn.0736 627731 825992 

ygeO-Efflux Protein 

RCEIJJUJVROCLVAFHTVSVKXKS FRALVTTHFI«TI INa^YKFXX>FTIX IAiMti 1 f > < A 

KXLSCVSTFFALPriXLAPLACSLAORFOKRN X ILATRFIEIIXTILCTrTTPIOfWaO 

YVVLXLMACHTTIFCPAKX^ILPEKI^SEOLSCANCSXKrAATYTCSXl^ 

HRIlHNSYVWPTUCVTVSIXSTLXSrcXRPSNV^^ 

YLTVSI FTCSFTXLItlAYTOt^IIPFVEFTIJClfPKKYCAYLFPrVAXjCVCT 

CJCDXKXGYVTLAAXGLALVFHCLYAFACSXI^VLFFII^AI^FLCCVY^ 

EJIKRCOIIAMWFIXFFGVLVAACVTRVU^NLC^ 

EWTOJXGIIUUttLGYYWIWSSSPKC^ 

OI^VIWIUMCJ^VCVPTVVSSVRI^EA^ 

KNVERFTSIWEOGrDLHPIOLVOKffiXXRVIYTLVTPKA 

CPn.0737 827469 830756 

"recC'Exodeoxyribonuclease V. Canna* 

ICRSAKLPASGASiaU0CIUKXIXTOERIFAFSrVRVU>SNRX^ 

SAI>©FTLTETV»tATKHCRASFStiSPRHUAOt^ 

WX1040LVHVtSDHXFKOTXPrA5OSXVXNLFUSSCCS0PflXPDrXTV 

KASKFTCREFt^PPTYETTWaJUVAFKOrHTFSORPTKNASK^ 

MFTTILMmTQEEIXSLKIFtlYAiiLPICHLAEFFINLSTYT^ 

RAXDlTV^LPDSPXICNAWEHYVUDROAI^ANlJUfKSOSSQNFrL 

HDS5LSVX0NS X LOLK PTS P0DFS0TKOT IC I YRALNI PREVQEVFCKVTEXXKRQVSPE 

EIFILSSHIESYICVHLNAIFNPHWXYFTDEVDPRAC 

OLLTHPOIOOPIDONJCVPYUIKJCLSSEWKISSKDRASCOWKAL^ 

RVSOVEVWETTVPLIYF IOER XNLYLSSSOHSYEOLFONVFSCLEKIFVLSPtrrSFITT 

UlNSLFPTFl^SCSLLFrrorCL^rLLHFHKPSPLYDKPGPYICSl^LSLrplCOYVFI 

U3ANKTTSSDiroLLNRTTTHEELAFSSTEDEENFHFL0 ILVSTKHELH XSYXS8JMOPN 

I^SPPIMHXKETULP\TTLPTQPYUAFFKNKACLHTS0EYNY51JU<A 

FI PTVKOVNLPOHLSLNEI IKCIFSPU)LFLICTNYNUXSYPOttJ«OOKlJ , PTKIIOIEO 

FVMECFVDKEKOL I F^ I S PHAEELFTYYR EKT t LLRMC LDKD PKHS PYTVTFSSS t PXER 

PYHESYUTn^LSF^PVOIHCTXHGVCNECLY^ 

EOKOLLERYVALAVLC^tSOHL^SDSALXKLTSnrTKENNHPPFSPPEQYUUCVX^^ 

SSOPIPU^PLCW!^CDEEKFH0AVl^rSEEAKNPSLPIFVCFHNRNIEEIL#«^^ 

ERLKXUSLFRGPCLW 

CPru«73B 8)07n 83)895 

*recB-ExodooxvrtPonuuLe'*se v. Beta" 

K FYLFSEVPVKPFNI FDSNSS IOGKFFLEASACTCXTFT I EO IVLRALXECSLTHVEHAL 
A I TFTtiASTNE LKVR t KCNLAQTLR EUCAVUICOPASLPT^LO XNCNVKO X YHQVRHALA 
TLTX)IOLrrXHCFO^lX0YTPrTRLIIIKNPALTHS0L*/U*HtTW^ 
OFHU^VRYNIT^KJC^L^KLLA^rTOPirr/Fr^RVDlLEOICLl^COIYNSiaXIP 
KQVFLDOLTAH X SCFKKOP F:» I L&&LHI I FVDLLYTS ETI (53LFS FFK I AETFVFKKRLAR 
YKPC AAFTVLENMrMVFJtTI J>X.tlUt>R I KKTXL7DL0EYLKQNYTPWLSPDESVFAI<CKL 
l^nnEAOPWVAI JIECVOI.VL r OFFOt/TDKOOWr, I FShLFISrKFTasXFLtCOPKOSIY 
W R JAOL PTY I.TAK» f .•TF;3EDKOL^LVIJNYR.*rrr , K LMEA IllQf PCKI «PFLEJ POYLPXEY 
I IAI #HO-*:;KTFENPPI IAI- 1 IIP KFYHT I F.D»JAl.>J I Fr.EALPLOKEOK I PUMfWLVSOSN 

•mfel: ^yat r pv;:p;ikhk:: i fi i;.tew i i.ttallea t uipenyek iuk ti ^.'ltlfolsl 
nhvrTKKKnFT r Yfv:-Uii:y r .:iur ;r.i 1 ATr/t'-/Mrfvar/Lf:y:rtt:ni . (fuemeklosy 
: jtt i :;:;YrY« vlu ilknp:;rp n+tzm a i ;:;r/::tiULETLK itt iin&Wi :i^yd rVFCPG 
i e»c;kknks:u:eu jjfji>va<twfvi.yu- t :.TOPPf:t.^a:yu.TNWKLEf7roj:aAYO 

I JV I III JI0niPDLFaY::UI« WK :i»A'n-/W<U-LLCTFAI*VTPPKT M'SPlt^KFLLOTHK 

ikX>:s t pyj :kli- i :»v.vu*t *'.arr: ii.iuk tr.Kr: WJis/juretiM:*rttwr t khtiiuo: 

KEKT tLKU-:KT»-T::n .TFft VTFr:r &j VI .1-flKf Fll PT" FLF L£M JUXX V [DLFFEHE 
«0(YYI lCWTP: FL»;KTNJDY::K:aHj: I Y I WjKrU/ft/U* r/VKAV\i*rUK>VFAmWtl. 
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' .-v I F I Ft : : LivacFFAucr.ED t pmknpka t'Jt /« 
•rr W« hyonrncticu: price in 

::r;LFKucYmjwKrrK :r.v : : i alc xlsfrs i pcevyox irssfvslhvkffpkiko 
ythwss sew/worn \r. i kkhs pvlscnvlvcl*/dyvc ekosr i rl i T3VQ<x pswamr 

-3 ronuwririCLREL t POVEO X3HAY t LEKOIfYEJClSCLOELIXILIOCECIXOAIXHCa 

v. Tirr;.' .. :„•:»" .kl: „•.„:..- ii.kfh: r?!?*;;4Jwr. 

CPnJTUO 836054 834864 

tyrB-Aromatic AA Aminotransferase 

;*MSFFNHIOTSPOAILGLCNVFFACXRPEXVNLVIGV^ 

LEEIONlKYLPISGLOIFIJEttRXLVrrMVDPSAXW 

CKWVPEOtWSNHIRirSOBCLCVIRYPYYSKBOKOUJEPLIAnjCEVE^ 

CHNPTOmrnsjKWKEiAiuaiRELiPfnTrAVQcrA^ 

AASSSXNFALYqERVC-^AVHSTFTCn.VKlHSn.E«II«CYSSPCRMCVEIVSTXLSN 
PYLKEEW3S ELNF I RCSU#KMRTRIVDALJlKVACHTTDr LLSOHCPPAYPSPSIUCQVLFL 
R EOHAVYTT AOCRKNLNC ITEKNI DHWOS P I0AYEL 

CPn.0741 838383 836185 

greA-Transcription Elongation Factor 

EY X FRLKTCS r/TYLEXLQV w Z EEGQSANFLSLWEEYCFNOWRCREL VEX LDCVKSSSL 
ASLFCXXVD7VVPLWEX I P£X3^DKDRVL0LILOL0TSNSOKPrDIATtYVNKKYSCEENF 
NEALItVVCLRICRDFOFSI^RFDFLMHMHKrOT 

CIMSAKDISFETAITCSL7PL3CDHFI,SRRFGDPDCFEAFAKQ*PIEVVEIIiRDlJ3PKTA 
KEIXDELVDLVIPEAr><NRV«QSAimCIKXrn7U^ 

^wI^SAEK:SLIYKFIRI^5EIJCNIEI?^LV>JU^DLDVXECNK5LILQRrf.U^E 

YLGIKI)ASIDlffiYITSXSEXOTSRLXJIMPIVALOXSFl£L^ 

TSPTWRDFVYKTIKNDPSSVF/UaOUX^ 

KEVLIU.FXX5AX^MYQVASn>HKELSiaUiiHY^ 

STKCWSSSDLrWflSLAEWOFTXJOWl^^ 

HVtKAKEIEDARSLCDLRZNSrnTAXXKRAJU^EEIRVLSEEX^^ 

CVCCXVTU0SOACEVVEYTXL3PWDAX}POSC:U 

X5RX0SXUEEHGA 

CPn.0742 938442 838888 

CT635 hypothetical procein 

TKMMVXVKN5X5A0K I IDS IKG I LT I YNI DFDPSFGSSLS 5DSDADYEYLXTKTQEXX0E 

LOKRAOEILTCTGKSKECMEVFAJOTDNFSPEOflALEJC^ 

OLHPTXESXRPKQKLSSTXXNKXIOIWXPL 

CPn.0743 838956 640362 

•norA-Ubiquinone Oxido reductase. Alpha" 

IFMKITVNRfXDLSLOCSPKESCFYNKIDPEFVSinL^ 

XAEYIWPIWYrrSHVSGVVTMRRGNKRSIXDVXIKlCTICW 

EXPXOWALIKORPFDIPMPTOTI>ROVriNXJtf)NRPPTPSPE^ 

FVVGVRAIAKLFGLRPHIVrRORLTUnOEXJn-IAWLKT^ 

NEKEV\TO^FO17^TICTLFU03RIU<EOVTAIJW^ 

LNPI SONtTTLISCDPLTGRLCKK LtXPf L GFRPHS XSVLHHPTKRELPSFIJlICrwrJTF 



CPnJ)744 841387 840389 

ha»B- Porphobilinogen Sync hast 

EMSSLTLSRRPRRNRCTAAIRDIXAETHLSI'KDLIAPTrVKY™ 

UJLLUCEIEPIXTYGIJ^VMLFPIIPDOIJCD^ 

ISDIALDPYTrHCKDCXPt^GEVI^DESVRXFCtnATU{AJE3«^ 

RSKIWSGYSXTSIKSYSVKYASCXYSPFRBA^ 

LDEEEGADIUr^PACLYXJJSaYRIRWrCLPLAAYOVSCEYAM ILSAFOQCWLDKETLP 
HESUAIKRACADKI XSYSAPFXLELUWEF 

CPnJ)745 941903 841742 

No robuic homo log present in Cenebank/EMBL as of 11/7/98 
VDSCFDttfRASSUKSTTYWAYDPKHTtJVYC 

CPn_0746 • 841939 84356? 

CT632 hypothetical protein 

FSGRCPPSFEVFMLGKE2vE?TCK0KOCLSHFVTNLTSDVFAiJ<N^ 

VLSl^RALLLXETLSNEEXXBTVCTEAYDFETin^ 

MENVSIlAAKVLEXfcRIOTSPLEJCSTRYVYFDOICW 

OFLFDTYSAL I PQVRAY F EXLY PXDS KT PASA YATS LRAKVLDC I RG LLPAATLTNLGF F 

CNSRFVfONLXKXLOWNIAEUUU^ESLTELMXVIPSPVSRAEPHHW 

KEOUCIAEOATFSEEMSSSPSVOLVYCDTOIYXVAACFIJPYSNRSLTDLIWCKKW 

HEDLVOIt^y/SARTORHKSPRCLECVEFCrDr^FCAYRDLORHmTOEROLUT 

HHGYNFPVELLDTPMEXSY R EAMERANETYNE I VOE FPEEAQYMVPMAYNI RWFFHVNAR 

ALOWICELRSOPC^hhTn-IATTSLVREWK^^ 

PTT 

CPn.0747 14 394 9 844053 

<.Tti31 hypothec ical procein 
RTCMGCKCAEVOILSSRSLSCMKILSjSLFYKKFC 

:PnJ>748 R44946 844121 

ispA-G*rjny L Tronstransf erase 

nrUTJUUJTYRPS t ESAI EKALBCFCPXCHP IRSP'/EYAUXXSKRLRPCLVOtlAOCL 
GLMIOVMOSALAVEFVTrrJTL lADDLPCMDNDOERPORPTVHKAFDEATAliASYALIPA 
AYSItLRLNAKKLKEQGCDPRE ID I AYNI ICO XTOtCi XGCSCVL0GOYDDMFF3NRG0EHV 
VSlMIKKTCWLFEtAC l:JCWLFfXCDrOFAPI XTZT^NNF^LLFQ I KDDF3DL0KDSQ0 X 
- 1\ MY ALLFCEKAALELLAR30NWLELLDRL£A»^LK.iJ33EFET J IsnUtfF 

•:ili_u74*» -4Sp1m HAVWu 

• ilnrfi MtP'*il*MK: lyi t«u>kir.pht»ry la«*» 

WrYKTYlAJi: I FTri-KDn.YI Et L-!KAllYTWmtI»LMLOMLiaftrVK:*; I HfTTVE.'WTLKN 
I EK I E I AE1 'AYVFJJ MY I V» SIT I UXA/TtMU UAYLKNVlTiX'Wi Wl 5HCTE t KNSYLC 
Ml tTKAAl IKAYt* *J.VI SSMHUW WRt.'ANPfttXCPn t YVn::T::t*nKK t DTHP.RK LCAF 
l * :tC< .VA l< kWVI Nit SOII II.PIITR IRP» JOVI 



tCtD/CpxR-HTH Tl inr. 
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Receiver 



• •in uv\n 



Oooun 

Krn)PILRlKSY»tPCP^WOKl *W5!S^>«^SS=»5? 
ESVAXFCCYLLLPEOIPSP^XFPEEOCrvtr ui rvCZAZTKVLJtOGftTCY£X»ft*rrmCVL 
OAVXRAPLROHEVLEHS ! PDTMTFCDXTFRVLNLV C ES PEGSVYLTPStAC X LKAii. IN R 
GKLCLRJWLUXIKOITKE I IARNVOVK I AiLRKKX^PY^SKnTXRCVCYiPSODCSIP 
LONKDNTAKPNEE 

■^ : r : : ! V> ^S^M 

HFRCILPC Ifti«PSAKVLYYUV PKEKSRSVW I EEEKtrftSSVliOiLPSO 

H0MLMXUP0CPI^K0CKPSOAEKIFSKVT=XA00CaTUXEEa^RXJN5PPljm 
VKETILCUJ^PNSPYYHl^lULVCYKOKLYRmEOl^Y^ 
CXXTt3FIiOYXSAHSUEOI04PPECRVXUOWXI«LrjCHECEW^^ 
LELVESK&AfiXYFOYYEMVLPYLICK XY X LEOCPYAZLLP EEEL\*SL tM£HVT ILPKDCY 
PL XQt£E)ft«KKYVHPt«SLVv?XL\TRFSTOCra tXTP 
EEXOJKXVOOXICTEXAJCOC/AUJiXIJPSISISEKIALSSCTW 
UJLI^XOSYOIDRCOLVHHLVYCWCDXJ^^ 

LFIKQAYXQAIUSSKAIARLLKIXJCFI SEAKX PSXVX StAEKANPtAOAEYVAKSffOKC 
YLY SWLTKVAP S POSYRLACX-CUtENXR Y DEALEFLCM l£ P KDS X KX7YXTOXA1APCOE 
HQSKSRAAS 

CPn.0?S2 848595 850082 

"recO-Exodeoxyribonuc lease V. Alpha" 

GUAtJrTEPAPFXXDLVH0^SPt^XAPA5KKXSS0FE£5FVFl^VS5AlMYGHPFlSL 
EENRIRPSl>0CISrT^YRGP>O4LPKEAROXLPWI/SCRLYlRSLYTIRSia^^ 
SATPWYTPPSIDSSXI^EEQWPXPNRrTQSCTSIVSOC PGTOK T f U kAflUULVKOOf K 
LRIAIVS PTGKATSH XRO XliOCnn FDOMVLMOTVHHPLOEYAYRRYNS XX7VIX.VDESSM 
VTFDIXYSLVOTtOCYEJCKKLYTSSI. I XLCXJTNOLPPIGICVTWPLCOLICYPraiTPF 
LTrSHRATrWVDOLTOSVLRC EMI SFS PL PS I SSA I rvTJ^TKFVKSI-ROStARLCVLTP 
MRItSIWWiiLNWIHORl^OPOLRI PXMVTW 

HEPIDSRAXJOYVYNY\rt*SVHKS0CSEYDEVIVI XPKMEVFXr/SXLnAITRAKYRVSV 
UGDPCTLHKXIKXSNY 

CPTV.0753 851009 850161 

No robust hooolog present in Cenabank/EMBL as of 11/7/98 
IiaTAKLGROALLHLRSWrPAIRASGNLFTO^ 

LBSgiYAHAAlfllCTECPLCAAIXa^AVACAMI^ ETDEnOC JU O MAD 
AEOOfTOKLORRSALTITXaCVARIASKTljCTATFlJiEHD^^^ 
VATaSLTESSISLYRXl^TRPCTISOPINRWCPSAITAARSICAXRNAriA^ 
CDAU^t^FIJ>AXX^^VLtHAXLCLISCVXNPVXX^AKXC 

CPh.0754 851381 851040 

rs20-S20 Rlbosoraal Protein 

QFIUJLKVLVT^GDIMAPKJCPNlONVI0RRPSAXXJiILTA0KR£L 
ITJkSUCLD0T0ATI^Nli3SVYSVVDKAV^ 

CPn.0755 851579 8S2799 

CT616 hypothetical protein 
YXDLFTMLLVRXWI>fTCFirYWI 

FFAJJIRRENOLJCTAAVOIXOTK IRKLTENNECLRO IRESLXEKOOESAOLO XOSQKXJOtt 
tJHX^LI,VKTKGtCOKIJrXJiKRTEZNRCLJ»C , /DSL IQE tbLXl tl WJ TL NRCAIT 
LAYOOALNDEYOATFS EORleO^KROX Y XCXLENXVODLMY E XRNLLOLXSD XAINXPS0 



EENlOO*FVYAROSORAV7ANAIJXn<rCYCAEDFLKFCS D IVI 9 
CSGRLVIXT1<SRGHIJ , FRYCIJ4AIJJXCPLCYHVLCVLY7 

CPIU0756 852889 854676 

rpoD-RNA Polymerase 5ivna«66 



TY«INEIlJMSPX3TPEOIDQVUIT.TaOIOVLNQXrATOOKER^ 
CT PtJDPVRMYXJCEKTTWIiTREEEVE IS KRX EKAOVOX ERI XLRFR YS AKEAIS XAXYL 
I5CXERFDK X X S EXEVEDKT^FLKIXPKLITLIJCEErnTLENT ■TT.^LXOPDLSKOEAAKL 
NDSLEKCRX RTO AYLRCFHCRHNVTEDFC EWT KAY DS FLKLEQO I NDUCVRAXRNKFAA 
AKIAAAXRK1«YKREVAAGRTL£EPKKI>VRHX>0RWMZ}KS0 EAKX£HVESNLRX»VXS XAXXY 
TNlttWFLDLIOECNHCIJIXAVEKFEYRRCYKFSTYATWWIRO^ 
KHIETINXVLRCAKKUe^ETOEPTPEEXAEEtCLTPCRVRE I YKI AOHPXSLOAEVCEC 
S ESSFCDFUEDT AVES PAEATCTTSMXJCX^XWEVLXTLTXJRER PVLXHRFCXiLXX^PXTlX 
EVCSATllVTTlERIROIEAICAIJUO^PIRSKOIJUaXr^^ 

CPn.0757 854709 855134 

folx-Dxhydroneopcerin Aldolase _ 

PC X WXALVX A I ERYQI* 1 1 SKPRMWLFLCCSVEERHP K0 PVL 1 5VT FSYNEVPSACL5DK 

LSDACC YLEVTS L I EE X ANTKP Y AL X EHLANELFDS L V I S FC OKAS K I OLEVCtCERPPVP 

NLLNPI KPTX5KELCPSPVLSA 

CPru0758 455104 856459 

folP/dhps-Oihydropeeroate Synthase 

RAMSEPRFVCI^L33NUINTlFKNWIARTLLCE0AVLCLP^SVILETEAlXi 
LPYFKSVLTC ETTLCLRELLVT I KOI EKWCRAEESPPWS PRT IDVDILLYCDESPCCW 
TEXT! PLSNLLSR PFL I AL I ASLC PYRRFCTQGS PYKNFTPCELAHHLPSPPGM XRRSLS 
POTKU«^/VVSnT^DC><SlXCMFLOPEKAVAOAEKLPTBCAA 

OOEKERLEPVLRLLRETWSNRKOYPI XSLOTFYPEX ILRAM0IYPI0WINOV9QC80SMA 
EVARDCELSLVMNH33GLPV-DPKN I LSF3VPICEQLLSWH EKOLItMFSDVCLNANOV I rD 
PC ICFCKCAAOSLATLYEX AKPKRLCCPI LICHSRKSPIJLFCNHOPKOROWETVCLSIL 
LOOOCVDYLRVHNVAAHOKALSVAACEACAP I 

C-Pn«0759 ?»S64I4 H«>*«07 

folA»Dihydrotot.ir.i* RofJu^ronH 

LLVICPV1IP3NFENPLr7yE>k.*KMP';VP'; TVACOPRCV ITLGCXLrvnriTEDtOFF^ETIQK 
FP lVHQRKTWETl.PPKYF\'r»RAr/VFr:ilF.KR^ IVH'IE rWVTt,*LFEFLLI.DL.-;:'PTFLIGC 
^ELYJLFLENOlVP.DFFl;:n tKrr/A^LO-FFrUXLETVTKTVr JlprTv'K rTTCYYEMWr. 
UNTKNtL'L 

i;piij)7t;o itv/k'M 
'..T».U liyrvtrniff. I'.-.it vn*f»xu 

RfiCPKI<CLEI PKP:y>VTNV ITT/KTI F f YPVUfJLY.'; I UESTASKIMM:: I W ITr.K tVH 
U: wyiWELF.KV::KLEL t KOb\LAYVr/F3(Y< : I YLTKKIS7 1 L i r::A» ; J lilCINVPi ^KVLY 

i*DFU.™rru;£*uwFYm.Fji'*;i r t::ri::irTTri^p(rrtr.u;u-WNt:Kn»i.YNYv»3(r 

X> T W ALKMTY .* :M IXDt J U '.AAAVl / ie il? CCTI' CA I I EEAPK tTOK^CTTlUtSttTLA 
I A HjCLLV ;PLLO;iMAWETr Al T" 
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7PnJ)7M u%l#W «8375 

:TT*7o wrmitic.a promm 

- r mt.tw t axnxo i edohmlkh efyorvseck lekcqlo ayakdyylh I kafpcylsalk 

ARC0OL0 IRR'J UENLHOEEAGNP** IDUTOFALSU^SEEElA*mEFSQAAODMVATT 

RRUcwrct^JLCALYTYEioxpcvc/^^ 

SCKEKLOrr^ REN POAVLCGS0EVU7TUOTUSF XNSTEPCSCX 



tELLYTSR rSCKONYSLCNHETICRS I YMNl*Pt)RXKAL£AAVAY IEJCQFGAG3 IKSLCRHS 
ATKElSTirTGALSLDLALC IKGVPRCRVX nFCPESSGRTTtATH IVANAOIOCCVAAY 
IDADULOFSYASLXGWX0OLMXS0FDCGE&ALSX 

2 rrjrm rc CVKVCL0AR>t(50ALRKLTATLSRSQTCAVF XNQXREX ICVSTCN PCTTTOC 

RALicrrsstiujxiutzcsiia^tMSoxcanuKVKVAKNi^ 
scxi^vEYNXiEKwswriiYOfxic^^ 

TPSVMANtTPOEVPAOTVCA 

CPn.0763 860520 859972 

yafA-Forflryltetrahydrojolate Cycloligase 

HFPmDP«EKSALRJCLriSI!WO«EIWMEA^ 

CHCEANRXLX0XCTIJUJ'X:W!2*YTVLIP$XD0LIS^ 

LVPGLAFCCOCYRIaTYCHGFYDRWUW^ 

YLC 

CPn_0?€4 861819 860524 

CT64 8 hypothetical protein 

CYKSKDIKKI^LJl^SSLXAMSPIYC R lX3D YEKL TXTCDnXKWCLSrriCSKEKI^ 

YTPCVDnJVPOPYOKVKRWTOIXRCrWVSCLTAYHTMWX^ 

GNIUOAZVIGGIAI&tfPSAESGWLFlXTrTFAYKDKI^ 

ECRxjjcAmjjpOTHEiYAnraOTiOAiYCinrAvirniA^ 



KSCLLTIYYTTOXMATEZYIWDLUICEYFRPGDIWPY^ 
KIPYQCCXPLLN 

CPn_0765 862415 861801 

CT647 hypothetical protein 

TTIYIKLLGRUOaOtflSXLIL^FULJ^ILFVXArrt 

DHErXJVXKHNARISXDRl©^XESLNASCro^ 

XRALEJCSNHOLVWKXQMHllDirArimi^ 

KMTKOTTPLGNEVWLTHAZAISRWI 

CPn_0766 863785 862394 

CT646 hypothetical protein 

AMinCLmKXGLTKM2OTXXUXLOICrCI0CWXVCHCroXP^^ 

SLQGSDILVKSSSSSLXNRJWILlCVALnaXASLAIJ^^ 

WIAOKNTtJCKn^n^AOIFPDKI^CWUVDIFTlATO 

FVKNHAI AVARS FSNHSTrXSCDDIHA™ 

Wt^VVCOSWrYCVEBEXTOIYCOTIAAAiMCA^ 

SlIJGKY3UJlATVVV5l£SVXJCIJCSt£^ 

KXSASNYFLLPT XPTS EQTLKFLLALGKSSPS IKFSYF5YTMTSYPSKCNP8LPYSALVC 
VKCQO0PEDX POnJOCXSSHPI^OHVSESLZDQRSrXXJOrrLSS 

CPn.0767 863878 864177 

CT645 hypothetical protein 
NXMI£YtlJlTA2!WSFUlAYXFA5WV?D0QSARWY0^ 
IDPSPrVCLLCLSXLPFVXLRVLRFX ILNIFHSPWLLQYL 

CPn.0768 864144 865163 

yohl/nir 3 -predicted oxidoreductase 

YFSFSMAAP I FIKN ILlJlSSIVYAPIJCrSDYPYRCMSALYOPCl^FCWVICVSXSI LYAP 
ERTSKmYNENMRP IGAQLCCSN PETSCEAAXILEGLGFDLXDLN UX mUKmCDCSG 
SGLLKTPELIGRILLKriNS\ttIPVrVKIRSGWIKEH 

TRAOGYHG P5K0EY X SRAKAAAGKEPPVFGNGDXFS PEAAOAKLTTCCOCVLVARCTLCA 

PWIGKQXOOYLTOSYEXrPFIKRXAAFl^HKRLVEI^ 

AKVRTLRS S LAXAT r/OEVYOLVNDV EEADDSSLETFVKC 

CPn_0769 867763 86S121 

:spAONA Topoisoraerase I -Fused to SWX Domain 

3 IQCPHAIRLKXKSU XVESPAKIKTLOKLLGSEFVFASS ICHIVDLPAKEFCIDVDHDF 

EPQYOVn^KOeVXfmXRlOAAKCElCVYLSPOPORESEAXAmXANOLPOSPLXORVSFN 

AITKNA'/TEALJ<HPRT IDKALVNAQOAHRI.LDR r\TTYKISPIL£RXLOCRSG ISACRVQS 

VALKLWDREKAIOAFVPVEYWNl^VL*CDPXTTICrFWAHLYA PECXTEN 

[AnXINSEDCARHYAELLEKSSYTtTRVEAKAXRRFAPPPFZTSTLOOEASRHFRFSASR 

TMS I AQTLY ECVDLDS EDSTGL ITYHRTDSVRVUPEALTTVREYIQQTFGXEYLPEXANV 

YTTXIOfTODAHeAIRPTDINLTPDIUJ^^ 

3rrTOTEIDLRA9GSUJCFXGFLAVYE£X0D0^^ 

AFTK PLPRFTEASLVK ELEXSG XCRPSTYAT XMNK IOSREYTTKENORLRPTELCKI ISO 

Fl^TtlFPRZMDXCFTAUfEOELELIAIMICKPWKLLLOEFVfrrFLFWXTAEKEAVXPRXL 

TN I EC SKCHKGK LVK IWSKNS YFYCCSEY P6COYRTS EEELAFNKEOYAEOTPVDS PCP L 

CGCVMKVRHCRYCTFUCCEKYPECRCTISIHKICCEEIEOEEPlPCPAIGCNCKrFKKRSR 

YMK IFYSCSEYPECSVtCNS XOAVITKYSCTEKIPYKKKTPTKKKSSAICTTKAAKTP5KK 

CKAKSSWSSEWnGPLFLPSPDLAIUIXCNEPVSRGE^ 

L7PCNNIAT t tCPKPXOMFOLSXHLSOKLTXVSNOESSASS 

.PnJ)770 868122 rfri*131 

.T',42 hypothetical protein 

K PRTRNVEKLEFVTCLSSPDDDL ITFNKQCLI ACPEEEKVAFLVRSNAHLOACPETPASF 
i-ElXREOFG I FPFJYVEVLYWEGLDVVEACCTW I Ufl rEVT I0U* KHHRKASP.WU34YSRD 
fcVlwNIIEAVIIAVRMKFHEPVFEEVUVYOTSRWr^RP^ 

::i.WYPACtUMLVU WTI.MRLCMAOSYLYRAMKK t PKMLGVPPLWVLLRLTOKEIKMFA 
V KM PVLnr/APKRKLENVKWK^ IYQSYFV 

Injj77| «7»'ill rSf»-ii44 

i r*':M:;m^tt:::AUjMi\wjK^^ idnp 

VVtAXTA JCLKHC :p 'VPfPN: rrTGYLNOTIX JrVEJLYTRLLPO I E EAFJTAEERF t AHQ I 
•V 311 UBW :( J-l J'NI KDFAOELELPLEK I IIKVWTT tWILSPBG 1 A2PSU3SYWMKLLRNSS 

if.VAY: : i vprrvrucim xfap ihkkf:;u;l::elrh ilkkau^ i pwttpaaactvkpmvs 
ti . c -itjrtm y.vr i w:rrRrji4»:» iki^iketfiip^qilpkeeoknlsOOIL^axwlik 



NLRJCRE7TLUJVKE7U.PK'. ^KlPAPYPUIIKCl^CwFlttTTtFPAIENKAVA 

apw r fpox^pw: iTicD^HSif ekvujw t rcv r atcstplcttv t sor :taac i pr ar 

RTVAJCYRAOUttfA^R»t^«8CJ^ " ~" ~" 

CPh.0772 ~ 972400 *-Qi6* 

KWulrXXSEii^ 

VTFTWAARXUCEP ^T 1 ^ ASTNEFDVPWVCTFNStiCVF I LR RS I KUJ« ENNFT XY30S 

NVTAVUDPCOSI Y JWR'JAN iKfi I LWFENO YH4AKVU.LEEW ka YOU LMAANALXKMiA 

SMwrLASVWPCWXIUJtfiSTWEEAD^ 

RTPtDAiJJllUlXPYEIXGCUPrinxroOXLAn^ 

TTrFALTOYAIAOCLPIUCACQOAUrrKDW-SIOCOO^ 

DFItSVVWITCTUXUttPAPTnCDlUC^^ 

SDtm/iLT ADRVNlJfrLHNCKC LEFR'/SFLVCLtEC PHAN SLC3CTYEKI ttEARLCYV 
GITRAOOLLYLTAAOVMLICTVR»aPSRFUtEI PKOYKIQVR 

CPn_0773 872485 873195 

uno-Uracil oka Clycoeylase 

F1O<ATXD0WSUCECLPtOniB3lJC£EWSICPYK30tXXFU(0 
AIJtSTPFDQVRWXLCOOPYPGXCOAiCt^FSVPGC»Ri. 
CCt^SUAMOCXl^JfrVLTVRMEPTSKACXCWEX^ 
RXKCCLLFNSnOMVt^PHPSPlAAKROTGCSKFSXXNY 

CPn.0774 871183 873425 

CT606.1 hypothetical protein 

IXAIWECIHSVCFTJKTPRLTAJCSWSMEKIiTTOOU'SA 
AEHREX RCCLEOSLRTtVPSE 

CPh.0775 874040 873414 

ygov faaily 

EJWXIVIASSHCYXXRETKmJCRLCDFDI FSLSOFPOYXLPOEOCOSITANALTKCIH 
AAWIiXWlADimilJWAI^UCPl^ANFACV^ 
AYfXOCmVSPICEirinTCICECYXWEXCSSGFCYOPIFVXYT^^ 
NQVSKRAKALOXXAPHt^LFEXHXXTRD 

CPh.0776 874180 875487 

CT605 hypothetical protein 
rcrvtJ<KrnxxL*FFUFLsrrKKWFYsrvixsc^ 

CXOtXXMALISKSAAINSRCOIJAliSVFYSRKHDCTyZILC^^ 

RyPIOJtSVSLYCMCEVPlCEVAStCtVP^riM^ 

VU»PKP»CCTIVlXrPU^PTrSCSlAIPYCY^ 

l#OSMrTt}EltnJWMPTSPCWDP0S PFTTAATCXl^CAl^VAS XCVCYTLPP1CVtjOAPI#< 

OCSXVADilinKlu^PSVLFX^FrrTPFrCICy^ 

VUCALYPXOVEOTIJCSIERIPARilSSICNUX^ 

FH0LRS5CLLSEYAES 

CPn.0777 875586 877178 

grott-2-heat ehock protein-60 

TSEX3RVVWWCS0FIX2^AXJ0XCVKAX.T1CAVrPAP0PRCY 

JUgtlLQiaArESIflVKLARXAIAICVVgTKXDG 

MXLC^VSIVYOOLOROAZEXAlSPiaWfVAW 

SIOSGXSICrRGLeaQWKSGYUOTVTRFE^ 

LXS8QNTMPLVX IAEDFIOWIJm.1 LNKlJWCLPVCANnCAICSR 

ATLICOESEfCEIPVStXVTi^VKOVMITKETFTFt^OOCDAEI ZQARXQCLCXAXARST 

?F f rrT?TT l T" r ^ TPT " ;T pry^TTmfyrrrrrnwranFnT r<;xi .BiTKJummRTVPmnv 

AFLAAAHAX 8VPWLSS OfTfCf ETL WWPXJCVtJWNCCRSSCEWHritSHPIPW' 

CYNCaCrinFE32LVDACXCOTLXVTT5SlJCCAVS^ 

CPIU077B 877400 878092 

tia/ahpC-Thio-specif ic Antioxidant (TSA) Peroxidase 
APVAOSDRVPGYEPGG0RFE5SLVRWXRVEZEVFM^ 
VSI^imiKYVVIJITPimFTYVCPTtXHAFODAi^ 

AITOnCOOGXECmPLLSDEBRVISRSYHVLXPEEELSFROVFLIDKOC I IRHLWWDLP 
LCRSIEEIIRTLI^IFFETNCLVCPANWHECERAMAPNEECL^ 

CPn_0779 878502 878095 

CT602 hypothetical protein 

RrDLIFOKXrrVAI^EAEXCSYtTTAYFCRSLVOUiNYUIDVSS 
VYFRVREEmrCVOSTFFCWFUnOTTLKNI IAICLPCVGN0HI XEASRSLCOKHNSLLL 
FFOHDLYDLLTFNOPF 

CPn_0780 879241 978591 

papQ/a»iB-N-AcetyLmuramoyl-L*Ala Anidase 

HC^IA^SLRJWAKL5FFIL1^LLFSCIDCSPXHAACR5PSUX^AEIEDISAXI^ 
HEVEXVKLS ERLOEODSKCOKWTAAKPETLAOKI RELESCOKAUAKTLAVLTTSVKDLOT 
NLOSKLOEXOKOHIULAODLRLVRRSLLALVDSSSPCAY ADFSDPVPEN I Y I VRECOSLS 
KtAKKYKLSVTELKKlNKLOSDAIYAGORLCLOP^XO 

CPtl.0781 879851 879199 

pal •Peptidoglycjn -Associated Lipoprotein 

ONCYRSRRKTVPlX&rFPSATDKEJmCllHSt^^ 

TCHHTRRXKPSSFCT^TLYT£EDFKPNFTFGE*/T3SKEEK'vYKSSOVAAFRNITFATDSYT 
[ KCEENLA I LTN LVMYKKKNPKATLY X ECKTOEP.CAAS'/'I tLALQARRANA I K EH LRXQG I 
SAORLSTISYCKEHru^SOWELAWOOflRRTEFKIHAR 

CPf 1.0792 88 1 077 87977^ 

r » lb -p»lyiMcch.u-u1« t ransporter 

«;ni'*fU<ou:FovpFFiTA^LVYAEELr/'/vR.':E«iTLP lEvjaxTrDTKni'K iokyu:sl 

TT% lP;KOIAt<iID(riOrTiVV»KE5S5»PLA ISLRLHVPOt.3"/* /LLy^tKTPOTlJl?"hT t:'«ON 
UWWW 1 1 III AADTM I\* ALTT: I PC t O VJK IVFALSSLCrDOKUCOriFXWTTnYUf :kn( ap 
f.TTf»'r:u: tTrKWVW»*4»IFPYLYV2YF.YCVPF t FLGSLBlTErMKVWIJtt WUiMlT^S 
IWKTU VAirrA^irm J-MO(*^iLTrr;PK»PPRU^EllFn7^nKP:iFNrWT:7^.VF r.T 

NKi/aM>i>iM::i4MTivAiRiXTKKYPN:;ncPAwnPD r jrF.lAn;:wrw^iifri^)i^ 

:3l :W/^l.rr::!-TNKR:t-: MA IrtWIH.VFnAtWAEESELYL 13LVTKKTW tAH JVtKKUK 
(•.* *F «AKI«A*f I KMTI , 

•'ttl.'l/MI tfMfMMH HM||0'/ 

iTViH I iy| i. mi inftr.Mii 

tNMXYLI'V I A ITAl'l Ik*. ; I UA.VFA: :f l.rKKR(/«/PKAP0EF.LVT IQVKPi VITI- IVWIfl* 
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AJCTIRP: V.VTOPOKOAKCSPPODJVOKALOKP 1 pv ..TEPPRPSPAPTVAKITTTATtXP 
PPfTTKKim)l^}«^L0TWEVAOALSU{V^ 

0 EDEUTELFRT! 1 I ALPSKT/VR I KLVLJPNGE lOECSrLSCVSAAOJCQLLTOR XOALPFQ 
KPLEKYKVSKN t"FH I KLVTWES 

CPnJ>794 982359 991972 

TxbO-Biopolytsrr Transport Protein 

ORADr: T r-T«-<PrJ?f P7^KIi«»fYPfT£EtEEEPL«*lrt.TPL:0 ivfvilmaftvavplif. 

n u ijnift r;:H'TV.rr;r:fc"i ; «v«\ :*■•! ikujva;. 4 .:: 

CPn.0795 993039 992296 

exbB/tolO-polysaccharide transporter 

DH LYF ETLSVNKDFYSMVW FSKNP I IOAYTEADPFCKS IFFCLLI LSVCTWTVLKOW-M 
QWfTt^KAGKSUCDFLXIQ^HAPlJLDIHPEIJPFADL^ 

PII^SEDIOSLETUfiAIMPKYKAti>tl«SF:PATT:SlAPri£U£TACI LVAPTHIS 
SGSSG^IKBCIJlTALCn , riCLITArPSLIArWlJCAHSSCLtSErCOTAYlXLIISX£ 
VKYRNTML 

CPrv.0766 983137 895293 

dsbO/xprA-Thio: disulfide Intercnange Protein 

OTrWjGIKITASKGSHIWttlPGEIGSPLXISWOLPI^^ 
GYEBSALIVAIVPJ^ECYTPCOEVEIJIAOVEWLACCDSCLPG^ 
PDTHAErTXTUiAOPRVLENDHSVaVAOCKC^IIWXSiaCINAT^^ 
AXTSYSCG7CTAWRIJC\na4lJGV0XI^^ 

AGLSQYITI LIMAFL/jCVLUJIHPCVLPLVTLKVYG L IKSAGEHRSSVIAICUffTlfiW 
GCFMCaJCVAFlUCVLCHMlClCrOWEPKFVATLI rVFTLrALSSUXFEHOTHTANLC 
GKLQSSEMKSSNNKAVSAFFNG tlATLVTTPCTCPFLSSVLCLVMSLSFlfl QLLirTA IG 
LGHASPYLVFSVFPKMLSVLPKPGGWMgTFKQ LTCFMU.VTVTWLVWI FCSETSTTSVW 
LLCX3MIASUSAWXU:RM0TPVSPKKORVCASIXFTAr 
SVNEKLWPFSLEJOJWUUkOGRPVFVNFTAK^ 
TLEAWTRiaJPCITEELKRLSRASVPSYWrPGDKSA^ 

CPn_0787 BBSS04 996401 

yabO/ycfH-PHP Buperfamily c urease /pyrimidinase) hydrolase 
. TRP£PVDLAOAHVHl£DOAFE£BINSVL0RA0OSG^ 
K IPJCHVGYJTPPOBVIOTIEEIJHOTHAAAJ^^ 
QRYtAL5UXEZ^VVHCRGAFNDFFRMLXX9YYtOJD^ 
GWPIS IS31VTFTOAQOUU>LWELPLEHU.I£TO^ 
VANVXGMF PQ ELAALAYKNVLRFLMG 

CPn_07BB 886521 997432 

sdhC -Succinate Dehydrogenase 

SLVJCSLRHSRXE I CPEVSHKKCKYYST FI FRC I KSLAG I AFTFTLCEKLFTNHLASSYFS 

QQCGFVAKVNGFHKIPGUCIIEVAGLV^ 

AWIYSYTWRWAWlLLTCIAFHVVHIAFIRYPVHVDIHGTrY^ 

RCFLTLNLPNTZASS X CVSRKOLCCAIJAAXXSERHSYlXITSAfiTAFLyVVRDAlCSLFX 

AUmLVXAAAFHGFTCUOTCCRMSWSLR*^^ 

YSVA 

CPIU07B9 897436 899316 

■dhA-Succinate Dehydrogenase 

QMDQTOCVXVVGQGLACX^AAKOUkNLSX XVELVSLTKVKRSHSVCAOC3SXNAALNLKPE 
EEDS PYVHAYCT IKCCDFLADOPPVLEMCLAAPRI XKKLIllFOCPFNRCPSSKLOVRRf'G 
GTLYHfflVrCGAST^LKYTXDEQVIUu^^ X ILM 

NlJT^LEIUlGQAVriAraPGVIFKMSTOSTFCTGAA 
TAX rcRDJOJ&ISESVRGEOGRVWVPOTSSKRr^Pra 

GKLVSnrMZAKM LRVCEAGLG I XRMEAYU^THLP EKTRHKLEVVLD IYKXFTCEDPN 
TVPKRI F PAVHYSHGGAWVCWPAADD PDPJ5SRFRQKTNI PGCFNCGESDFQYHGANRLGA 



HEEjtAKXMVRNV 

PTUUJrTKSSTCKXELTNIPONIRLPI 

CPn_0790 999279 B90103 

sdhB-Succinate Dehydrogenase 

NSRXFLXXSVYPYPJCREIOIENLETFXUCIYRGVPCKQYWESFELPLKPCOJVISALMEXE 

KRPVNILGEKVNPVWEXCLEEVCCSCSrLVN^ROACTAi.I0EYIDA 

L7KFPLlRELrVDRSIMrom.ERIQCWVAADIEGETrc 



CPKXLPLTESXSAVCREI5KFSLRSLFSALFKXXX 

CPn_079l 993104 990111 

CT590 hypo c Met ical protein 

TClASSttttWEDISDRNHYSCYSKG ISHNYLLH PMSRLDIFVFDSLI ANODQNLLEEI F 

CSEDTVLFKAYRTTALOSPLAAKNl/UARKV^ 

U3PHlWNEAGiraEHUJQlU<ALKENPKI^ 

7 X HVROAALTALFTYLRCDVCSC FATAPA IL IHOEYPERFLKOUfflLISSGXLSR IVNQR 
EIAVPIfO^arrCaFKPLRILDLYPWLVTCLSSSPGUCKJ^SAA^Lirri^DSEJ^ICO 
IXSHOYLWKL0NVHETLTANOI XKSTLLHYYQLQESTVRAX FFKECLFSKEQVAFSTQH 
PP.ELSEIORVYKYLKAYEEAKSAF I K DTONPLLKAWEYT1ATLADASOPT X SNH I RLALC 
WKSEDPHSLVSLVTHFVEEEVENIRI LVQQCEGTYHEARSQL£Y IEGRMRNPLNNOOSO X 
LTMDHMRFP^ELNKALYEWDSAOEKAKKFLHLPEFLLSFYTKOIPLYFRSSYOAFXOEFA 
HL'/ANAPACFR I LFTHCRTHPNTWSP IYS INEFIRFUSEFFTSTESELLCKKAV XNLEKE 
73 RLVHH ITAMLHTOVFCEAI*LTR I LEAYOLPVPPS ILNKL00L50TPWVYVSOCTVDTL 
l^D7FE!lCEI^TLTEKHPEXPHELAAPYADAlJCDLfTGIKSYlXEESK^LJ^SSP7KVFS 
1 1 AGS P LFT EAWDND^^-TWLRCVWVKOHODFLOD7 1 LPQLS I YAF I ENFCNXYALQHV 
•/?iDFHDFC=WSLTLPEL\T»CCSRFLSSLPTKlKTVALtYlRRI^YLMVREVPYVSEOOL 

f • evldnvt:.'; yu; i miR ityekfrsl t eet x pkmtllssadlah i y kcllmogyok xytee 

t/r/LRLTTAMRI1MNLAYrAFXXFAir.NOT«^^ 

M 1 9ELFAT::HtVri .YANP I DYT WPPPPCY RCR LPKEFF 

•:ifiJI7'i^ H«*4\»5S H-JMOH 

\Tvr» f*-/jrf»rru»t u:ai protein 

»>HHMNIK':iniMKirTFT1JR\^FTPFLVtPIPlAUIIlfyvn^ 

Tf EFtTKLTI IIKLFL0RtANTlALKSYA5P^AEPYA0AYNl>«AU;wrDrJU:LtDP 
.••T/r^mTrri^:i)PF I P YUCWPEMXKKLSAA\A-;KAFt.LT r r* :k pu JIYL I LVEDVASWDS 

■rrT:XLLy::wpMj;KWKDLrosu/tTKeNXCL^ 

l '/FQAR^KiA I E I EKAiV t UiCENL tTVS INKKRYlTJLVfXK tPl^TYTUJLVPVSDCI 
VryVLKVPUl teFFYVl.AFU*WWt F3K I NTKLNK PLORLTF* *M KAAWRi M1NVRFEPQPY 



oyefnelui i FNTTLiitiii- ac :cYinr:E*urE^ : : -M-trcrirrPia- 
S.^ISrS^ 

VL-LOKISKXTTAJJSrDrrrtf^VVA^^^^ 

RLPLETHOALOPCDRLTCLTTCEt) t OCTrSCLr lEECLKTPUlPLN I LtL.tSLTmLNN 
E7EHSAIX77L7ILJFTT 

CPn 07?) e-MWH «9H*» 

rOfU-einm.1 re»niir-»r/ t.wnilv nr*rt^in-PP:c cnospneciie lRtb« 

*!v;'lV' - ; -«- - "* Ar.'T.'.lJ-SiViFVA 

WTLTO lVPU^l^Lf3CVLWJ)AJ It ETfNV ^NEMOKVFTCZYNEXSLXKVFWCD 
KIWASSXPEMLCIWf«KlDrPKN7PFLAA^SPKN0EVFTr^OA»^^ 

LrrTF^ACUJtcu^tacoraTVKrAiLSKYWiJCASDPAXjaimn f piki wmyv 

FLNDOPCPXMEXXPL«PlJXGENrfSFKXICX?rEIVCCIEJWSIDXAVW 
FAPLWRRARMYTAYFFCXLLCSLXAFIVARRL5l*P ZPJCATAHX ESPJOmCLYTDOSLS 



LAlU^XPArTNOTOFFtVFVVCIXSXARLFLXVAfASCXCVNACCYS 

AFMP<mAX£aPEVM«rrsict^PKPcsLFVLYwrrcAi<w^^ 

LTCX^AAXiA VKP1J<L5VKTFVCK5K0KXS} XTLL I LXVLE5 
CPn.0794 997123 999004 

No robust ho mo log present in Genebank/EKBt. as of 11/7799 
KSSXHF.SF1JLJCJCSGGNC7VSLTO I FMI PSCESFADSLXDLNL 

CLDPSVECLSCBCA/SVCTfFTKACSTPVEYOPFKYTVSI^^ 

YixrriTUCTCSUSACicyNcyucwsAxxrrLTi^TC 

SG0PKAVY>*ASCAT^VT0LADISa:SRSSYAYWSCDC7::VCSKICT 
NVPTYTjCTIjOCXJASTCLY I SCIXnvrVCAAN7A7VTNCN0 ESHAYKYXCNQKICD 

CPrc079S 99BO0B B99 195 

No robust homolog present m Cenebenk/EMBL as of 11/7/99 

GTLQaWSSATXVSS&GSVrAroAOTAXXSVKA^ 

CKVllCltSOXACCSWHAACHTDFSSNNVlJmZKnr^^ 

ASOHZrrEPta^IALCACLYVNA^ONXJ'SNXVU^ 

VPtWrNVStMKEJNHroXCMW^ 

BWAAOIECRYtatSXflCHVirvOPFLCaX^ 

CICSMXALVDSXJIVGTRMGHXONFAAHTDRFSCS IAS I<3IFVFEJCtJ7VTHTRAFAEMRVN 
YELPYLOSLNU LRVNOOPLQCVMCFS SDLRYALCF 

CPIX.0796 999290 901340 

No robust homolog present in Genebank/EMBL as of 11/7/98 

selyssylopcl»csivrnsau»ux:lsrsetfxkvtish)ocfxk^ 

ll7ai pcsfahtl.vdi aceprhaa0a7cvscdcx iv ickkvpoopfaitvctoy tdoklq 

PLJ^VPJ^SVYPNCI7PtXnVTVCTt*YAICMC^AVTC>^ 

VSA0GXVXOGNRNIiajGASVAV»CTCVXT0LPSl^DAN^ XV9IMVDV 

SMOTAVOWXCX»X^XCTL0STTSVASAXSTXXnVXVa»ENAO^ 

ICTXOSmi^VSSXXMVIVCVSTNSEHRYHAWADC^ 

IXaCVIVCPJtfArpsCDHHAFLCPFOAPSPAPVHGCSTW^^ 

COURUXOHSAXVESVSSWSFTSVKSAXSXOSPAVONSVQm 

P^OFKLGETVVtX}PFmXQVLHLSlutaYSEXNWP^ 

PKMSTAATLCVERDLNSH IIJEFJDCSVSA>CNFVIJWSTVSVLR PF ASLANYYI3VROO0LV 
TLSVVMNQQPLTCTLSLVS0S8YNLSF 

CPn.0797 901352 902694 

No robust homolog present in Cenebank/EKBL as of 11/7/98 

VLXWWINVLTIQXIJIMSiaCIIC^HLTirTLFRCV^^ 

EDWKCY7T7Dm^KE^EAHA VSGNG 5R I V^ASGACQG S\TAV rWES^IKKUTTLa 
VCVSATARCEOTO#0VGVIWE^ 

V AVKWNKKAVYS LGTICCSVASAEAI S ANCKV I VCWSTTWC ETKAFMHXDETfOtDUTTL 

GGGFSVATWSAXCRAIVCFSAVKTCEIHAFYYAECEMEOLTTXflCEEARV^ 

XXGSXKTQACAERAYLFMXHX 

CPn_0798 902810 903856 

No robust homolog present in CenebsnJc/EXBL es of 11/7/98 

WFEI IFVVTlVPMKKTCCONYRSICVVFSVVI^VLTTOTLFACHFIDICrrK 
SCDCRWVCYECCNAFKYVDCEKFXXEGLVPRSEALVrKASYDCSVIIC ISOOOPSCRAV 
KWVNCALVDLC I FSEGH0SFAD3VSSDCKTIVCCLYSDC7TETNFAVXWDETCMVVLPNLP 
EORHSC AWDASEDCSV I VCDAMC S EE I AXA VYWKDC I PGAXR S SAKAVSKDOS 

FIVCEFISEEWEVHAFVYHNCVXKOICTLOTYSVATC^SRICKVI 
KYV0CR><IDLCTl/XiSASFAFtr/SDDCX7IVGKFE7ELQECHAFrYL 

CPn_0799 905001 903940 

No robust homolog present in Cenebank/EKBL as of 11/7/98 
KREOt«AAIKOIUlSMLSOSSU*f/LFSLYSl^ 

7TLSRLSNKKASAKAVSC7CATTVCF I Kt7TWSR7YAVRWKYV<nXELP7SSWVKIlSKA7C 
ISSDCSI tACIVENELSOSFAVmWWEKYLLPSTWAWSKATOISStCSVIVCSAKD^ 
S RTTAVXWTG H EACVL PVCWAVKSVANSVS ANG5 1 XVCS7QDASC X LYAVKWECNT XTHL 
C7LOCY 3A I AKA VSNNCKV IVCRSETYYG EVH AFC H KhH'/HS DLCTUCCS YS AAKCVS AT 
CK V I VCMSTT ANCKLH AFK YV'CCRH IVLC EYSWKELACAI 1AVS I DC E I XVCVQSE 

CPnJWOO 906550 90524'/ 

eno-Enolase 

R K E I K X MFEAV I AO IC ARE I LDSRGY PTUA/K*/TT!7ni7'/^EARVPSnASTCKKEAl«EFR 
0TDSPRY0CKOTU?AVltt>M(EILFPLVKGCSV'/E0SLXLJLH>1OSOr:SPNXCTl^^ 
'TVJLATAIIAAAATLJlRPLYRYLOCCFACSLPCPtM^Itl^lMHAONnLEFY^EFMIRPIGA 
J>i> I KCAVNMwU7S/FHTLKKLLHERCl^T f 7VCCE'7>F AHILASNEEALELLLLAIEXACFT 
PGKO ( LAUDi.'AA:;: "•rYNVKTCTYCX^RHY EEC I A I UUIL'.'OnY P I [)'.: I EXXTLAEEOYXCW 
ALLTE'/UiEKVO rVi n'IM.FVTNPELX LECISTm^NJ.VI. IY. PNC t' rn.TFTVYAIKLAQM 
AtTYTTI r:^IIR:VaTT1^lACLAVAFriACOtFT^l^K;FJ>VAK^NI01<ElFXEl^ 

':»Yt_'imi| 'NIH/O'i 'M»«".7i7 

•ivill KHltliirl. m:'..< Al** I'MtJtttiii II 

I I KTMTtVLI lAI'KAIW U\H*VA I APLTiAl WR>K//K.*H)V! .LTrm » :n>T I AlfWANVTII. 
(•nA/|«AIINKTLA,VI AOKKWKKFMttlAVEYPl ^TfWY»>I'EAYIAI»::i/PY I J'JC.'IU.INDE 

i uki J'u ;at k: : 1 1 jzm i »n . i v. :: ,y.7: i vr. i a-; henytsmalvi :r kv i t» 1 1 1 .taolvk 
wiYOA:;rtryH::AFMw<» nr pi*AYR:»XAiJ<LlTUR/rLT::tir/::hii.TNi ii(ewp 
::ati.yf<::!iiyv i r'EAiH>x«Aiirr tOKt*3 >cfj<ma f fodp(' iekth i Fiurrtnr#i kmc ttv.tr: 
wry 1 1 KHVtwiiPii xavi iwivn ;i .i 4*YPPkDF(.f.( t otrnr/TLi v t PAMVMf u/^ snKo::i . 
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vrr:rni.f::An»«pt.TrECAOKYf i»r/tYV3AT. .-cvqessch vr^oi irptcipop 

Nf'EtRPATfXATWLLEEtRLPI^KHCKILVtSlTKKlAtBMACFXJEI^IPAATLHSCI 
ETAERTC I 1TWLRGGV t DVL t CVtflXRECLOLPEVSLVA ILDADKECF LR3TSSLI0FCG 
RAARNtNT.KVt FYADOKTRS lEETLRETERIWOrCLCVOTEWIXVXTCPl IKAIFANPILC 
TSKDSE3 PK EG7R PL5K EDLESO X KKYEALWQRAAK EFRFNEAAJCYRIJAHQACKEQLLYi. 
F 

: : .' r:\ r.t • : •. ii.v ' * .Wr.t ir-r 

: \i *."wkk!'"." . - : m? run/ a wv« : r: xtjEL^ssnTtZ' :" tf : : ;.r *j r: . rrr 

TRKEEVLir/T^IYEVlADWLjVCIDPTKSI IYLQSAIPEIYELHLLF3HLISINRV>CI 
PSLKDKAANAS XEECSLSYCL ICYP I LOS ADI LI^Wn^VCKDNEAHVtLTRDIAW 
OTLYCCWPEPEVLXELTSLVCIDGCG10*SXSA1«AJY^ 
IRATTPCRVECNPLFXYHDIFNPHKDEVEtTKARYRCtt^ 
KERJUen^KPLAXA7mO£tXmaOOieVAKVTHEE^ 

CPn_0B03 910304 909752 

CT5B4 hypothetical p roc tin 

FMAAJCTKTLELEX3WT LlilCNIJCRIFATPICYTTFRrrCfA/VnC^ 
LIICKLTOELAPO0K0AAHSLXAEFMMPIRVAJ©XHIJ^^ 

R LARVDCO EFLLMTTT/OJn'CKL I RK LLARLLEAOKN PVC EKNLCX I Q£EI TSLXNHFDEL 
TKAW 

CPn_0804 911074 910310 

ap6D-CHLTR Pl&UBld Paralog 

CXFSSHaOJCTlxeSRFlQCKrPTmEAXARiawra 

LQHYNFREOIEEPDLT0LCTL£ATVKQIHHGSVLIJ{GERIT^^ 

LTYGNRQT PYOTLVYYELFTLLPEPLJCXEMEXMPROAVYTLASRQGPQEXKEEX XRNYRG 

□yCSniDRIRXEFTLVZTOCRJCTSPVXOAIA^TKGSOILTICTSLSSDEOI ILEKLIK 

KLEXVKSNLFPDTKV 

CPn.0805 911646 911067 

ninD-chroBOscn* partitioning ATPaee— CKLTR p La said protein GPSO 

GYASRHKT I AVNSFXCCTAKTSTITJilllAAXAOYHOARVLLI OTAOANLTSCLCLDPtX: 

YDSIAWLOGEJOMOEVXRPXOTOLDLXPAfllVLDlIET^^ 

VODICYDYVI XOTPPSLCWLTCSALIAJUWA^ 

TIUT/ALSFWNCTIGKWSAFAELIHICITP^ 

RAS EDYFNLTK ELLX IXRDI 

CPn.0806 913816 911867 

thrS-Threonyl tRNA Synthetase 
NAMNESPPNMEAVOraXOVTCDQIOHEVlJSTTA^ 
THLNEStm.mTSESPECIUnFlirTSAHLLAOAVLIUimA 

L^XSECTFTLXESTVKOXVOEKIAXSRrTYCDKOOAX^FPO^FinTXIRZXJDiQXS 
AYSOG EFTDLCftlPHLPST AHVKAFKVLRTSAAYWRCC P SRESLVR lYCTSFPTSJOELRA 
HUmEEAKXJffiHRVIJSAUSLFSCC^SPGMPFTO 
EILTPOLKNISLWEVSCHWIWYICAIWYTO 

PUWAEVOfVHROEASGAI^LiMRVRAFHODDAHVFLT P EOVEEETLKI LQLVSTLYCTF 

GLEyHLEX^TRPSOTXCTOSI^ELKTIUaiQULVO^ 

DAIORTWOCCKOUKFLPERTEIiYTrAOCTlCSVPVHUD^^ 

RFPUWPECVRIXTVAI^IPRAKEXXEAWjatf^^ 

YMITLGDHEINOWLAVTmU3NRVINI^^ 

CPJU0807 913950 914879 

CT5B0 hypo that ical pro tain 

TLQT(2J{MSLJXV7LTAF:WSSSFALSia.VWASAPI FATGARMVXACAXLALAAWFRGG 
FVGISKKIFLYIVIXAXTCFYLTlJIFEriGLOSL^SSKTCFIYCLSPEiiSALFSYIOLXE 
KVTLWCVl^LGLVSYICYLTFOa^DSOPWT^ICU> 

I EKOSTLSVTAXNAYAKLX AGMLSXKHSAWEPWRP LPVODX SQFLYATLALWXSHLXC 
YNLYAKLLRXYSSTFLS FC^WLYSCrfC^lXC EKCVSl^VlJkVAFWAOCRl.rYH 
EEFROGYXVS 

CPn_0808 916396 914956 

CT579 hypothec ical protein 

UCJ0-P5WAIJCSLXW«PCSAEPSLAHIKPI IFKCACIAKTSCVSCSSSOOPTLAAOLAOSS 
C<CAa4AOSGHDT?gATKQGAOAEVAAOgFEDLIODASAQSTCIOCEATSS mCSSKB PCSE 
KSGKSXSSTSVASASETATAQAVOC PKGLRQNNYPSPSLPT P EAQT INC IVLXJCOCTLA 
LUJLVMTLKANAAGESVKASFtJSONOAIRSOVESA^ 

L I SG IVNI VGFTVSVGAGI FSAAXGATSAIJCSASFAXETCASAAGGAASKALTSASSSVO 
OTKAST AXAATT AAS S AGS AAT KAAANLT DOHAAAASKKASDGASKASGGIf GEVUOCPN 
WS EKVSRGMNWrrOCARVASFAGNAI^SSMC*^ 

RLAGOAEAOAEVUCOMSSVYGQQAGOAGQWEOA^ F 

N 

CPn.0809 917794 916307 

CT578 hypotnecical procein 
CT*IMSISSSi«PONQKNIKSO\^TSTPC<^^ 

I AGASCKDKTSSTTKTETAPCOGVAAGKESSESCKACAinWSGAAATTASNTATK IAMQ 
TS Z EEASK5KEGTI.es t^SL^AAOKKEVEAVWAAL^GKSSGSAXZXTPEX^KPGVrp^ 
EV I E IC LALAKA I CTLCEATKS ALSNY ASTOAOADOTNKLCLEXOA IK I OKEREEYODOC 
AAEOKSKDLECTKDTVWTVM I AVSVA ITV IS X VAA X FTCCAGLAGLAACAAVCAAAAGGA 
AGAAAATTVATC rTWOAWOAVKOAV ITAVROAI TAA IKAAVKSG I KAF I KTLVKA IAXA 
ISKGISXVFAKGTWIAKNFPKLSKVISSLTSICWVTVWCV^ 

O0NVAOFOKEVCKLOAAADM I SMFTQFW0OASK IASK0TGESNEMT0KATKLGA0I LKAY 
AAI5GAIAGAHKTNNF 

CPnJMtQ 918193 917925 

i.T577 hy pot net ical protein 
c:ElNHKKI*KT*IG\\WKAAFVKRVrEE^>EAAlTO 

Ny HE I tAALJ»7Tt.EGUIl.EECTCLPn:A0ErANFAKELiGWIIi:UKNLTrAmK0MVK 
C«AE 

cpu_ohii >is20j» 
li:cll<tiM Nt*itprNtt;«« rnir.i*ut h 

I t»WlF 1 1 / • I M: X* IK I HiNAMJPyK P:5 /V.>7 IK kTft ^ f< LAkXA/VjK K AK A Dt /L WjVI I I*V PT 

kee t kpai/yi i m n ^wcLuuyj iloi^oylleeiytvaytfy: v ikynea'^lfolua 

Aypgrr/F Wl * Jl - WVI IUI JILYNEAAFUPFLAFPAOPONP I PPYY I AP-.UXKLOOr-EEGN 
MFI J1VTMI II « 1 »INI 'KFK f I.KCRt.\) tKKO:'. I EKCHA»;ETK KAITKKIW IKjtFTTTNKICJRK 
VH 
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EiETSc^xrxwww^J^^ 

AG ZSWE IOS3 XB LUC U VR TVnCCtn V3CEPCAR0LCTTV I VWSLPVWPV«»GP<0KSl< 
O^RGCIR^IOmiLSTAM 

OKEAfCVR I VCVLS3 PSFHRPTROGOK t f INDRP t EC ^^J 1 *)^^^ 
VUCLYLPSSrarNVHPOKIEARILXEEU^^ 

SVPLWFP^ET3t?^EEESVEF0CNl»FAYKEEVnLEItCEY ? .WCPKSWWIYSSDVR 

v — :.-r->: rrrvvv* 5 ;: f*. 1 *: -'''V ^.ttcta 

v in - „ . • - r — .•: - ... v. ^ -.t ^ tycr 

AIJOCETLTCATrJICHOH\TWW^^^:KFfc>CU>lA^ 

CPn.0813 *20B43 «1934 

oeoP*A«inop«pcidai« P 
t^il>ocim^dr:uworai^ewciu:lvi^ 

rVYRMDKDLYSH IQRVTLTrT-TCI^'AlXSl.Y'^^RY^XIGri^A^TVYKICrACKOVlP 

QJrtPLECrrEKXRSXICSEEEIRRHOEAAAXfiSAGYSTVLTliPJCX 

AEAQAEGPSFPP X XAFCEHSAFPKS X PTPRPLKKCC IV. I D IC\*LLNCYCtOKrWTALG 

TPHPlOIXSYPVVVEAOKJWMAi£KIXMJCOXO^ 

HXHTYPCSPRGSOVKLESGKriTVETCVYFPGXCCXRXEiniiCXXJlCNKWFSL^ 

LVCL 

CP»uOBU 921996 923357 

CT814.1 hypothetical procain 

FFlSrXLSTOXFNLPLTiroLl£XGYSrVSFXAI4h»GOrSPOT 

GCXMrPOA£SlijC<»CAilIFTOUERLP0UI(EXXF^ 

CYXAAXJCERVY^miSSWa^SFSPKCWEX^XE^ 

XSXP1U3CETtJlJ^ANXX^CVnUACF1(Vl^FPVKC^ 

TKIRVDFVSSOeENYSItYX^VCXJVlXWDClCWQTCC^ 

DLMWOGTOROT XSLVKGVPSP X EINEVIRZrETreMRSWSXPXVLVflXORLILSPDDWV 
LRTAJCGWDCLSRADO I CWVTGIO.TGPLLVFEXliXI>t^GFVI^flHMFNAORTl.Vrr IS L 
PLKQGFI7AVAS0EVS5HI7lSAAAi{P<MT1<XGCS 

CPfV.0815 923361 925622 

o»pD/pUO-Cen. Secretion Procain O 
MVTPIOISXXKLVAX^GKLCCSS^ALTXAaQtASX^KSGRCA^ 
OUXXAXEAIUCLMSGTESEJaJAOXXIWIGCV^^ 

WNHPETTTYNLVTOYCTEDS XYLX POEICAIKIATT^KT^KESFXDCLTOILSRLOIC 

VRCWSWXXnY»OOWnXSVAGWSSWCDLEAl^ 

XNPeTrKVDVXJ^VWXTGSACEVCmJCXYNrVOSESXRQ 

TVFWYNVKKSOPOEX>AlXSX7VHZ»rP^GQ<^^ 
GSVrrGKTIAXglCTGTLIMWEXEVLPRXQKIXtaaiT^ 
GUIlXIttCEEVCTaCGCSPSAfSWAGGTOXl^^ 
MASPSWT»OrWXAVVDat«AW 

XTLLTUlif UITGKNHDORPDVTRRN ITNXVR I AXXIETVI XGGLRCKOKSSSKXSX PTLO 
DIPGIGiarGKSSTSDSLTDIFVFrrPKILtNPVB — — 
ASCAAAJUUOaCXXJIVPASGVSLSQVEIlOEYtSGC 

CPIU0B16 92S6O0 927X02 

gapE-Gan. Secretion Protein E 
RG»mtAASIWOEXXDILPYrTtJOO*CIXPXEESSEAITXAi^ 



DIAIHRlJ^DCRIKIHireOEVmRVSTVWirc 
EXX^nrrTAPECXtXVTGPTGSCmTLYSVUEXJCCPLTMXIC 
AVKPRXGLTFARCXJWLLRODPDILMVGEIRDOETAEX AIOAALTGKLVVTrumOAXS , 
AX PRIXI1C X ESYUS ATLVGWAORLVRT I CPYCKVAYT PE^Q EXS FLAS LCXSTPtFL 
YRGQGCVHCFIlSGYlCGRQGXYEnJlPNTLFRSEVASroro 
AXJkVSGTTTLAEVLRVTKRCD 

CPTC0817 927106 928287 

gspF-Cen. Secretion Protein F 

GG RMPRYTtYTYLDPKERJUCRGYIXALiH IOEMEXLACDJIOVLDXREVALXRMSIKSTEL 
IVFTK QT I LX4ilSGX>P LYESLVSI4ttX3WEOKMSLXXTSFMET^ 
FDHFYCSGVAACESVCNLECCLONX I WLEERAO ITTtKMVCALSYPCVLLVFS FAVKLFF 
UjGVIPSlJCETFDWEVKGLTXrVFCVSIXIJAYRYlJWFASAI. 
. ILEK£XFAI,PGTia<rWKVAVNOTSVASAXLXGGG^ 

IVOAVIGCGSI^OEl^RSWPKljaGMIAi^EESCOLfcDVLGW 
TSWCOPVXLXFLGCLXGV IMLAXLZ PLTSNIOTL 

CPn.0818 928158 928682 

predicted OHP (leader (16) peptide! 

CYTKNVCFt*WVST*DSDFSWWPDRCDHVCN I DPTKKOYPWX IICCWCWCWWOMUCO 
S ITLIEMMW ITL IC I IGGALAFNMRGSl HKGKVFOSEONCAKVYDI LKMEYATOGSSLK 
EI IAHKETWEEA-5WC KKJRKLIJCDAVCEDL I VQLNDKG DDLV I FSKRVQSSNKK 

CPn.0819 929117 928956 

CT5o8 hypothetical protein 

ASLYC/CLFLIWEKFHNMIGKANFHLKI ITTOFLTO IYIVTI RDP I AYPLTG IC 

CPn.0820 929042 729659 

CT567 hypotnericdt protein 

0ESLPCRCCCCTFPR5ETS.1 1 RTEMPHCNS I AMKKOKRGFVLMELLMSFTLX ALLL0TLO 
FWYRK I YTVOKOKER I YNF*^ I EESRAYKOLRTLFr^CLGGGYEEPCSLFSL I FORCVYRO 
PKL^GAVRASUIHCTKDORLELRICNIKD0SYFETORtl^H^HWUF0RNPDPEIG>PE 
T I ALT TTPEP KAY PFRTLTYCPAVCK 

cm.0821 'iswiT viom;h 

irrwt jtu>KftejrK i fti ai?lt;;i .v::i.vapt>aa/iarkrcacaot r etv:ehfk.': i krsaca 

R I EYtttKWIUtiAt tR i:;KIJK/SKVTTr.Ot AKVATKKKORYRLLUVrFKRn WIGRYNLYA 

i^!KPr-EirY::nTA:!WYA i v iRLLRr<Ar/OTfaf/i'f*^rM iamal i:wkoei lergaolg 

HA' I KTt.TU -mWL i PYKMU9 tH'Mil^lXjLIXVU Hf EEK^LGItt.'KLNL I FMDH JXEAVL 
mill »AYftCT: "IJ.|<I\: I WKAVKrwIA I 'Jii |T?j/JVU.El.FKT«TDFia.EI.I<ltKKOLLL^fc Y 

nj.i-UiiKKM»'7iYTf«u:Ara«Yij-*i.VMi/rKAt::p>. , r«'r::F::iK(. 

MIM.-fi mi •••'i 

kfi. 1 1 vlI;.t i FN I :: it :irw«*m rmYJHEXwran -frkvkouijjni'kvipwkkfl 
;niRAf >:a i . > 1 1 ,vi .v. : 1 1 aw i : ma* i *.v \ h "rr/i t a-nivei rkmij;ni a«:y;: r amw i k 
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na lie tl i liwlnxpsfavsf tvtrv [lsfitt. .stcsichcdkhootshkps 

CPnJ>R2l 711501 
v8CT/3prff«YntfT Tran location T 

fyalovrfskt:; incnxelhs x slpelfsnlgsayloy i fqhfpayvwsvft i r urtlp 

tFAVAPFLGAKLFrePlKXCICLGWIAIIFPr/tAITOI^^ 

VICFVLATPFY AAOSACSFITNOTXOGLfiCATSL XSXEOT3 PHCI L YHYFVT 1 1 FWLVC 

shr rvxsu.LOTr.Kvi pihsftpao«slsap iwfmiKMCOLCLVKr xqlsapaalakl 

oi. m \rvv-.v: r ::-ucwrr r:,v;TLAWF*;r,T ««. 

..i-'.IV.-. 

CPn_0824 932677 93337? 

yscS/fliO-YopS/f 110 Translocation Protein 

IRTRAVLAF FATSFKSVLFCYSYQSLU.XLXVSAPPX tLASIVGXMVAIFCAATOIOEOT 
FAFAVKLWI FCTtiilSQQrfLSNHI LRFACQ X FQNFYKWK 

CPnJ>825 933618 932677 

yscR-Yop Tram location R 

ERiitvrriMRsirRrsi^rrLsvsccrAnASLVOiscpSTOPTPPPSN 

VAASSVPSYWPLKAOOW^HLSDCSrSDTYPDITTOAXXLIFLA^ 
KIHTLVUJWALCWOQTPPSOVUCIA^ 

IPAFXlCOXIINAmCVtXYLPmXOLVTANVLVAIOeOILSPLSXSLPUaXXJV^ 
-GkOLLQCLKXSrX 

CPn.0826 934382 933612 

yscL-Yop Translocation L 
HDNXRSO/PSSEVNOPORYYAIVWC^ 

KTIUU»EAWA£lXQXCAOXROEAJCOQCFXSSSESWSXO tAFLEEETXNLRtRVREALVP 
lAXAS\nUCXXCX£LSXHPrrXVSXXS0JUJOXT0NXHXXI5VNTO 
VEYAOSLILTAKPOVTPOCCI XETMIXNAOLIJVOUmEKArSTIIJCAKNPVDEPSn' 
SSSTBSSSLSNDQOm 

CPnJ)827 935273 934434 

CTS60 hypothetical protain 

GCXVTANTFGTXDILHXHSXEISJLSRFLPra^VK 

WITIAHKEFPPEIQGOLLAWLPEPLVCEILPLLPCISIAPKRCAPF^^ 

PCeiTEXXFXPASSANAXLYYTCPVnCIALIICIiSLYSIAXELKHILDKV^ 

TEXX^TYCOSHPMXMLETTWLSSVITDAELROnmTO 

RW-DVCRAYrVEOTLrrvnfDHPyVDY^^ 

CPIU0828 936292 93S267 

yseJ-Yop Trans location J 

IXRYAWIMVTU^ I SFCtJFUfrLLCCTSCNSRSI.IVHa rWLLVSKGVAAQX 
LPOAAAATACAATSQMWDX AVPSAO ITEAIAX LtCAC&PRMXCTSIXDLF AKQCLVPSEL 
0EXIRY0EC^E0>iASTIR»OCVVDASV0ISfTrENEiaiLPLT 
I WSK XKM.XJ^VPCLVPENVSWSl)RAAYSDrrXWBPMCLT ILAXSS 
IJXFRIJEFra 

CPnJ)829 936729 937298 

No robust homo loo prasant in Genebank/EKBL as of 1 1/7/98 

KYXCI^rPTLAJCSTYXNXRDSRFYSWLCFXKXETYYIWPlJSNYt^ 

FFLANAJO^VPAGYIWVROT 

CIPVVSTLrcXOTFCAVWCVEGFSGSTPTJXIYHm^ 

PVWrLFKCYS 

CPn.0830 937339 937959 

No robust homolog prasant in Genebenk/EttBL *s of 11/7/98 
D9CSniKFT\fleA07TP0VFSKWVYKYKSSRi:iIAX^ 

G RVIIJC tYONEEEFFRATERfPSXGACYLRVRWCMSVTJTFEDLMLVC P5VPKDFPLSAF 

KVTTIO.IYWSVXXSIPVVGAFFFSIGMTAI^ 

EKFILRXXFTLLTLPFWLXSCLKS8AA 

CPn.0831 938249 938434 

No robust homo log prasant in Genebank/EXBL as of 11/7/98 
NKRKNhrVLIRKSESEWFEATONYPTIOOCYOLVR XREHNl^VRAHFDLSLSLOASVHP 
AA 

CPn.0832 939750 .938827 

LipA-Lipoate Synthetase 

VKXCBPTLNTCOPRVIUCXXPERFPXWLQRPLPOGSAFKA™^ 

RAECWSRKTATYIAIX?DVCTR5CjFCNXGHSKTP P ALDPTEP ER X ALSAKELGLKHWXT 
MVARDDLEBGGAOCLVDI IOKLREELPOATTEVLASDFOGNVSALHTLLOSC ITXYNHNV 
EWARl^PLVlWIWTYARSMmLEOAANYLPDLKXJtSCIMVGLCEMECE^ 
GVRIVTX(XYLRPSRXKLQVKSYVrPETFWYRR\^EAMGl#FWAGPFV^ 
SVODKASA 

CPn.0833 UU7 1 9 39747 

IpdA-Lipoamide Dehydrogenase 

RGVLPEILITVSEWrQEFDCWXGACPSCWAAITAAOSKLRTALIEEDOAOGTCLNRC 
C t PSXALI AGANWSH X KHAE0 FC t KVDCYT I CAT AMAKRKNTWOC I RQCLECL I RSNK 
[TVLXOTCSLVSSTEVKVtCOOTTX XKANHI ILATG3EPRPPPGVPFSSRILSSTGILEL 
EVLPKKLAI IOCCVICCEFASLFHTLCVEITVIEA1XHILAVWKEVSOTVTNKFTKOGI 
R I LTKA5 ISAI EES0N0VR ITVNOOVE EFDYVLVA Z TRQ FVT AS ICLCNAGVIRDDRGVI 
PVnETHRTNSWIYAIGOITGXWLLAHVASK^ 

El AMVCL5L0EAEO0NLPAKLTXFPFKA IGXAVALCA3DCFAA IVSHE ITOOXLGAYVIC 
PI^SLtGEKrMXRNELTL^IYETVHANPTI^EVWAECALLATNHPLHFPF'KS 

•TnJIHM MI544 '142014 

•Ti'ii. Jiy|«ain-r. ifMj ninr.ftn 

k 1 1 wr^AKCTEMoRTTWKi :tr...'v:;Hnvr>.>« •PYCc-AFL^DPfVAryit 'ps/jch i-fpegask 

KFAKCrLFAV^ECWFAVU a^NPTOPniKOVl PEWTV/LO^WPLAALFU; IOLLAFAFL X L 

i >:rru:> :i.vli*i*nrayky. : r t< iAAVAYftOYRKij-L 

i.'Iii.IIKI'i 'M'ie'lH 'ML'IM* 

mmi I :M(/:aiP i.unily holw.'.WO 

UN IMVr ,KAl-\ I FROPAMUm^KHRKETWriP'ET^rr C H I HiKRArEUYWUrTLKLOD I 0 
1*1. TFA.'* !; i TO :» rLIUXTAYPAVYDAI; ;i JirUILKH'»::KVMVF:;ilFFLCn I PUJAO 

• iJ-wm .c;riitTLTi ttri.-:KEVFyowi4rr o i A^EEPTvrTNKTi-'i .k:;a l*/ rtax k fffl 

NKWIAKITl'^VSKf-SIFjU^DW^ 
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(TNVTVCAEEAXVNTTUF /KDRENHPim»U^VEYVAJrniEMrrCPKA[ALPIVA 
t PUJtfJICFICPOLLSUXYOSLCYR LR YD I RL LRDAS FSFS AY LVTPnCLCtCSLT YPNYC 
YSPTKCCiQVVCaXSPXOXiriVKSSOlEtmJfER^^ 

F£CVliFHYDVCDPS3TEIRT'7TW| rt INULf FLEXXNITLPIOa^TVtPOTJIPAFTVXN 
OAALRRLPNFFSSPPNLKDIXI CVHROSRCXCLOLKP XLVCLCESRCWLPGVFtYMDXC 
FS LI PTPLCCUTFLPRVI P P ENVPQ FLTO Y AO M ER I LFPNPQTR PPESYELVIOS IHRPW 
PASPLHLOX^XJCTNLCSVP IC I AI^LKSKHTFLFTOAOFL3LM0NL^FLX0IXST0Kr 
VrAENT\'IANnT^KLOW-APLJ^nTCT[ANPEWFFS0LKAACLPPrPONXJ^StKC 

uw -a j>*wf:.".'?o ••■ »^i:^\ll: , :'.t ,wrvr^r:v, 

•rv :*!r.viai;;^ai: ;'Tv:-*:- ; .vv • : rr: ?r:jr~r:r.yTx-K:A?-r: 

VFOElHMAWaSJOIHK I LCR ^ UA^KK JJ LTJTP T ENNLLEFKCLLD 1 1 LPKYLPSDALF 
KKXJTKRCSSEIIXElIPSOCLlXKLTIlPFILWmCKiVU»E 

KLYMATLOREXSHIOKLETPEEPATNFLH X FALLWHLKO XCDHP AVTTKXJPCCTKMYESS 

IWNAFVKIiJCESLNAGYICVWrSO 

TTOPlC0Vr^SLLAAOTINLTACNWr»m3» 

LITC3TLgERIKYL XEXXIRLLPKVX ASOOSN XLKHLNRXPLLTIL&T KDt HCT lDm S 
PVDAPVEDOTCVLPPEDS 

CPn.0836 946960 945722 

bmO-Amino Acid (Branched) Transport 

|Q«XNASKXTNCXXSLSIWICraiFWFGAGN 

VCVTUjCLVSHLITSCWJCFFrSICRiraiFrrAIILLICPTO 

LSEHKSAFIPSLPIFSAICCVLIYIFSC XLS RLrOWLCSVFFPXKLVTLLWIIRSFMIP 

THPMVOEFXPNAJtOAIO^FXEGFYnWLXJUUTFCSX^XSIJ^ 

SFOSXSIOWinSIJ^FFX^AILLOHTYLCnUAAM^ 

PraiIJ«VSVFIACLTTEXALVCIVADFLAJlWSFXia2IYASAVICTL^ 

TXSHLU.PLL0LSYPALIVIAC^XAYK2>INFRYSPV^ 

CPn.0837 947777 947145 

nth -Enodnuc lease XXX 

LTOCOFILRTt/IAXJPNFXPStXGHSSPFQLLXAILL^^ 
OSXLX&PPOaYOLXAPCCLCERXSAYXYOLSOXL^FHCEPnCNAlXTC 
ASVrLOXAYCXPTTPVmirALAORWKISEKKSPSAAEiaSL^ 
YARWPAIJiHXirtCPICSYIJgCXANSTRT 

CPn.0838 949196 947781 

thdF-Thiophene/Furan Oxidation Protein 

XSXJfXYFNSFIOJVXJOXXLSESSFNFSXFMXJOCDTXM^ 

VXAX«XFSC5VASFA5KTXKLC0VXFECrUXX3AUiX)fR5PR5FTGE 

ACSO XLTXALXALCARPALPGEFSORAFIMCXI DLVQAEAX ONL ZVASJ I OATR IAOTftrO 

GNFSKKI OEIHTLI I EA1AFLEVLAOFPE EEO POLLVPO EKJONALH XVEDF XSaTlXOO 

RlJVXTTSLILACJCPNVaCSSLLNAliCOgVRAIVTOIP^ 

ACORTTtOTXEWXEJULSAKEEAXSXLWVlOATOPtfDXJ^ 

P PgM I SL FQrjaSRJCTCBSLTOVKQa^ 

CXJOMQONLY^PPEXXALEXJUCAJUlSXGKX^iaC^^ 

CPn.0839 949230 950159 

psdD-Phosphac idyl serine Decarboxylase 

FVXV5Ria.VqXWIDHIT10OWIEPXFYncIMLFl.YNS1^^ 
GWX^01CSWRROXRPF>O4RYKXSE)O^TXWADFTSFNOFFTIUX£J(P 
TPVtXSRYLVYPN^EFOtCFXVKSXAFSLPKXXCIM ELTKLYAHGS XVFAXLAPftTYIOIFN 

GSXVQTFSPNQTYAXQDEXGFFAFGSSTVXLX^PHAXRFGND^ ITRCUOQ 
SLSRSQXZEX 

CPIX.0840 950141 951544 

CT700 hypothetical protein 
ISEJWNLXTXIHTrCIAKRIJKSQiaffilMWLVILWAI^ 

QyiREVRLSMELirFWAf JkFQCjIXPXLIC)qiSYRR0CLFgYMRXLJUO40ajrgptKT LAEAK 

KLGLRSPYFTLEIAYKAYRFOAfXECAQAF ASVPQDLFEEEDAAJCYASALVWrm HA AT 

SLIEPWISPLSH0ETTVT7t3HIYFTSIOlYKDAIDrYNRANAI>CVCPVEVT^ 

SSYAKACKLFRKIXSKFVYKEEALFNICL^EOKLCRPCICAIi 

AAltAAMDOJUTirVXAEPairEXJ^LnCSTFAKD 

FP BCLT ACKAXAWLCC^JYATIXCSEECLHYAKXAVEIJ!^^ 

AYEXOSFU5IU7rSLOEX0RJtS0XLRILIUUaJLNt}HHXVCVI^ 

CPn.0841 951719 954640 

secA'Translocase SeCA 

IKIWKLCFXJCRFFCSSOERIIJCXFOKLVDKVNIYDEIC.TPW 

ESLDSHXJ»EAYCVVKWCIUU-ACT PVEVSCYHORWDMVPYDVOX LCAXAMHXCFITDtQT 
CECKTLTAVMPL YIJ^TCKPVHLrrVNDYLAQ RDC IWJSVUtWLCLTTCVLVSCTrLLE 
KRKKXYOCOVVYCTASEFSFDYLRX3NSIATRLEEOVCRCYYFAX IDEVDSILIDEARTPL 
I ISCPCEKHNPVYFELKEKVAS LVOfX^KElXSR X AXXAARCLDSFLIA/DXLPKIXirVLfC 
ISEFCRSLWLVS KCNPLNRVURVREHPOLRAMIOKWIVYYHAEONKEESLXRUELYX X 
VTJEMNNDFELTDKCNOOWVEYAOCSTEEFVta^DMCHEYALIENDErLSPADK INXKIAlI" 
EEOTLP.KARAKC LROLLRAOLLHEROVDY XVROOOXVX XDEHTGRPOPCRAFSECLHOA I 
EAXEKVTXRXESOTLAT\mjONFFIU^YEXlJOm3TAXTESP.EFKEXY^ 
CLRXDHtlDEFYHTEREKYHAIVNEXATIHCKCNPXLVCTESVEVSEKLSRILRONRIEHT 
VLI4AXtlHAOEA£I XACACXLCAVTVATNMACRGTDXK^ON 

IDPjOLTCRCARLCOPCAAXFFLSFEDRXMRLFASPKLWLIRKFRPPECEAHSDPHFNRL 
tETAOXRVECRNYTIRXHTLEYDOVWiraROAXVAFlWD^ 

vaslvmsdxofkcwtlpnleewxtssfpiai^ixeexjulxotdsxaexxaaelxoefovr 
fdhmveglskacgeeldasa i crowrsvmvmh ideowr i hlvwdllrsevolrtvook 
dpllefkmesfllfesli rdr r it larhlfrleltvepnprvnwi ptvatsfmmivnyc 
plelty7t0sed0d 

OPn.OH42 ^SOIS '^4710 

CT702 nypMthntiwMi protein rtr.jiw-ahitt with 0841) 

KYYTPPTt3R3PW^tALm:;EPEY0C>10LLICTOn^ 

I LFf: XAfS/TLJI r^AOFL t AT.NRRKFVI frYYNDOVWnEWTPF X 

<»n.'rt4! <r*f % •••iSJ tn •»*,4*i*i4 

CT7'k* r.ypttrlicr uu\\ oror.«*n t Hr.wnhitt with 0»Hll 

tW.ttKL : ; rjWRKKVWYiJuYW rrrrOf ll/j I J'FJJLI i3Nr:ERAAL'LOKYyETt nr/VEENTOOC 

DLL V/V:El'.VFKm ClhJfX' 

••|ti.'i««44 ^^"1 '»'»».'/ M v/i| 
ypli':*'iTP'»:;«' , n , P-luih]iiiii |jf ••hi 

KNh TCT IMLK I A ILCRI tMJK::::U-»H , rur.l *\ 1 VK^/Rr.TTRIlNI.YC JRUIAFtn/PAOV 

r nrrr/um: :ed*/fokm i ynoai .TOAF^nvr.i ,i .v t u i t«r: i teecdaii i jucllltlkkp l 

[ LVAiirADf. WEELO HIETYKI * ; lUM 'rrv: n'AJ inKI 1 1 1/TUW l K LVANLPEPRF.EEEE 
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■ nxEU.^TiraiKZSEAALPr^rrrpqr.ievrrirr:. ..xrrtPESPOOApicrLKtALiGRP 
nvqc:::: t tw jllneerc tiBfrrrTmmf 01 ly=hkdroylp idtaclrjcmksvxjcic 

W t j^^nTDCA I : *RAO I CLL7 I DATQX LSSYEXH I LJLX JKRKKPH 1 1 LIHKWDLLCXVftft 
EHYTItDUlATOPYt/iOAfKLC (5A7TKRNLXX IP^A I DELKMWSNKVPTP IVWCTUSA 
UHRNHPOVIOCS RLB I YYA t' JK 1 1 1 P LQfLLf If IAKSU.TKHYEYYLIOmJCSSPNl.YCX 
PFDLEPKEKPKPHN 

! i. ■ .yr>.| , . 

»?r:v. , : j ;• :•, KLwr.r.- :• rvr-; : K.wr.": rf*:j*r«v WA/r.*". R-.TwnJC." 

R PLED 10 lATNASPTIVST tPPOVI J I G7AFCX IWKOOOUJWATniSDGEY103GRHP 

dr i iFsawmAuworrwoOTOPPtDicvronra^ 

LRIIWIAIRPSSSLOPTLOPTTERAX IKCAPALVKSVSPDtl^njCXKXJCROPYCALSLL 

UOXVLIFXFPlIJWPYSLLmXEFAPJCFTIPTHTPEIL^ 

LRISNKEUaiESWYEALPHFONOSCNRVFWAHFL^ 

ISRVQELESRLEOFILRIICTSSPVVSAPDLIAJQCXSPGRL^^ 

EXZLLLL0OQSPWK 

CPn_0846 959383 9S8U2 

clpX-CLP Proteaae ATP*«« 

RD0tOTICm,T:CSPCXRSE3O2VEKLXAGPSWCWC:rc 
TPSOPSOIJVLTPKEIKXHIIH^VIGOERAKJCTIAVAVYIOT 

VLLLG PTCSC K7L I AKTLAX ILDVPFT I ADATTLTEAGTr^CEIJVtN IVLFIiQAADYITV A 
RAERGXIYXDEX DWCRTTANVSlTRIA^CECVOOALIJCIVlCTTANVTPKCCRKHPtCE 
YIKWraiLFIVOC»FVNLgXIIAKRLOmiGF*ro^ 

APGMX P EFVC RFNCI VNCED-SLDEL VA I LTEPTNA IVKOYMELFA EENVKLVFXXEAL Y 

AIAWCAXOAKTGARAIiTCLENLIJtDW 

EAXA 

CPn_0847 960019 959387 

clpP-CLP Protease Subunit 

KLPDEETOKTLVPYVVErm^IOUWDrYSRlJUaJR XVM ICOEITEPLANTVXAOLLPLM 

HALPttSlMfXKOPSaSXXCTSAOXOLOAACXLTUCICKXJ^XU 
PFH»E£MSYCUDKVVTSAlCETNXX7rSST 

CPn_0848 961556 960177 

tig/murl -Trigger Factor-pept idyl -prolyl itoaerase 

VQASSPAFPFICSNiaCGCLVPIlSLSNKjrSVO^^ 

KZIOCEXTLPCFWQlAPDIWASRYPTMmXXX£^ 

SNSITQ roLOECAXVEPSYWPAISDLPWXanSLWEEAASEISX^ 

ATKTPraPSOECTFISISLHVSKSNOINASSAAIFTOC^^ 

TCHRWErmPEIOSPLRCDTtTrrVMAVIEVSIPEIDDPUWC^AE^^ 

QLEKQAKDKQI^ICRrSEAIJULAHLVDFEI^SLLSPISLITBFTrT 1 fflR LI OYCSDEE 

LEICJUCSELXKEAEZDAT1CALKXXFLTHKI rSDEKLTISREELOYKKWCSRERPC300P PIC 

O X SNX7TWELVMSARBRLTYSXAIEHVLRXAEI.LASTPSA 

CPn.0849 961752 965285 

aotl/snf-SWP/SNF family helicaae 
AWXIHSYSK3EW^FRKIJUlDPSANIL0CXaCiaJTW 
CLYIinYlCnEVB RSESX mmStKJCSYWYPC^rVALLn^ 
ETOKXNEEVKXtZJCZTPVAAATXEEEiUCDREKOKSIL 

EKXiSA^JWXJ^VSVKEXTrPAPAI^PXEF^LVlJttJCIlSXPPYX SNXRTPLESVLYQEPXV 
. .„_____„ — - jRYVRYIWfrTianCLXJCSAYWPPAl^ 



DDgV0PEt7TMX£S»miHXFVYKRPSPQIK^ 
LPVFOEYAEIANVHLLNSPVIXPYVDEVIOUC^^ 

LQY0DVRAT IS DBS XIJWtL ySER WLEEVPSCFI YDERDGAPRVKSEKKI VEnfTTTX P 
SAJCICRPiaJ?KACTOSItGTWCKWS^ 

KCPLWSLTCISLDOFEALPVNFSMSERU EIOKOIRGEI EFDFOOVP<»IOATLRSY0TE 
CVHWLERLRKMKLNGX LADOHSLGKTLQAX X AVTQSlOJaCSGCSLIVCPTSLVYNMWE 
FRKFNPEraTXVIDC7WS0RRXQLTAIJU>R^ 

HH I KNRTTRNAKSVXX IQSDHRLILTGTP I ENSIJ£EX>&LFX)FIJlP(3XSSYORFVGKY X 
RTCNYMGNKADNMVAIJaCXVS PFILRJWEDVLKDLPPVS EI LYHCHLTESQKELYOSYA 
ASMCOELSRLVlCOEOTERIHIHVIAaTRL^ 

SSLVDSCHKTVVPSOYTKMLCI IKKDLESRC I PFVYIXC^TKKRLDLVNOPNEOPSLLVF 

L2SUCAOrrcU^W3AI7IVIHYDMWWNPA\^2^TDRV^ 

KX LTXONRXKSLVWC/INSDDEVVSKLTWEEVLEU^ I 

CPn_0850 965254 966390 

mreB-Rod Shape Protein-Sugar Kinase 

LGKJCYWNCCRYDFNS PKRWLTKUO^SNRLYNRAJLCRFDKVrNrP^OAre IDLCTAfn*LV 

YVRGRSXVLSEPSWAVDACTTHAVLAVCHJCAJC^^ 

GMLKALIKRVTPSRSVFRPRILXAVPSGrTWEXIUV^ 

CVDLPVHEPAASHI XOIOCC7TTEIAX ISU»IVESRSLRIACOEPX)ECIINYMRRTYNLM 
ICPRTAE£XKXTXC3AYPU3ZX}EXJ^EV^RX)0VACIPXTKRXN5VEXRECLAEPXCX}XX 
BCVRLTLEXCPPELS ADLVERGMVLAOOCAL X KCLDKALSKNTCLSV IT APH PLLAVCLC 

TCKALEHLDOFKKRKCNLV 

CPn_0851 966378 968195 

pckA-Phospnoeno I pyruvate Carboxy kinase 
REFCIVMVWSTNIICHECLICSWIIJWAICLTTPKDIRLCDC^ 

NP EPHPNCFLVR SSADOv ARVE0FTFXCTSTEAEAGPT^MlX)PQXMRKElJ{OlJ?R£XMQ 

nRTLYIVPFO«3PLOSPFniVCVELTIHPYWC»IICr^ 

:^WPWPCEAWSVfPCWICSMRXVHFOOOSSVMSF*5SCYOt^ 

KJOWLAEHKLI ICITNPECWCICYFSASPPSACaariXV^PKl4H^IECICOOIAWI 
R K ntDGPLYAVflPEYCFPOVAPCTSERTNPf IALATCPSNS I FTNVALTAH9VVMECLTE 
^PEPLTDWLf^PWPaWPAAHPtORFTAPLJWSW IFCCRRS 
IT I PLVYFAfcAffiHr VT10AO4S.TTTTAA IVHCLJKLPHDPFAJfLPFCCYNMAYYFOMWL 
; JKAOIRSIJtLPK IFttVWFRKNNQnEPtWPTTCENLP'/LEWI FQRTDCLED I AERTPICY 
UtHOKFIILMf Jl /n^U.nVOELFWDAOJWLAEVEN I^EYUC I prVWPCO ITDELLfl t K 

- iTi.'iH'^ »m074 iViKU 

T'MI \ift**tU*tt ic.il protoin 

l KU* ( ( I jYYYI , I NTVTLy Pr!Y tNFTPNVTTALfiOCK It/T:IA I D ^JALFFOELOOKAOC 
1 4<l lAWIlAMvU lAt^LKPACWrj ( iJYLPTEEi»i!R f ( 5A« ? ( I DRTM PTFTO DEVKA X LQ 
NIWRRX 1 1 VD :U)KVFR:n*LttVTPrEr; lOPSNPE^A 1 ILWY ITLUWLKPKFAACST 
••riWVDYHALYAr.il JPFVKElEAUCAAOAPVr^KVUA^K tMTfflflJKJVLCYPVTDYLN 
VVIAI/I CAM rTAAOEVCVYUCNFY.'; » LKU I U lf> VT^f ^;ATHYPAOAEYNARIlM.»V IQS L 



LNLscaiYROLToaiLPvr:*. cti xacirsfom mct : : a^ktllj 'ttnrlttll- 

rYTTOCCATrrUKSYCT J TPAXCy/ 1 OA INOEKSYWOAR ANTFSVTr^X r.TPQFA TNICS 

mrtciajyowtrmirFir rv t^ trf RYPY^^Tt^yrctigxa^sa c^acL : 
sgwsto x AT T WQ»m PSLPcrpgrweM«cpym cpt q o Nv 

IASMXC>fnfSinUUUCYUiXaJICEITrroSADIYYSL5 

KOEXTRAMADrTROOtlKAAl OXKLVEX KADAELSK3C I RXL\rTLTNPKSCSXLIWL 
SCtijCPLSCLTLKAVNDPflATYEAFTAE Z FTEPFKNWKRCLATTX3fVt0CCt>C ITPOC 
0OCXJXAKESSOCDr7rP?F7COLALCLES3A»CXEWTLV3AAL^^ 

LT712 nypocnecica: protein 
NXJOfreiEJOlHSLPLTAVAPVPEISWPSVATTVDYVW 

LCKAPLTSMKOCFINTCTELAI IQA3LAX3QSSRESRICKEEKIPHOKLCKAAPOAATATSC 
VQPTADPVAIXHPLOSAFAYVLLDXY IPAOEEALYALCRELW^A<WSPLLWI1tS 
rNSAPIWW£SYXSOTSCTANFAYCYEKXLSRYWIEVS0CRLOIAS 
SVKANVSLTDAOXJCQXEri XASYTICSLWIKTC^TOVKrKLASITTVPCnj^ 
OCOLSX IALQNDOCVL VDGJCVD XTTA VNI^GCLLN?TTTVLTUVt>JYGQLAOTOOSJCJXX 
UCAlCQ96L\^ASLXLLICa(YTTVXSSrW 

CPn.0854 972849 971806 

anpfl-Outer W— Pra ne PrateiA B 
CPPCIMSICMJQajUATLSPSMPPCXVSSPAWALCAat 

CNSYDLPAALAQSmmf YUA VTSESAHITNVPVITSV rTSCTCTTTT ITSTTOIVPPO 
LMMSSXSSSCVPATXALOmPAAIPLliJXAJTAKVOCLKCrYYlU^ 
ESEVT«XIEWSDYCIVWCLSU»CVU«XWSFVC^ 
XYPXMTt3CarLSY]CX3riSA5XSXSTYXm7YVL^ 

FKiRKrwn3RvwcrcTTCCis»<rnfsvra 

CPA.0855 974001 972994 

gpdA-Clycerol-3-P Dehydrogenase 
CLMGQKXCYLaCXlCPCXJVSIXKNXCYIWAWSIu^^ 

LSFTnMJCKAIKNAFMrVTECVTSAGI RWAEOIJtOITDLSVPFVTrSXOI EOVTCLLtSE 
XKXJTVLSDSVTPYLSYLSSPS X AKEVLNSS PCSWVS AYBSOTUOO I HIJkriLPTPRVYP 
WroXJSAAlflCALJOWXAIACCIAISIaST^ 
CLACLCUXLCVXCPSISSaaAPCKLLAOCLTP 
ILHP i TT Ul tKVLYENLJttJCIglAIJXORIOTgPL 

CPtL.0856 975410 973995 

AgX-1 Kosolog-UDP-Olucote PvTophoiphorylata 

C^MWA^TWTESVYSPSAIOfVNSLAOiaJCAIW 

VDIDPPlUCOCMLLSSPTMLJtDPHPmPASSCaPEWWiACTT^^^ 

QSlu^KCXSPHBLFPVSPXXXICPIJOLVAIXVIUU^^ 

ESMXWPKLOPtCVDPFCQPlWLLTLSCDLPLmiXTr^^ 

WO«CIEMVSVXPIDNPLALITI*EXXCPHAM^^ 



DUJIOLPWCVTClOn^PmTPllJUDPYY^ 

CPn.0857 975808 975392 

CT716 hypochetieal protein 
LLX4JtOnxrAJtCXSWADIlLSSLSLXLJCVKZHK^^ 
XADLKXJOTDiFISiraXJWyDXIXEYLXTL^^ 



CPnJ)858 977115 975757 

flll-PlagelluBi-spaoific ATP Synthase 

RNSET1WRjrrRPSTPCPl»MHXJ«EXX^ 

CEIOUSSTKDPNLLAEVIGntNHTTIiKSLSPIJiWAL^ 

RW*TCWIIJK1CE1XJ?ICTWIKPLLSLPW 

(rtTSEPGSGJCSSLLSAIALGSKSTINVIALIGERGREVR^ 

AHBTAPTICVIACJWAKriAEYPIlEOtSHEVLFIMIJSLSRW lAALQEVALARCETLSAltTYA 
ASVPKHVSErTtllAOMNDKBSXTALYAILYYPKHPDirrOYLIC^^ 
SPPXOXLSSLSRSAOALALPHMYAAA£IUJ(SLXJOrmEALDX IKLCAYTPOQOZSX0KAV 
lCXJ^XIUVLA0PLSSYCYUKrLJC0LIJUJU3S 

CPn_0859 977597 977055 

CT71B hypothetical protein 

N^VTTPCVPGSLSOSHLPNPKi3PWDTEPTSLPISPN0KAS0ELHSLVHLP 
EVEXTWLKPDLLELALLICEKPLYKKLIWPOELALLLSTALOW^^ 
PEXXJCTLTWXSTHELPKX KHAEPPPDTSCRRSCFK I ETPNC X LRQEXSEZLDKLLSVLT 
A 

CPn„0860 ?^S639 977608 

eiiF-F lege liar M-Ring protein 
RTLVTTONIAJUCLTALGIirLGCLLIOTWSCAIL 

LTCI<WPICLIESLT/KKECLD<DLTSFHPIASAICVAIALCTEDI^/HSPLW^ILTLXXEE 
SLTPSLLFS XTOYIXSSLPGLKREKI SL50NLCMLYX PKXTVNSLf IHTLEHYLQKIPP 
KEMFALAYHAItAEKPTWLTLNEIIYXAHLTKEESEKIVAMTKHYLYOMYIK^ 
PrARLQNKKSFPAXVL XC5MX LVXSLM IVALAS FYLARH AY ERVSPEPRK XKRCXNXSKL 
LEIIOICESPEKIALIL^'I^PKKAEAIXr^PEDLXMOVLKYKL 

CPn«0861 «»-9752 V78925 

nifu-Nieu-ralac»i notein 

ASYPfTWKPLKTLPLEI^IFWSSLSAr/MKKFUTPHCACTrSEEIUEA^ 
RUOJCVTFYWLVDKKNCV I LDAXPOYPCK PYLX PLAEAVCNLVCCKSYSEAYWTLCOI 
OKSLRVHAMOPALPEDS I SVCH FVt OALOT AVEQCLE I P LCDCSLP WN3 PMNLDFC3AN 
!L S 52!^V™. E0KLVALRAT ZA£ * I^PY I AMCOCEVT/ESLENF IVT IAYSOCSQCP 
SSLC5TLMS XCQLLRAY IYPELC7VKVCE3SLNLSHP 

CPn.0B62 -»jM'i24 jT)7?.2 

ytho flits- r#fl.ir«>| p:or«iin 

*-iPOT £ FRTTOCirrx: I :>tEKt^r»RKAPP I FWU0OVAI M'3ERVKE:^AUU:0JreLPPC 
™ I^JH^^ 1 ^ >;:,,rrRP,W,FMr/V!l IV1AALVENUWV4»HHI XLPAH 

ERVFSTWFPP»ITSA:.l *T;: AVAAH0TMRER t :!AU>f.FTPirT:WI i'KKL lQVHWL?::t 
LCKSAUIF.TLTEB.'JKOI^-.KtAIIAKJU^ t KIII.TTLUSSS 



idpm* f<hoephmMyi'i<t.ii ,. 
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OIMALLlUi»MOOSVWMtlOILr3CWVOrPLflOOC .FSACRAIONLPIDCIFT3TL.VR 
•ILKTALLAKTrWHSKK IPY tVHEDPKAKEMSR I Y5ACC0BM4 1 PLY523ALNERKYC1X0 
.^KWCtOTAEOF-7EERVWJt1U^OT^ 
JUHGNSUtfUKDLttL^EEVUa^PTO^ 

CPn_0**4 9U658 982374 

yjbC -predicted pceudoundine synthase 
YrM»IVTlrVRri«FLA3ArcVA3PRtfre^ 

vi v m 7??\vwr.*t":% :• ;v:. •. \-Krm • nrLV : : ■: ••\!iLPVPvrr.n^u*.»'LT:: ;:.: lvt:: 
: • :rp w 1 i 1 ii*. : : Wrrr.sir.r'M v. mrpi .• »" -ms-TTF re* :nrjw -r - v : * p-ttvk 

IWSEOMWEIRIJACW^FPXLEUCRXRICB^^ 

CPn.0865 9B2412 992942 

CT865 hypothetical protein 

SPHSYVTYVlACSin^ISLGAYCOLYYSVKSVLrSWYXXTVYAl^^ 

Ef^SOttXPFlSOCDKLSl^FLIMSYEIIPTrR^^ 

AXLCIOfl^lWLFDFEIMYEEAVtXYlJCI^ 

CPn_0866 9B3494 982916 

birA-Biocin Synthetase 

NKKNaYYEIEEIPSTWTHAltfYWI^^ 

FCFFrrDLHICVSRIJ7U^7ri*WALCra^ 

ffiU^A/LGIGUQfrTTCQAiJt^^ 

PDSUTXSmCNI 

CPn.0867 9B3405 984667 

rodA-Rod Shape Protein 

CIRIPOHHIGFCHCVROOlFPYFVXMirHXLEroU/^ 

LTLKU-SWVISSMDPTAMLVTSS1CCLLTNKS OtSUWFALWWPFICAYTUYHLFKRW 
AWLYFFMICAIAflSLFFWSVONvWOOT^ 

DI TSXTTAf LACLWALPFTLI IJCEPDLGTALVLCPVTLTI FY15IWHS LLVKFCTWAT 
rGIIG5U,IFSGIVSHQXVT3>YAIJCVIKEY0YER^^ 

AAGXTVYXJMHVXINXSIWCGIAPITCOTLILISYOW^ 



CPn.0868 986733 984670 

zntA/cadA-Ketal Transport P-cype ATPate 

NrRNCLCVRDLKHTREYYL I XNEI X XTW YVPSKLTFTSPSAEVWrPrESCMSHITSPL 

UXQNRKL8HNU>LKSAYLSIinYLXAXX5FMIJiAJ^^ 

NrCOXVAOTDILKrSAAFGSIFIGCAIZGJtfliVLFAIS 

APTTOWLVU2X31LQJCVAX NKI EVWILRIXSCEWPLDCCXUCSSS INLKHLTGEXVP 
KSOfPCSrVPAGAHNHBGSFDLRVLRTOSDSTIAHI XNLVX0AONSKPRL0QRLDXYSSV 
YALSirAXACGIALLVPLFTSI PI^PQSATrPAIja^AAS I<ALIIAIPlA YI^AIKA 
CAKHWLLKGGVIIiJRLVSXNSVVN^ 
SSSHPIAZAXVSYtMBO!CVSSWAXmYI.TV^ 
l^IEOKIYQAKQHOEICSLAYMBWFAXTifPRDIPRPOJUCE^ 
HKVSAAITAEXtCISEVrFX&TPEiaCLAXXP^LAroRQXMf^^ 
MSEAfiSATAISAADXVXXKOSLSSUVX XQWU«JTKICVVSONLAIJaAX ILLVSWPASLC 
XXPU^VXLHBCXSTVIVSI^ALRLLXS 

CPn.0869 987479 986658 

CT728 hypothetical protein 

ESHRFPPPKTSlMrSXCRJHOXIJUCXMrQDPHDHnCSRTPDHXn^^ 
TFlCBFFYHLAWAl^rOTIFFIRTLFFLIPTNRA 

WAYMELSHWMIiEroiEIEDJrEOCKin^ILJTJJOGrjQPt^EKVrYVCSDST^JlOT 
MIREELYIRJODIJ'HFLIQGCSRIL«Zja^^ 
KAXILKNOXXSS24VWVLCXFXTSASI ISSLMCLL 

CPh.0870 988881 987448 

sorS-Seryl cRNA Synthetase -2 

TTrHPTOrCGAVIWrePISIARRIIOtSCCSTOSrYSHPCTUIJ^^ 

TPEECETRXJWXDPKISLEI-Vl£LI»CEVROUC^ 

NLIOEVETLAADUUEOHLXXHMAOUfEIXS 

PX FSFPPKHHLELNOELDI LDFOAAAKTTCSCWPAY WJRCVUJMAliTYKLOKOAAHGF 
OLWI-P PLLVKXEI LFGSGQ I PKrDCOYYKVCDCBOYLYL I PT AEVVLNCFRSO D I LTDCE 
U^YYAACTFCFIWEAGAAGAOERm^ 
LTEIJaJ>YRt£l4£TeiHSFTASmDA£VWL^ 
QSQCXWFWmJOSaiATPRiXVAXUNWAa 

CPn_0871 988766 989899 

nbD-Riboi lavin Deaainese 

EYMEDFSEQQrJFMRRAIEICEKGRITAPPNPW\A2^^ 

LAIONASMPXSCSDVnrVSLEPCSHPCSCPPCANLU KKXVSRVFVALVDPOPXVAOOCX A 
MUIQAGIOWVGXCESEAOASLOPYLYORTHN^ 

PEARHOWBKLRAESOAI L^RTVLSOJPWLTAROPO^YPKOPLRVVLDSRCSVPPTS 
KVFOJrTSPTL\VrTERCPENYXllVLKLr^ 
VEGCTTUiTSLLKOtFVNSLVLYSCPMXL^ 
3LKWWE ISPOVFEPIRN 

CPn.0872' 989903 991216 

rtbMribB-GTP Cyclohydratase k DHBP Syntheee 

KEP.IFRVACLASESWAJlESMIETREir\^SANFVSLD*AIEDUW3KrVTVVOEASREDE 
CDLIIAGEKITVEKHTFLLQKTTCWCAAl^ERI^ 

AAH'T/TTGVS AADRTKVV0LLADPK3KPEDFI SPGH FFPLASS PGCVLKRAGHTESTVDL 
MELACLQ PCCVLAELVNECYSWRLPO ILEFARKHN I AV I PVTS 1 1 AHRKLSDRLVSKtS 
SAPLPTIYCOFT JHWESLLIXaf0Hl^VKOn/ACK3Jf/LVRWSEC\TC0IUSSKRC(C 
CE0LS3AMSYIAEKCTCVLVYLRO0BCRC tCLCHKVPAYALODNGYCTVDANLAMCFPVD 
5P EYC 1 CAQ 1 LVOLKLTT I KL ITHNPQXY F*Z L0CFCL3 I TER VP LPVR I SEDNEQYLRTK 
•>Efif3HWU3LPCOmRVQ 

TPfi JJH7 i <ie | } mm IVJ4 

r »Mv»ftihiryl lun-cine :]ynth.in<i 

I XUNrt V rfWFPXYHKTlJCaiLSAKNLR I A I VCSCFt F^AMADALV:* TTQETFLK FOCliE 
1/ SLKT I RVt» IAFEI fVT T KKU^'SKRKFOAl VACOVL I WTUIYNOrVNOVAAiSltlAUi 
LEKCU* tTI-: IVAARZAE I AWtfRO: [ KCAH L/ IVSGMTTA I EMATLFTC I 

>:iti.iih74 -miM 1*1174*1 

•T7'j'l hyp* In.* t»Ml pi or #• in 

I.I r.Ult* I LTKOMI1RKA: WLK ILK tKVf,VTPLALLfr-/.7c: I'lYAI'IXM'JU/fKJOTKVK 
I ''SRVW f WW^yvrKl^LTEax^inXTnTr l»W/;:EKLPNKKVrALD I A I r:*m iiil 



RMVocJAXA'/rrntpY rrc^r: rojrf AJCA^fVLaTrr £,-x.- . >\ R^ri^r^ ^ pR R>. 

LRSHRPILNy CTn p yTTVt^SETTVISGClJVFAEJIA^-R IHUCA\^tK1 



CPH.0R7S 



R WRJISRRLFAP>^T - , K LTUjVv'ANFKTY AEK I J UiCCRCLJ KV7 aiAAiM J 1 JLAL» 

OCUKtULYRXR^PULXEAXJ^PALXiaor^ 

CVISEIDXAAFACTU;U3SCTVASIVOCERWPELVDXVrr 

CPIU0876 994123 995517 

daoA-O-AUnine/Clycine Permaee 

DSSSKA^SSYEAVACIlJkQmrTCNIACHAVAiJkCCSPCAL 

SYLSSirmptiafxcsrxocPXACLArCHMacx 

LCAECTPCIUXVCXLLALWXPVXJIOCaraXLJVSA^ 

XLPAiirxxcssAJCiKAajscxamrrutjvxsTCxrauNmAT 

FVVDCLVTLVPPVIVI*fVVCSXTi&VLXVSCAYS5 

AHXLTTm-rLTWFACAlXStjOYMIPaUUSNU^ 

PSOINatJCXAtlAUJCXVUTNROVAUJ^^ 

CPIV.0I77 995521 995982 

ytJcL easily 

PJUlXtOU^PA/AYGAPXPIOCrrCOGACISPPLTrVTTVP^ 
OOXJ#IH«VYNl^TTITNLAE^XFAVOCWSCKPV^ 
VUPETENVTRDQLYEAKETH 1 1 E0 AZLHCTYtXS 

CPO_0878 996660 995992 

SET Dose in protein 
QCXSTVTTCPCSSXKXSUZNOUIUSSQm^ 
KXSVOUU.XSPIA»4U2IClJaeOOLX£PPAPPVS^ 
TC XWROAXWDKNDYCFIlYPmJTXJnrT^ 

ECLFWiiRTVAP TtmucrwGMmnnuzEFiniz 

CPn.0879 997463 996645 

yycJ-«etal dependent hydrolase 

YRII«WSWCfTPtASGSXC2HSAYXOTS^ 

XFVTHEHSPHISCIKSrVICAYyrPIVOgJTARAirHIXK 

VQTfWPHDAVDPVArXFWREI2*lCFCTC^^ 

OSORPOVYXXRVLSXLCiaSNOIX^LUKXXT^r^^ 

SIASITSXAPEXAi^ITSPXYTSRLIVACPR 

CPh.0880 . 999B64 997444 

• CtiK-Cell Oiviiion Protein PteK 
PHXRJDUaCMPPXPTXPUUJU^YWACFSGULW 

FLLYTFGAAAFFI PLYTUAJ FLYTTUffPRPLFTTKAAAJlJ LPFCSAIIiSMLfPWTL 
PAIiDTRIJTCrilOOTPVSYVOGIPFTLrm^ 

SOPSPRRVSETI X WSXSPr^mPCSXXESFFLTPMPCKRrLTIVVIVOa^^ 

TXAX^STPTVVe^S]SaCE3UUUJ>XLK5» 

LXLWTtTSFCIOAXJUWXCSGPTlJUkFmPHaWC^ 

APIPOCAAVCX EXPTPFPOAVNFRXUIXDYO1CTNRXL0X PTI TfiWIf ANftlWLMAflCATMP 

HLXXABTTOSCKSVCXKrXVMSMXMTTLPSnKLVXXOPtOn^ 

SREVYIULVWLVKPffiSRYEILRYliOl^XOAFTiSRTR 

T pfT CTf f T eeertnT*^**PTTPf-lrtll*ieA*JOTHLTLA*rflePgBP/T*ffBI-T*CAlfT >> PeeteffK 

VSNKVNSOX X IDEPGAXMJtSNCWLVIAPSVraiRA 
TQWIPSFHAFODSOSDKSCI3Cr»l^AOAm.ILOTa4AOTFW 
0LEEARI ICPSBGAXPROILI0NPLEC 

CPrv.0881 1005646 1006209 

No robust homolog present in Oeneb*nk/EMBL as ot 11/7/98 
WOCFAVHMPVPIDNSSRJ^EVTESLEDL^HAEESPTHOSAESSSLOLSLASSAISIRV 
EOLSSLVU3KENSDFSSLRDVPI FS AI YESSTHTPVPTPLVCVCYI NCSOSOYYDTORXS 
U<LSQIX^ RRVEVVYNtXWFMEAS LXWLCPRRPRRD PS P X S LALLEUrfEAFFLCH PPCS 
TFNPIFFW 

CPn.0882 1006169 1007404 

No robust homo log present in Cenebank/EMBL as of 11/7/98 

NTPOVAIXI0YFFCNC^nfVREAIJU.TPHA0NIVLVCICPSLYP» 

RFI)ORCFVNSCVETLPYSSCSFGXFWXSFTDPTTNFAXVfnTMRTAC 

TSLIEMROUEOCCANKrOSUOEESt^IVGHTVCCVSmVTSSm 

TLACAEENPTFPNSTXDSU^I4^VRXSMVSXFWZFpI\n7TTY^ 

NGXCSTFUtLTNPRSRRDPJffiNUUMVLCYU 

LYAMVTUX>m;AI0OAWYCFPSVRDAFYRYCI^P^LTO 

HLEDOOWASILNL^FXItrFCPVCUnTPTCLEISPSCRWDA^ 

CPTU0883 .1008904 1007573 

dtipP/nqrc -Phenol hydro la ie/NAOH ubiquinone oxidoretluctase 
LYELFIKSCrFrvmWI^LYFICIASLrFCAICVIU^CVILLCRKlJ'lKVHPCK^ 
NEELTrPVESCOTUVSLUSSC IPX PSPCOCKATCKOCKVRVVKfiAOEPLETORSTFSKR 
0LEET*fP4^CCCKnA>IWSLEIEERYUIA5SWECTVISN^ 
FKP0CYMrTVP^KT7C:?O«<7IKAPC^YS0WEMFH^ 

5YPAELPTIKFN IR lATPPFINCKPNCEIPMTVCaSYVFnLKPCOK ITVSCPYCESFIOtO 

OOTPLrFLlCCA^W^FnRLZHILDLLUfFHSKREIOLWYr^RSLKW 

NnrYHL'^ETLrEOrAAJ^^DOPTrniFLFRViFNUX;^ 

::^lLKLLCT\\n,TR:*::i itj»Ff;.': 

CP* u«HM4 I iWt II. H 1 OO'tiMr, 

ITT74I hyf«triu«r puir.oiri 

i xxnum rvnvFi>xt.s:i.»xFAaEF^y::Kfrn--/yi*A\mi aia n,h>-vp , n.wn'Eu»CRR 
KAMKKRKNnLAKvXKvrAK : t urivTM/i praiTV i lii t kj jr.'jvsnx* „\ i ::b i ijrrwrNxr: 

Or'Tl_OHH r t ll>tih,Ml I'Ml'Mf'l 

y«|i;A ' f KNA Mifliyii I .ui;i|.. M :i. 

A:;ii.Tit7r9mrnin.v^&u^:H^ur/:nc:i^ 

.-A & mnm^FFCTYK-KKc-j t w i : :: rrr iw. l r vm :LLUiFvn«KtJ(i.Ti«Wi*eji(« 
pki jiaykpi'KNk* j- :i ; rn.Tvim :: :i \m iphv 1 t/PT y m-RYHVNivu *l UIMKKI I j . 
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tit AT* I YVttrVAAAC I S TTYtTKLLNTlA PZ I .SnCNSASFGLRPRGFFOPOXTO 
AAKI terAyETINPBCCCTLlSL'/CCACTICIMLCPYVKWtCVCI I PDAVASA0ENI KA 
tWXEDLVrrrtXnAKATCKRNEMCKAPOVr r tDP PRCO«OSXVUCYILRIC3PKIVYISC 
NPCTOTCOCAOLlSOCYIItlUWOPtDOrPYSTHLOiltLLCRErOP 

CPn_0896 1011299 1010909 

nccA-KistonwLiko Dev^lopraental Protein 
RTLPHALrOTAirimKOJJ^ITHOLAICAeKtWKAAAORVm 

\\..rW. :> AJ Ai"."T-.\KKK.M ,,F " ' 'WwVT r.V.VA.."*" A. TfTAASKWIT.* 

CPn.0887 1011692 1014157 

CHLTR possible phoapnsprotem 

KXKLYHPTI^PLIRLSLIFALSLTLISGNFPQQXSFGHCCAr^^ 
OPIIJrVUWITLTARDMCTWVW OKI I ALRLPTOMUQ) 

U3ILWHM^P£3OAPUU^n^tX^rTICCKKSL0DHIira.YTV^ 

rayraAXEXJucELvjuuxcsespKPEXvoxscr^ 

TPYCLSEIArrEAffflALVLRIAKWVSRTNBVDSVUJHALOtt 

HGAYLESTLLYY AYFS LLELYHQTO&FASLXIUXEICGBAVFVPEHPYTPEYUFFLGAYFY 
AKGXYESAEKYFLQ X X DPAVKl^TTARAYEYli^IAYVQNHYEXAEEYFUlAYXSfcGRE 
ESGICLFI>YAVOKKXTACEI^YHPKFSFTYRiai^ 
AVI^mSRCXYIMXJCYIUlVTYTHPXXMYWRt^^ 

AFWSAL05SA5VPRSL XESStJVDmrTXf^CALYTKKPDAIAHLPOAFSEECNSNOTA 
LR LVWn.VRPKCAPNKAKYWDHLVUlPKCDSLYFFGYDLQr^ 
PKSSU^VYYLOCYSESSALRIO/eWTVXA^ 

mSIXWFSOAVHILEEVKn*WASHPXUfFI^ * 
AHL5NKLX-£HVEKNUSPRSYRDYYCESL0ini>Cl^RflCV 

CPn.088B 1015441 1014119 

henG-protoporphynnogen Oxidasa 

AERRFCVKRAI X IGAGISGLAAGVMLHKKFPQAEILVLDKEAYAGGF VK I ESPUJ &f l) L 
CPWFLTRCOGEYTLICUKE&SLONSLXPSDIUUU^^ 
SLX103FRAPCTODSSVODFIJCIWSSC*OTmLDPLITO 
REASSCglJL RSYLJO^ PKKSTO 

FStJ>RCYGMlJADII.PLLGrvWNSOIFP^ 
YUflNQlOWAI^SSCIOa^HAVCFLEM 
AYHAICOLHTEETLAOPOSSL 

CPn.0889 1016841 1015462 

hemN-Copre p o r pft yr inogen III Qxidaaa 

FDGFWNFKFUXXJfOPAPRYTSYTCALEWEPSt^ 

CQSHCLTOCSVVLiraREDIVrAYINrLIQEJO^^ 

EXJTLXJWIHKLFttSKAEEIAIEVIiPRSLIUJISIEJU^ 

QCAVUU^HZCSWYCKncnJVrQSINXOLIYGUKOTXESTSiaXOOX 

LFSTASVPWXKPHQKAHXASafPSNEEXrATrSOSIWU 

LAFP«TLXRyrQCYSLPPFFnr rr.r/yrsTCrrrorYijQKAXTiJEfflNTVLRgrrATy 
KSiaLTQaDRXfucMAi»aiCTrrxHKEeTFNXjTnrcrtr^ 

SPGSUCVTPL^FVRVrATAFDHYFUOVSKKICrSAS I 

CPn.0890 1017829 1016819 

heoE- Uroporphyrinogen Decarboxylase 

STLHWDSMSAFFDLLKSDTASHPP IVIJJIQVGRYMPPYOEIJCCSOSLKTTFWnTAIVT 

ATLLCPSLLHVDAAI LFADILSmcrAVTYPFAPCPRIOFS PEP PF T F TS DP OTIFSYL 

L0AIRTLX08£PVPLXWAASPrrTACYXIIK»Sro^ 

ECTATriJCrOKnAGAAAVOLFESSSUU.PSALFTRYTO 

CFEHNTYTWATQADTUiPDYHVDUWIOKNLMI^ 

VPUWYPNFIFNSCHGILPrrPLENVQLWSYVORQL 

CPn.089! 1021079 1017819 

m t d -Transcript ion -Repair Coupling 

NFMAMDFHPVNLDFSI SKEFKEETL PLLLZN I HPGATAFLAAXXFHBCRASV IM ITT PAR 

LDXU«lLRTFtJ3QAPVEFPSSEXZ3UPKlAmXDAVCm 

AUaCTBSWTSqQHLDIANCTVLDPeATTELCia 

FPI^SPEPFPJEFVCEXIISIRSYWPSDQL^TaCVSKISISPAYT EEASa ^SKSUlTy 

FSTPPLYLFCNLEI LEDDFAD I SGTLSS LPDRFFS ICTLYDR I STSNOVYFSETPFPNVX 

NIJCi^VIIEAFHWmE^ETOAI?ILYPEOIIOt©EWPLI^WHLOEYMPPMCKPUaA 

lYffTKTKSIJtEAJlWjAETVAJlCDVtlYEKTGNLTSSFALVNEAF 

QKORTMFSVTTEEVFVPIPGETVWtHNGlGKFLGIOWPNHLN^ 

YVPSNQAYL ISRY WTSDK AADUffllllSSKWK RSRDLTEXSL IVYASKLLQLEAORSTTP 

AFVYPPHGE5VI KFAETFPYEETPDQUCT XDQ I YNDMHS PKLKDRL IOC OAGFGKTEV IM 

RAAVI»VCDC3«WXVMVmiL\TOHYETFKE»W:U>I 

VASCOIOI I ICTHKLINKSLEFKKPCLLI IDEECRFCVKVKDNLKERYPMIOCLTVSATP 
tPRTtJOiSWGAIWLSVIAKPPLDRLPVSTFVMEiOfrET^ 

tESXYTLAETIRNLIPEARICVAKGOMCAEDL^IFTKFKIJOKTDILVATALIOCXOIP 
NANT IL IOHADKFQIAOLYQMKGRVGRl#<Klttm 

JJCGMXIALHELEIRCAGN I LSTOOSCHICTICFNLYCKlXIO(A\^AUOafTSPIiFNDDV 
KIEFPVKSRIPOTV'XETWMRIEP/OKICNAESSEELTAIOEOtRORrGPt^OEICWLrA 
LAEIRI^AWHCISSIKCTANALY^/QICCI^ICSECTKKTLPYALSPTPELLVKE'/IESIER 
OFUNAS 

CPnJ>892 1023673 102104* 

-ilaS-AJanyl tRNA Svncnetaae 

EFFFMLSNTI RSNFLK FYAMRHHT I LPS5 PVFPHND PS I LFTNAGHNQFKD I FLNKEXVS 
YSRATTSOXCXRAOCKH^LONVOfrSRHLTFTDUjCNFSFCOYFKAEAXAFAWEVSLSV 
FNFNPEC t Y ATVHEK00EAFALWCA7LPT0R X FRLTDKDNFWSMANTCPCCYC3ELLFDR 
OPSFCWASSPLODTICERPLEYV^VFMEFtlRTSECSU^^ 
*TTIfTVPEADVLRELIAJCTEOL£SlCV7HP0OGf»AAFRVtA 

' JYVLP.K I LRR:7VtfYGRR L£FRNPFLA£IVPl>LATA)GEAYPEL)QUSLSO IOIC/LTLEEES 
KFKTLOJ* VHLUlWUCZlSttSZ t.XEDAFKLKDTYCMP IDE WLLAKDYOY3VDMDTF 
I IKLEOEAKER;{HKNW^V WTSK !7NELIILT££F tnYDHLSrOTF IEAI ISKDHIVSSL 

VQcosnA tVLw.':»»PYAEm JovcconE rpcnEvTTF cvrirrr::rKAi:L i vwrw isocsl 

TVEAAVTAOVNR YRRKR CANNKTrVTHLUtKALEtTLCOl I [ Rtf* CTVODTK I PLOFTIIPO 
AC;pE&LU:iETLVNKJ t R ENEPVCf R EALYSfAXtt'lGE IKuTOOTrcOWPVVSMHS 
HKI/77 mtACW JJ U ;FFR CTRF3IA7AH! tt<R I EAVTr;nc AKATV1 E7LEE [ ATLLQ 
VIKU; I V::RI.TATIDERKWDKnUIELEN::LK > PKLOKLrHNi1 VHCJ ITCLVHHLAEH E 
NJH<U/JVA*> I J Km I I'KKL laWTTEKM-iKY C VLttVtfmi . f\\\ VIlAQDLLrAVLTPCC 
« ^rXl^tAU'K^PAU'ATWtWETLWCWI.'rrOLi 



VAJCLVAMPESIREIE 



t kt R Tl.utrfkHt*»l.i::<* 



EFLAFCLCLTYSCCFYtECL JUMKKLC :\'tl^K IACATKC I- lEo I^KAJSCHPCi. 
PLCCAEiJkAYLYCYVUtONPPOrHW [ NRDRr/t^AClCSALLY JCLHIACFCVSI-C3LCE 
F*QUlSRTPWCtCtTraEATWWff^^ 

YCLACnCCTMCSVSH BVCS FVkOSLWWW T YCTWUVLOCYMq W Igl RI W F1A YC 
WDVYEXDCYOFTH WETTSSX KROOE*rVLVI AHT t IO«3PltISTO8JOCSPU^|XrrH 
ETKt^U»EEKFFVPPAVKNFTAHK:CEORJLWCOW^ 

HKLPWLE3LVCSVEMP3S t ACRAAJNKL IQVLVQH I PYL IZZiidSSSOC^IlMOCi 
IHTYPF^RNIKr^ErT3aTI^LAr>0NT*PraCTFL 

■:*• *fth:. — . •• o patsv.— v.- ;\ .:r}fT"T: 

v: J»-.v.J. — ' :; - '* : ' :!•>*: fYT! .F". r * •«■■ ■: .\:. vvks;^ 

LOKCVR V\-JF KVEU'kJNw L. V K w J ;VJJU-J IRVJIaA ^A Wi'lTf i JJGOAIAMD 
RFCYSCA5D0VSEECCFT7K XURIISS 

CPIU0894 1326823 1325888 

amn-AMP NuClaosidis* 

FAKLHCVPVrtCSHTSAAWPMLmXLDrig^PCJUCTXDLCSrLro 

CLMHrOVCOYTWVASIIWECTSDAYTPPEVPAlAKFVVOWT^^ 

KTTNIRFVtFHXKFRXXLYrrKAOSAOffiCATLFAACYRJfJLP ICALLilSDLPLWlCI 

imCSSWXFKrYTEimXLTOOIVZEIUKVMXJC^ 

HAfiCSTTSCSOY 

CPIU0895 1026973 1027557 

afp-ElonQAcion Factor P 

nXX^HVIWSTSElWC^IEXOCOPYilWNDrVKPaCSOA™ 
YKSCCSVETAPIVERSWRLLYTtaOECATF MDO LI't LCEWFVEKLENIRQWLLEOTrrTL 
VLYN5DVVAVEPP I FHELS XAETAPCVRCDTASSRVUCPAVTN7CAKXMVPX FIOECELV 
KVDTRTCSYESRVSK 

CPO.0896 1027574 1027822 

CT7S3 hypochatxcal protein 

EKYTTFTVR*i4EAiaXl — ^ 
KE»BVtTIW OAXAEPLnCV 

CPIU0897 1028794 1027853 

(phoflpbohydroUao) 
NFSXJS1irVI»10»CSNPRf1OBXPM<VMtI IHISDVHFKVLPVNPVHCnflOUIOXIJOCV 
F<n>VHFOATTXCOR rPICWHS LCAt»SV^ 

LP ^7 NKP > i^rTTJtSrJV!A7Tri^HFPrff! | 0 I.i O Q>nn/CFtnf Trrn^m^Y ^ - nf TCTr* lr j M F8AtCV 
VHMOXSAirmi^PEWIXANWfPLUSOTPSTOLIJOn^ 
fCHEH0AAVY>CACTSPSYIU«SG5ISLPTKSRJ^DLYPEKY0^ 
LEIANEA1WCQ KL 

CPn-0898 1030511 1028904 

Mitochondrial HSP60 Chaperon in Momolog 

TKKKUS5VXXUUXCVOI5E0EXZ«SNYNADXXLFSCI taCLFQ IVXCSYSPKOSLSPTSTr 
KI3(CTirAlSOT£LSNSY^NL0VDFAJ(AI^ 

CISTWaiASUrUJGflCI^EAIjTOSWIimALICVVfl XresXifMPTXAI»ryNAPCVVG 
PESUSXTKERE M JU'I fiM W FQ G* ILIPACYA^TYFVSDTASRLTRIAHPXJLrTXMUClCM 
IHSlXPf, IQEXSEOWQHLI XTCECXDPCVLATLVVmCLKUOVTVVTXPOXSTIBQ^ 
roiAITTCmCICOBaSHVXAPEKVTLGStiSIE 
AZCTCTSOJTTOttJKSTORXOSSVAXIJTOaa^ 

VAX^FYASLTLCTTPXDOADENS XAXSIXOXACCAPLKUATNAIUIXStAVXAJaiSSUTXTS 
USZSVrSREXeX^XACGXLCSI^TTSTILAOAUTAXLVl^KXUl^^ 

CPn.0899 1030848 1032315 

DiurF-Muramovl-DAP Ligaaa 
MiRCCRQNYKRAMLXEXWSXifLSWSCT 

CTQFIJCHAATACAVAA WSMiA QGDS > CLEL I RVDDTKS ALQEAC STJOCWLFQCTliVCIT 

c^vorrTTKErsmusrricniASPicsYNSQLTVP^ 

HQOtXRXVQPEXA\aTKIflIWAMKFP0OXOEXmKSYIL01C^^ 

SC8PtA8XrSF8FTOPIWU3FCYXAISCDSVVXOTPEEt<YClJ>IA 

SWILEVPEEGVTRSl*ELia.FPMRrEHSM!tfCHOArc 

GKXXLXLCKMAELCRYSEECKALVAEXAASRCQMIFFIGEXWI PVQSVUCSYSCEVSTFS 
SAODWU3IU(QVARYCOVILLJ<CSRALAIiSU-ACF 

CPn.0900 1032208 1033381 

araY -Mur aaoy 1 - Pant apept xda Trantfaraaa 

LVFNFU2ASMX PL I PM FUCQS LFFSLALTOfTTLVLTVAIXIVPVMICWUtRJCWYRXJYIHlCE 
YCEXLEMLHKXJKAJ^PWCVLLnSLI A5U,V^PVCKFSTWFFI ILLTCYACLCWTJO 
RIKlKRKOGHGlJ^AKHKFMVOIAIAAFTLIALPYlYGSTEPLHTUCIPFMBCMXaLJ»FWL 
CICVFCLCLALVA I ICT5NAVNLT0CL0CLAAGTMSFAALCFI FVALRSST I PXAOCVAYV 
LAALVCAC IG FLWYNC FPAC LJ>CDTGSLLLCCLLCSCAVMLRACC I LWIOCVFVAIAC 
SV ILOVLSCRLRKKRLFLCSPLMHHYE'rOGLPETK I VMRFWI FSFVCACLGI AAVLHR 

CPn.0901 1033239 1034537 

murO-MuraaoyUUnina-Glutaoate Ligaae 

FCmnS RYSCCLMEI DMCQR I L ILCTC ITGKSVARFLYOOGHYL rG ADNSLESLXSVDHL 
KORliMCASEFPENIOLVlRiPGIKPWPWVEOAVSUIPVVTDIOVAUCTPEFOW 
C ITCSNCKTrTTLFLTHLLKTLCIPA I AMGN rGLPI LOHMOQPGVRWCI5SFOLAT0EE 
H t PAL5GSVFIMFSRIML0YMRMLDAYF0AXXJI XOKCLP^OXTFWVWEECSLQNSYOXYS 
EEIEErU3KCOAiJCprYLHDRDOTCAA7AL^ 

KKOGVHY t KDSKATTVTAVEKALHAVGKC7V I V I LCCKOKOCPFPALAWLSOTTKHVIAM 
CECROr IADALSEK I PLTLSKDLQEAVSI AGrTIAOEGtnVLLSPGCASFtJOFOSFICEIOA 
YFXLL I REMOAVR 

CPn.O*»Oj 10.14 V)7 10)5241 

nlpO'Mur.tauLiUt- linv.isin repeat t.inilyi 

AVOOWIA%i:EVKM>mROMV fTAT/VNA [ LLVALFVTHKR tC.VKDYOEGFPNFASSRVPC*A 
WJEEKV I EKPWAEVprH P f AX ETLAAOF I ECXPV T VTTPP"/f "WiTCTPEVPTVAVP PO 
PVR ETVKEEOAf 1 YATWVK t&!3)F LER lAPANlfTTVAKLMQ [flOLTTTOLX fOOV tKVPTS 

yovi:w:mvn.TANrfMrv t vC/Eno^rvr i alrmi i p.loollkmnoi^lrykarrlxpoo 

tr.«W ».*»•! I Imvi:;»mi l'i.»i.in y* -m 

: y(0f a.*»ocwFv i : a *i j / : \ nn « :i« iwn/r::r..\ tvu^nuRsn »k al i kjwi, i u ai w 

A^[ i7MKW*R I)n J( I : : I VI J . i ; AAI t ' TIV \ t<Ut If .lit* SAkKWU 7^1 .T I J W ;»VK 
YI.VI'1 VAI.Yf'l .TRr.l.7ijK0UCMH.KI.TA I U' [ M LL IA I Ef-U« fcJAAV UifcJH IWIM 
T:n/RI4<YWl,UT J* VI .1 N i AYPMI YWYHLNVYLHPELD I W akaM'TCAKIAAtW. 

ki j / arr ;»x -jui v)KLTYu«Avrn// 1 aa j v akkftfuiklvl i uumtrrf* :ya i a i raj 
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riUCAALAKVtTLrtSMOAFWIl^WSOLLrnrv. . fTZXyTtZZ L I ANMCCVTLLLKV 

YDEOirrrnr u jcr rfrr pmc p5su;kc;pf3 
-:pnj>?04 10X320 iwm 

•nu rt • Peot iuwlycan Trans (erase 

RYMKXX I RKV ALAVQ33DGH rVPAJJVKEXTSRECIOVl^ttnJWMPSLOOCISYRt * 

?S!p™!npik iMSimJLCSCYUCWutBJtxroPttvra^^ 
LrtJtEo«r//rcKvtrr.rrRYAPn r^wr?PVTrHFP cpaeevfi^rtufslcspwicrct 

•arnrh-.v;:.- • Lwiurr •/•vAuw.ia'vr'a'r.iBiiv « •K.iLW.iir.tK'.- 
• ■ - 4 nrFfiKU.LVLL»ViLV :. ».vjat : wff.'p ; : :. : rtr*v t % i. s-.Twrrr: 
ovuxzrmi unLTcxu.viicvtmDSiaiRSNO^ 

CPn_0905 1037400 1039835 

nurC*ddlA-Kura»ate-Ala Ligase A D " AU "P"*i*L i^Xi...™^ 

VHYIOCTPgYKflCIMiaeALAJIILMWEVSCSttYES^^ 

SSHVPHMVVVVSSStAra"^ 

TSSLIIUUF0EA0ia)PSYA2GGXVUUarLiro^ 

NXWEHUtiyACNIJNLVOVIOOrSIWVTOl^KVF^ 

HxvsYNgwwosHFsnrwEYqoiEi^^ 
KxrscvHRRLnuansEsrLTLiXT^^ 

EXCLOTFPKAfOEADIVILTDVYSACtSPRtS I ILSDtAtOI RKSS^CCYVPrarVO 
YU&^RIHOVCVSUramiGEAX^^ 

YISPEFYCVSYTIINR0Ct>'RTCKDFTHLIEZTrOCDSPLSSErASALAK 

PEiriONLXOTsrPMivin'AHiiKSXGiPtvR^ 

SREXEWIGHSSWYOlAaPOTRr^^ 
VROAERWRAMOCrcSARIOmjJE!^^ 

xvdhfi idalhxfpkqqtieoaftk80DLVkr 

C?nJ>906 1040514 1039915 

CT763 hypothetical protain 

IWGSEVTZLVNDSOCJREASAFRIXIDFFXLNITfPFF^^ 

EFVAYX VXMLVTNPEAVEI RS I EPSDMES I KLEX RVAAEDIG Kl XGRRGWTI HALRT XLR 
RVCSRIJaar^IDLVOPENCTDVIAWDVICDNDSSHSTErTTC 



. CPn.0907 1040816 1040445 
•cut A Periplasmic Divalent Cation Tolerance protein Cut* <C- 
Type Cytochrome Biogenasi* protein) ' 

FAFSKTLX IKSSMTAVT.ILTSn > SEESAR5LARiC.ITDUASCVW^PKCTSTYI>n2C3CL 

CESEEHKIOIXSIDIRrSEIClAIOErSCYEVPEVtXFPIEICOPIW^^ 

PLSD 

CPn.0908 104160? 1040780 

CT764 hypothetical protein 

ILAILFMl X IJO^HEIiflRJlFTTCTUTPC^OYSXXYASXIJVl^SLW 
LSKFOT>$PIRNl^VSSTt^X\n>PTAIAOfcRX£^^ 

SLVIEKSPaTOmFYTLOTPIArVDNRSimJCNLECSCn^ FTSOE 
. DLXHQKLPKOCMLFTKXLZJCQ.AHZSPKX IDl^nAYPCEIIVTXSSGSLLRLPIICrLD 
RAI^LYKH}OaCSPVIESEM0YVTXJIJirPtTLLLKAL 

CPo.0909 1041592 1041966 

rebV-Stgma Factor Regulator 
IISUFTRmJ»XIl«LSAKEYGOIIVrnj0CSt^V^ 
KTTOWISSACIRUASNPiaVOSLQCIOtrLCCV^^ 
CLSXL 

CPnj091O 1041970 1043004 

laiaA-tRMA^Pyrophoephate Transferase PTGSGKTOVSIALAPMID 

GEXVSVI)SMQVYOGMDICrAJCVStJCAI«EXPHKU 

LSRWCV^ILVOCSCFYFHAfLSCPPKCPAAPPOXREQLEAIAEEHty^ALY 

Y AQTITlGiDKNK I IRCLEI IOLTCKXVSOHEWDr^KASREYCCRAWrLSPETEFLKKNI 

CMRCEAKWECIXEEvKCUJWIRENPSAriUICYREWlEriJrC 

SNSWHYTKXQKTVFXRYS X FRE1*PTIXSLSS 0AX AOK I AKDYLLYS 

CPn.0911 1044079 104298S 

Fe-S cluster oxidoreductase 
SLLLAIFWNYnCNl0(RISFEEGt£X«FVSSPXERI^ERAM 
PtrmilCKIDCTTCAPYRJCPKSPDAYU^FtaVRSU^R 

0YLE&LVR XTVOErPS X HPHFFSAVEX EKACRVSC X 5 X EQGLQRLWDAGQRT X POGGAEX 

LSERVRKIXSPKlQCPQWXNLHKLAKUCFRTTATMMFGHVEXfPEOX 

SCPGTYSFIPWSYKPCinAIJUWVPQQASIETYYRILJ^ 

LGAKALHYGADDFT^ILDESWWk^ 

TVSSL 

CPn.0912 1044120 1045760 

CT768 hypothetical protein 

W IMI^ONSFHTLETEOCSFLNDELAVEEVASTEST EISOATLCFA EKKVAF X LNKHRE 
ALTC5S0CjDLRL/VfDLPJC0CLPLFNEIEDTAKRADHWRCYXEI.TKECRHI^U}DEECS 
FWCQ I DLA ITC L£TO I LK FQECTEDK I FKOR EENFLESQ ALDKHQAFYKQH HTSLLWLS 
SFSSXX IDLRKEXI^^/CHR>^RlJCSKFF0Rt3NLJ»^FPKRKEUIEKVS0TFA!^VDAfV 
AKYFlGSDKETUCXTVFriJUCEIlOKLOHAAKRl^ 

r RQEOCRLR WS AENSKEVROKZ-AEVSS LLI ECNDL3KVRKDLEG I SKX X RALDLTHDOV 
inLlQCCMOOLFDOLREJCOOAAEHSYOEOlAXMCVKF.EAARSLAEn ITTFSKTCSECNIT 
S E5REEW0TLK ELLiKKSTUPP PEK I SUDNOLNLALOT I VNF FEEQLLS S PDSREXLVNM 
RCNrtJeORRERROEWDKLEQCKKLI/SSGLDFtt 
00 IOOLL 

i.'PnJttlJ 104570-1 1045945 

Nn n^iyr hfmoloq present in i '■wtHMnK / Embl nr. ot 1 1/7 .»•»*» 

U |. * TYPR t EATD*.;A I AMRRNC I YAFl)L£)fjTLLRvJI^W;IFYCffU*U,x;LF.Trt(TLPPt * t 

YKKPPFKFFRIiniPSl tR 

' 'huff 1 ! 1 4 UMViO'i 104*1"* 

Nil ffiUml (KKmhUkJ |/Unu!ftt in i^nelMfiK/EMUL M Ot 1 1/7 /'IK 

vffwdc.p;:k7y:: rvTT<u^.svrco^ 

v i i*y^.-.:i t »f iviiri aoqu : i : i.'.vYA.'aTRto^Atvr i ykkclt »kkav t urn kk i no 
AH:a rrr:;« i c uji .fKi>UijEEKTVvR rxxnt lkkkatk yywn i v 



RLEELMC0CF I VTCK ULAJ H - ' "* ' 

CPnjmi I04»«t) -.048084 

rjbr.A^l Carrier Pr*5tetn Svntnase 

• \ m t tv. ••••• ** f - "^'-v^'v "va :.*«v.*:'W":*r:".: •— — .t.T!T^!."» 

- i , !•»•■• ■: if* •. , v.\r/»: ■ >y : '•" y !{*.■*■' *' T* *ATTt**^! 

d^yc« ; vscraom: :^x*t£aavnr : ^LZjriMMM^ixio^^^RWQmocr^ 
2t^Tlvletlejauuidapxfaemlc^y\t^ 

L ^pSvWVKAI^TrPU=IJEVl>VT(KA^ 

WAIOAILTCiaJ<PTINU^IAEIEDFTJWANKAODWDIW 

RYVP 

CPn.0917 1046054 1048539 

hydrolase/ pnospnatase nonolog 

F™itXVCTt.Wt4KTrrEYSFCVIPIiriTCTPOIO^^ 
KfiCPOEAAEREI.VEETXXUV\WFrPia^teOYSFKNE£ 

pwrTr re trt ?t flr^ ** Lsr?tL*DLTvz*atriutmsss 

CPfl.0918 1049232 1048579 

ppa-lnorganic Pyropnotpnatase 

S»>ej (mWAHP^PTLTCI»VESUCT^ 

NFCPa.YCLXWYCCTAaWSCECTWlBCIOCOia)PLDV^ 

lOCLRX X OSCCADOK 1 1 AVt-EDOLVFAE X ED ISXCPCTVUKIOKYTLTYKATPNMLIItC 

SPAJCIEIVCXYCaCXEAOKVTOl^Eim^ICXTrAEVN 

CPn.0919 1049375 1050430 

ldh-leucine Dehydrogenase 

FMXYStiiTICEXKXDDYERVIEVTCSKVRLHAX I A I HQTAVG PALQGVRASLY SS FEDACT 
DALRLAROffYJCAl ISNTCTOCCKSVI ILPOWSLTEI»<IJlAFCOAVMALECTYICAED 
LCTVSINDISIVAEETinfVCCIAOVSGDPSIYTAHCKFIXIICCT 

pSmWOKIJu^CIUlIVC^O^ 
ECRWAPXEVUJCVEXaPIVLSKLYNOSXTTCKX^ 

CPQ.0920 1051423 1050431 

cvtO-Sulf ite synthesis/biphosphate phosphatase 

riJDISMKSEl^HYONIVESVVTEXTOtXWfRSEHRi 

LKQOXJUCAFPNIPFICEETXYPDOOWEKIPEIIJOTRU.TSSVSRO^ 

LPWTOPIW>iinRJaUfAVMSLIYrYlU»I 

KSWLDRRFWADIU(QT1Q0FCEA5LAALN00^ 

AfiOAVDmRYPnDSPARAWBHVPGAJXVEKAra 

LA MHAJEIH11 lU OkLQNQUIVVPTPKLIAL 

CPO.0921 1051526 1052293 

snClyeerol-3-P Acyi trans (erase 
GEEMLXlOjWRATYESCCTLVCAXIJUJEYR^ 

XLKWWSRIT^PKAVEYWSRVVQWFI^SVRSI PIPOLVPCKESKRSLEJB4WCm 
ASWttNRCESLU.YPSCRWRTCIKEEIVNOYSAYVXXKRV^^ 

PKL CTAFREAFRALLRRGI FFMPKRmXTLCOVDHlJUCOFPTIGOrjUirnAS 
WFN0GDOMLPXEVPYA 

CPfl.0922 1052266 1053927 

aas-Acylglycerophosphoethanolaaine Acy It rans (erase 

OFAiUtSSLRrnUaJUUCKDORimCKNNHKLRlAPGSTX^ 

CSLSYRELRNAX IAVAI KVSK FSECRVGVMHPAS I GAT I A YFG I LLA UM I M >e4 >*iC0 C I 
RELRACTKTVEVRRVLTSOQ FIKHLTEVCCFVEYP FPLMYMECVRXRX<SWCXCXIGLY5 
ICSVPWUJlIFCVSCvXSOXyrA«UTSCTEXLPK>VPl.TO 
DVMlArLPPFMAYCFHSCC^PUX^WVVTASWLNPW^ 
DYXXJCTAKKGNSCLXCSLRl.\AnaSDAXJam.YEET1CKX«P^ 
TKESPRKSECVGKPXECMD^XISKETHXPVSSCEOCLrVV^O^ 
SUJCOT/YLTCDLCH ICPSCDLFLECRLSRFVK IGCEHVSLEALES XLHEHFTENONI&A 
GSLWCGXPCDKVRLCLFTTLATTIHEVNDILKSACTSS XVXISYVHQVESIPXLCXGKP 
DYVSLNALAV5LFC 

CPIU0923 1053966 1055093 

bioF.l-Oxononanoete Synthaie.l 

vrKESFI.TTSDVXDFVTW)FlCFARSPTIYCEVSKRFOIHCCCFPHElCI£ 
SSVTDOLESKrASYHCAPNAriWSCYKANl/:irHHVSRSTDVU^EEV>0^^ 
ISCOHHTFHHKNIXHLESLLOCY^ISSKCRIFIFVSSVYSFRCTIAPIXOIIAI^^ 
HLIVDEAMA>CircCDGIGLCHAI/rf DJFYAVLVTYGXAI 

SPPUIYSTSWPHTLXSICTAYDFXASECEIARKOVFIOJCEWFMECFOSKAPOCVOPXFL 
PHTCLEEA X SVLETTC I KVCWATAK HPFUlV^^J^A YNTVDEVNU>OVHKPY[i)CSSKR 
VMXNHEFMLNRELCCH 

CPn.0024 1057)01 1055028 

priA-Pr taoaooul Protein N" 

KRrTATTKSHC^rE^FRLYAEVlVCSNIWCvXDYCr^ENLEHrTKCTAV^ 
•A7/IY0 I KTTTQC KKI LPXLCLSOSE IVLPCDLLOLLFWISOYYFAPUJKTLKLFLPAIS 
3rA/X0PK0HYRvAnjcCSWKTKEILAKLEVU<PSCCAVLKIlX0HA3PPCl^SLKCT 
SOSPlHSLEXl^ILCIVTJAAOLELOEOUTFTPPAPKOLHPEOOSAIDKIFSSLirTSOFH 
THLLFC ITCSCKTE t YURATSEALK0CK5TX U.VPE IALr/0TVSLFKARFCKOVCVUIH 
KL5DS DE3RTWR0A^ECSLR I L XCPR5ALFCPMXNLCLX XVDEEHOPAY KQTE3PPCYHA 
POVAVMRCKLAHATNMIinAT P" LEJYTN ALSCK YVLSR LSSRAAAAH PAK I SLX(04NLE 
PEKSnr XtFCOPVLKK lAERLEVOEO'/L t FPNRRCYHTrfl/SCTVCKHTLKCPHCONVLT 
FHKYAtr/LLCHLCNi! J PKDLru.XPKCLCTMTLQYRCSCTEK I EK I LOQ I FPQ I RTXRIO 
SOTT1(FT^IIETLLROFATi:KACAfLlCTOM I AWTPWFSAVTUAVI LMrjDTBLYIPOFRAS 
EOVFOL XTVVA* «UVRj 3tf.l1 :F. I L XO:;n.PPII PT I H3AMP0DYiAFYf:0EtTCRELCEY P 
PFIRL If-C t FNX^'rK^h^llhVlfMlLRaMX^TNPL^r'/TPCtTIlFK t KDTFRYOFtI 
r5AW [ PWKKUiHALMUNKUM-rVKPM tWDttfTTFr 

'.PiO^iS 111'. /-US |i|* a .'.*2< 

'TT't'/'i riyrwr.^wf i.mI i«i..«»m 

KjfWLPNDKONFi irm i x>u t» Y.-:FFJ.Yi'ri-\:niNVTLrf ita i CMrm: i tekavevpn 
aepui' iTPPPiTriuvwrKr-:rivwKi -vm jtruu'««A i lf.exypalt(/.*::li'AI'K t rcsi 

r/YKR »IEEVr.FFNRI ^UC 1 1 .T'^l J i TKJ .Tl. 1 1 lAKTNt r/JftlPNFFLM AIH t RYU I r 

TTOYi fjZLTvnr. i vuvvt::. :i J&mttA jcicNUfA XLNRLPPAYTir 
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'rpn^onj.. IT.SOn* I05MS7 

Ttuorffm-in OisuUtd* tsoraereee , J flinrmil f 

cwrorrY:rrRrFia>3DKKPww^ 

rjwmAEALEMSXOt3KXPrCLrrPC3DMC>«C I KHQ DQ I ^SSEFICKFAGVH UOjVtVy F 
TOIWH0PtE&«0»CI«EUCAOYKVWrPCLVFXnABCKOLAR^ 



SWMie tnM*7n 



rt xsnLFiPiJUiJWuacicomLP^ 

SAADRWIOYTOtWDSLAtKI PHALPHRWLYSQO^UgKLTOCTg^t^Tg 
SNU.vrNYPCDtSSKGEAICP^VKSYOAC^^ 

pEinYNSiwoon^saajTOE>cv«pnja*iwTSCTKiPiPiR^^ 

VDRLAAVI SNYUDSENRK50CPD 

CPn_0928 1061035 1059884 

•CHLPS 43 kOa protein hoottloa_3 

RRKDTAFTLLNLSNHSDI LSSI FSNPHPVSYFSSWAICQLSOrSKKHPIL^nff^iyiCI 

FKLLXGLI IPPLCIYWLCOl.VCSLKLr PRSSKLYSVUCtCPXICtWJOClOOYfVIW^ 

SFKDPAVSESIOmXQQOU*XDTLAIHFSTARPIlP^ 

SWKELAJai^It4YWP(7VKSS^^ 

VSlOTVQSAAl^lWPrTOSITSWVAVKt^ 

OWSRELPCPCILVYSADIOTPSEVCODrALLPtmAHAIJamTW 

SSPLKHPTXOXLAEAXLESUXXN 

CPn.0929 1062301 1061186 

•CHLPS 43 kOt protein horoolog 4 
EXniAPIHGSNAFVEDILHSHPSWTYTSSTItt^^ 
LXCmLPLGIYWLCOTLCTNSXLPSIOKlAKXFiaCOPl^^ 
SMRRVP ILODNNrt^imZICX^APTlWWWJSIiKKSI^ 

ANILVYtJYPGVKSSTGSSSLKDLASAIOnCTKYIJ^^ XTYWSXOIUOAE 
ALRDQX XVANDCTWI AVKDRC WlSVtT^HSCSKiaKiMMIWanXVW^SQULPC 
LEIFLYPTDSLWISTVRONKIXAPELTI-MUIIWSI^^ 
VALATPXLXXLS 

CPn.0930 1062851 1063330 

No robust honoloa present In Qenebenk/DOL as o£ 11/7/98 
NKHSgAPCSTCLOHVPKIWWALDTW 

viocaiiLiLrssxAUVLYiocrREwxAixpuwxsKrasxiopvKnwASi^^ 

PAYTHTHYYIXttMVTYPXIPRFHLCSYLAXA^ 

CPn.0931 1064078 1065718 

lysS-Lysyl tRNA Synthetase 

X OFRVLGMXSDXYTNI LEEPJfT AMEyUKEZ^YRSKia^XUEXXSVVLYPYCrPCVFS 

CEDIKKTFASOEUWSEAAMSRSTPIlVRFAeRLV^^ 

ErTSVHGl^£DA£ITPIKFIEiaCLOl^XU3IDCrSfI^^ 

NXyaOA£MCPPTrn(EA£J1SCMF»X5ZXZAUCKXLVa^ 
PBTTIIIEAYMYMUYXEVMVFVD^VD&VPJWN^ 

MI700ESX ATYAG IDVT7VHS0QKLKEI LKXJCTTFPETAP ATASRGMLIAALPT)ELVSX>tt»X 

APHHTTDHPVETTP^lCTtASGrrrAFVZaTC^ 

TltmJJ»DSraPIDOTraLC0GHPPA«3roX<^ 

FOAEJCTN 

CPn.0932 1067160 1065721 

cysS-Cysceinyl tRNA Synthetase 

VKSPTVMAFSH I EGLYFYtffASOKKELFFTMHTPVRL? 1U.P T V Y U VAHIQ JF K U * Vf 1U 
IU0^VFTCflrSVT1<V>MITDVn3CT 

DrYPHATHyiPOMXOArmiJQCIAYXC»DXSVYrSLWWWGia 

1 5MTEVT1f PHPSDfVr iHKft VWPfflDCVTYWESPFS rtiff Pq W HirC? TMWTT T / T? t nr w 

AGCTCNIFPmENEXAOSEALSCXPFAP^WljfSira 

FTCOEVRYMIXOSHYRTOUnTEEAIiACRJttl^^ 

SSOFIEAFSRAlJWDLNVSTCFASLrDPVHEINTLiro^ 

GVT.PLTTSVriPCTVMOLVA£REEAWCnCNWAHAOTL^EILAACIXVEDSI^ PXVKPL 

CPnJ>933 1067532 1068578 

predicted disulfide bend isonerase 
PVXUJQNXKP£SUCOU<\^T^LSULPTLEAAOIW 
PLLVIFSCSDW^PCMXIRKr/lXSPE^IWT/Ora 

KSK^KINELPCMILI^HE£R£^YRZC5l l ^NCTCS^II^0SLCHIVC5DSUJUlAPPKK^SL 

SLSE2jQRYYRlAEE2^HKEFLKKAIXLO/P^:7frrLo^ 

KUJ^PKNEJCQTHFTVALIEFQEIJUUlSRAflVRO^^ 

RVQe4l AOn'l^StX^WHAUJHAEVAPEAAPNrVRSH XSRSLEYXPJtCS 

CPnJ>934 1068948 1068526 

rnpA-Ribonue lease P Protein Component 
YFVHPLTtJ>KQSRVUCRKOFLYITttSGrCCP^OAT^^ 

CPn_0935 1069100 1068957 

tl34-L34 Ribosomal Protein 

EDTWRT^OPSKRKRRKSVCnrrRMATRNCRKLiNRPPRHCRHSLVDL 

. CWljm^ 1(1**3 30 1069470 

t !">'••(. j>> Ribntiooul Protein 

YUflCVSS A'KADP^KJOKCWlWCCRLYVTMtKOIWKOROACPARKK 

:in_m>/ I0«»4K7 l0ti**7'lM 

r.;M -:;M HiUm. vo.il Procein 

/X W4AK K: ^: : V AJ< KAKN KR LVEANPK KR^LRKIVK" LZ7/r. EEEKEN AR I JLNKMX ROTS P 
TMJMH I.I.T»:Rm:Vl4<KKAI:;RIi;PRuMAUMi;EIf'r/lKA;» 

■ iiv.-msi u)u'«H4« 

'T/mw tiy)s«tif« i.-.il |Mi«r,Mn -Hauler »'.*n f*?it i«ii< i»t?r ipluwelcl 
...t i n! rri j»tmi.i • i r. i ij .pyv t u *:l:;ay iadkktpj n 1 1 wpfa* wfof i^lwllll 

I ■: :HWIAI 4* l\*II4VTN::iU^POl.KK.«;LAr WOK t r*J.7a,Wn I V 1 UTFKWFY LNXDRKNV 
' 11*1 .JFUW Wl 4 .n VTVPRK I WVWKW *KUW^VKL7K;imjALKKA:1K 



CPnJ>9)' IT • l« 711 " 

CT7 TO hypotnetical prot-in 

1 1 I NRWT IPXSLTUXLSTVLTTFSICI ll.lUL^^WWIJ'yKnBXffl.y SJ Ul^l jf 

VWEOTI«IVSBX3rfTOHVAAVWX IICLTQPXDR XOmCEBVSVCOLPSPC 
DFUJMSB 

rpn.no 4 n I07lfli0 1071204 

ER I PFLKWCTAS I CT rV/SKmAIXi-OO^L* KCHH PKYN^"LIJCCDICTFTCtAI*Lfl(SW 

Si^fci^rJUxrsso^ 

ftf» S^ Apf ^ y L * PLJ L<Q CTI JP^XIJXKSKXEEV^^ 

LAYRIUUSAYAATTt^STI.PYOD TON ILRMSOYPYR X En^TKA*0e3CAKATCVYIV7T>aiJ 
^UP ^lF S X0SOCTqWDLAX4JIVIXM XO 
Tt^^IOWlAKDCSNKSlCIJOCIXXrCCTrP 

xstswoncKAxraaupcxccvnox^^ 



CPO.0941 1075504 107301B 

■utS-ONA Mlsnatch Repair 

wrtagryrPleXIOWCOCtK I^CDSVUJT^KXOrYCAPYODAVX J SQKLC.TLT0WQCX PM 
SCIPVSTVOTrVOUJGWrKVAVWPCEPAJtlXESXKXCP^ 

flSTOAIVHOLOOHlJaJX^ADWAPWF 
AGCIJ^IODXIiXPTIWIAXPCTRClCOOl^ 
jHTSTPKWUiROXUSPPWPttXLVIOM 

HOi.Puiwix»iin^KnwDuau^ 

GmKV^SXnU>QLPWXIIR0S1U4(ASIim 

ICanriTIGAGrKLtJULiMS'I'f H"/PtAETAKI LJKKATDRSLVXLDE^RLi'lSnXCLAIAQ 

AVVETLLTrliOOKAICItJATKYlCIlJTlJa^ 

OXSPQXHVAIUJCPTlWSRAQQXUQX^PeSXTRPAQC^ 

CPIC0942 1075955 1077754 

dnaG/priM-CNA Priaase 

NCSrriUcTAKrTSXSUMlMSXDXVDVLSEMXKtJCllS^^ 
PAOUfYNCrOOOMCDAICYU«9KLBYSrr-AXL^^ 

Duasomx^rACPTGNWPXFAiux x PwojAx^wxcrsARjcpiawoonrrwrPET 

PXPIXSRXI<reiilP81UUUAa*3CVXLVTOAtX^ 

Lfflaxrajn/rixrDSPtJvta^ 

XAXXE0SOOn.T^XSD04SSYPKFQPfttXAXXVtXAIR^ 
|fl4VPtIMVLSUMPOVTAnQNXPI)C0KVPXrHP^ 
PYfVP«riOIPtCIUO^PWISYYDCYRIOWPrOEACC^ 
TXrVQSLOia4ADilRMUtWl^WNXODIO^ 

CPIU0943 1077972 1078238 

CT794.1 hypothetical protein 

P F IO mm L LPFLSVX LCCCMLLSSPRSRAISVTCS latSAVKTXVLSBCAKinJOXCr 
GVUASSIXJOWWWI^CSIAAOKCVM 

CPh.0944 1078503 1078997 

No robust honoloa present in Cenebenk/EMBL as ot 11/7/98 
IltXl«OarmPlAAII,IFSPST.VRAXl£PSEN^^ 
ItXOtPCXLVPTSERPTPErADLlT43SrSLSTPIAKGPWV^K 
ILYltSSPTAPTCmvSGPIUjCYILXDE0OC^OAVl4>LETKN 

CPn.0945 1079001 1079660 

CT795 hypothetical protein 

S I PI0«ILPSYPOTTOJLRRNY>IRXAL5L1^LL«I FP irCEESRPCSECCNSOTOEIVC 
SOtn^VCLYKSYfX^XAASAXElSXPLVXVVIXNSCOOCX^ X 
PSllJWrWLWSCVN^ryPPXEDPILAEJVKPTCELritOESrPTCLSX IWCVTPECPC 
. DIICVSPVSLTVKEXm^ECrTCVESTSELOSEDPAIA 

CPn.0946 1082816 1079745 

glyO-Glycyl tRNA Synthetase 

CECOICWtCYrLESPVSeMPLTtOSMX ATI LRJVSEOGCVIK0GYDLE\ICACTrNPATrLR 

axjgpepywu^yvepsrpjoowycvhpnrlonyholovi lx pvpdifulytcslraxgl 

[XJUWDXRPXHDOWO^ICAWQLCVewl^ 

ERIAMYIX^KISIYOVIJ#n7rLTYGQITCASE)^WS 

RTUCNCLSVPAYOr^IICASKAITirUJARCTISVTERTRYI^ 

UWPUJJLSSTSEPKETSESV^ISSTEDU.lXICSEELPATTVPrCI<»LESLAR(^ 
TOHNIVYBCl£VLCSPRRl<ALliVK>IVAPEVVO KAF EKKC PMLTSLTS POCOVS P0OCX7T 
AS0CVDISKYO0WPJOtfWXRTVM3Sl^flJLU<PEIP4Jn'AD 

«VWt3NSCVEYARPIRWLVALYCEHXLPITLCTI XASRNSPGMROLOPRKISISSPOOYV 

ETLROACVVVSOXERRHII EOCLRAHSSDT ISAI PLPRLIEEATFUDIPPVSCCOrSSO 

PCAUVELLXAEMVIIHOKYPlTHLTSSGAXSNPrtV^ONSPNOT 

CEFLP1CC»L0TPLTTPTD(LICSVrYP8MJCSLYDKVERLKAMOP^ 

A I OYCKADLVSAWN EFPELOC IHCETf LKHANLPTASAVAVflOILRH ITNOOKLSTXCT 

LLSLLDRUKLLACF I LCLKrTSSHOP"^ALRR0S LEVLTLVS ASRLP IOLAJLLDRLADH 

PPSTIEEXVWDXSKTIHEIl^IlCRLKTFMCSLErRKOElAAVLIDSATICNPIEILOTA 

EAljOUJCEEKrEXLAVTTTTKNRLXX I LSSLKLSHTS3P t EVU30«E3tIPKeVLflAPPOP 

PKETnAHAPLEYrU^LAOU^IOOFUrrVIIIANDOGAXRtlLR ISLLLTAHDXFSLCWE 

SVAV 

^PlL.^ 7 IOHI4II l'»H40*/l 

pqsA 'Uycorot i I- Phuephatyp/lr. r.irr.il^i.isr; 

ttcvclpny iTpani jp xtr i pm i lylt ':rwpr; rrrwr ,i-r/t XAia-\ ma .TtwianvA 

RKF.%'/TDt*;KlXDt'MAlU t YH \?\ I Yl>TPT>Jt'l VHf J'ULL'/P I n ARU,Vl;rTLRT\VAP 

k;pat/aara:» ;ki4<ai ;v::m.ru.yn i hi::u ;lc w:UKl pa.- vrvri r iavy:: iaj 
i t BYPvetNiw^'AJHAirrKi^-.KjcM u 

•••HlJIPMH |IIM%4H1 HiM4ii47 

•j|i|A ';iyi.i*i«<i :V»irikis:f 

'IE:»Oi l VUVAVKFTI' l VRW 1 JU •4jr,VA*n~:KI'l4SKOWA'r/U JWH.UXK^SWUIF. 
Pr:Pr<EPLCKWJA: ^ I ;nf ;:YH:| .TUT I ITI« ^ I FXnnT:r/ , ^;i-><WVf<**:'AKAAAAAA 
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YWCAOrALI I VI lUCSOI^LLACLLKff PLK7VH: r OWfGYKCYCfTOLUSASOJD 
r*!Un>Y0UnU)rgTSVli(XCALYC50Y!TTV2L; . VOCZXHDrcOYSLKOAXLMtNSVr 

jr; : :m; roEDvwwprrDPALAVQYDA^u^EPwa^KX^^ ssoyppu 
:v t^u i veexcpepkke 1 1 lhamehsyaf t hctsqncvllnztwlqxusspwirli 

LOFNDPLAJtLTYAAADHlCIPffKREACCLTCLIAMRYCTVPLVR 
TrFOTWFNEFRAMLSNAVTTYTOEPDVVLrfl^ 



10R5RR7 



I1RMRT 



KKFU3MLESGALS5TVF jLSYBCR I IKALVXDIOYQ ITTYDV tHLOFEELVEDRPIKUU 

PIRCIWVDCICWWjXSlJ^XRAVllVVCXPKDXVPfT^^ 

ACIETITPUCEVAITVSRR 

CPn.0950 108(470 108702? 

ptn-Peptioyl cRNA Hydrolase 
psLEtMUKLivAXCNPiucrANrxm»cruju)iu.vei^ 
sscpwxjcmFVNiJGJUivvtAjacYFW^^ 

«LKSITASLGSNEY»«LRrt!VGRPLEXXa^^ 



1087457 



CPn.095l 1017113 
rs6-S6 Ribosonel Protein 

€FL«)UCETOLYEt^YVTSVTLSEEARRXALnmBGI^ 
RG AAECYYYF XYTSVSPGAXTEXJQCEYHLJIEXS^ 

CPn.0952 1087469 1067723 

rtl8-S18 Ribosomal Protein 

GE 

RFOCVLSOAIKRARHLCLLPFVGED 



FZTERCKVLPIUttTCVSS 



CPnJ>953 
r!9-L9 Rib 



1088248 



1087727 
ol Protein 

TARPGYVRNYI^PKJOCAVIAflAGTXRIOAXIJCEO 
_ _ X*IVLEr0VRVDPI*M«GSVTTO^ 
FPHAHYAIKNLGXKNI PLKIJtEE\^ATlXVEVrSDKEYVTVtAOOXOTCDJ0IC 

CPH.0954 1068259 1088708 

ychB-Predlctad Kinase 

GwevcvTCDiicrpspAicuariJ^^ 
rwfflu^i^rwKsiiirRRcrowoPvswHLKKSi: 

rQTHrPITTWWAR£IGSBVPPTrLQEW 

CPn.0955 1088612 1089175 

(Crsme-shift with 0954) 
RAPWYSYNNIATXCSRNWO^STFTSSCTJajCKr^ 

LMSGSGATLFVCn£ELB30S^ 
LECFQPQI 

CPIV.09S6 1089545 1090909 

CT805 hypothetical protein 

LWPwii^pYSYsi^icAAvijTCsrurrrLTPKLracosY^ 

SELTOl^RVETVITl*lAVTLFT>^YTECYKr 
IVYFA£LVl^IAJCtfimPKS*»*mJaAPPU^ 

SRRPAYAT WLLPSKXS ICCLTSWSSRALTL IFPALW£HSFTLSKFAWKAJVXILIST 
TX YYTXTKKE J EKP VDrPSDWSVlCKVPWWI rCWI XTVGS I ILSRSTPLFK3AIALTY 
USFOXrriTYOTPINl^KVCYVGIJTfACLWra^ 
IFU»ALVWYLVM^ATXrY7fY^^ 
WCWLTU5AICPS IISZjCVFWLLICNVPETLyCFTR 

CPIU09S7 1093812 1090963 

ide/ptr-Insulinase £anily/Protease XII 
KrrnwciwrwicLicpiwcTSLsmcEOOPTCwpwp^ovsw 

CLPUJISOPNLP«AAIiVRTO^PEE«atAKITEK^^ 

SE»BVI«AFTYPMCTWrSVEHSAF 

AHPt^DOUWHRIOQLVAPOGHPCARFGCa^^ 

YTSAftSKAKKOraXFWIP1lS»*YWEPr^ 

HIYXSSHPIPLCCYXAIAEVLIWESIWSLVSLrjCNEOLITO^^ 

ELTEJOOTWYSOVIKTTOYlJmOEHCIPNYT^ 

rVSTOTFDIi?TYPYHSLVYPKYSSrnF^Ar t^vsdpeqarfvl^sknsehweeat^hd 
? I FlKTrmCAUC^DYCKVOSLKP IALPXPNLF I PKEVTLPGVHLLKXQEFPFAPALS 
YQDt^TLYHCEDHYYTAPKLSSQIRIRSPQISRSS POFLVATELYCLAVNDQLLREYYP 
ATQACLSFTSALGGDG IDLRVSC/TTTVPALLNS ILTSLPNLEISYETFLVYKXOLLELY 
QGAUJICPVItSC^P-ASDVIIKlTVgMTyifT CAf rvt cgf^y^y i cvt roewt M«n 
CJlI^EOOKKDYLEMLOVTTASRSSHATKPTfYELOSOEISEIHHDYPLTAfCML^^ 

33P5:ociw:AEia^oiajo<iTrEZLim)oou»M^^ 

EEXJAKTSUTinCVSASPEXTCISOEKF^^ 

PFVKFSTPI^I AIACTLTYEEPXKYCOCFI^ELOTCTSVYI RG7QKTS 

CPnJ>95* 1094803 1093793 

pUB-olvc«rol-l-P Acyltransterase 

iyrat/>cfsrylpyafwoylpeplyokfsvfkowidaatkxaaadoa£viclowvkv 
f redlkmppipppykkkirapidlfrlsidftslvi ddknsrilnlhrlxeieeyi aflcd 
nwllankot ecd polmyy alg kth pelmenm i fv acdrvtsdplar ppswgcdllc i ys 

KftH TATPPEXJiEE^LU<hKJ)tSMOILJCTLl>JBCXIKFIT/APAOCRORK>UECRLYPSEFSP 
ES IEVTPXi>XA3N0TTHryPFALPCTYOILPPPPKrQlArC£ORAIPFAPVFFNFGA^ 
nJALCwKEELIHCOKKAOWTLRAEKVFSIVKNLYEEL 

':m_«wi itw*j7ii ium479<> 

•MrC-Axi.il FiLtaent Protein 

Agf 7*n f^RKVMGNEILiHIC*KEIRYAIIU(NCOLP&LT (ERKRVROLKCN t VRCRVTN I 
I J<NrQT^F fNTDERElAlFIH ^TOrL£J^KKPEQHFW<DVQAI.PEEA£EAPLLGoEEAP[E 
KKlJiLUirviAW/KEIU^ICUARLT^^ 

P«JL I k-" KEMI Vf ** JL f l.*RTAiTTTAljPEAL (NEAHDCLLTWKT t LEKFYSTEOPCLLYSET 

Wt*rAYITi:CDF.OTKKLLlDOTAr/Q^ 

KlSIWATHHKIWL^iTVt^FTJKT^ 

MOI^LPIWra:i.VC tD^£DMK;TOlCWRRVLERLJCD<W?/XIAARCTrUCMSEF^LVOfTROR 
MlfF-U44Vfl«fTU.*PYi?:XWAC !KTPESWtEtERCXJ f KVfNIIKDIAKlJCLWKPEXA5YM 
K^OILWJWI t NLAKOI JCAKl«tNTSO:WltUO<YOPPSL tTCES IOL 



CT907 rrYpotnatic.il pr-»'#tn 

VCSCVrOCRPL [RQEl^JtSOCFEICSl^CPERICN ( LXFLEORXXKOCKSPPEYL 



CPnJ)9«il 
rl Rn 



109710* 
, pmrain 



I0»72i7 



CPn.0962 1097301 1099275 

pltx-FA/ptwtpftolipid Synthesis Protein 

ILSOFKXVOIGIDLK3GDHS PLV\>C^VTMJ<£QSSTI Pf AFTLJAXEXIRJCOIQtXFI 
SXXJCSOTKIISAa^/AWtBSFLAAlRIOg S S^ 
TUUUUCII^IVAVSRPAU,VCmi«CHAVIL 
DSKmXCLlKIGSttRIBTEAWWTrRmJ^^ 

laarLKTAxcvpmoRzx^oxLSACxoRJUjrrri^ 

UFTCILCSIN L AQARLCTRILSKLI 

CPIU0963 1098374 1103224 

pjsp_21 -Putative Oucer MeBbren* Protein 
TPUirWAMVAKinVIISYIlSSTSH^ 

AHVIEA(yT5VlJCCSDf^SqXESEICVLYTWPLTC>SS KXStXILMKinBiTWLFIZ 

TTvroiPOttvwsDi g ii twrwP i qgwn shavsdcie - - 



C)G0VTLTOATGU>LrAUU.VXI^^ 

UALGEHIGLTDrKX3GAI^AOGTLTlJNN^^ 
VAWOHTAALTOCM^A M Ds^I^IICTtSEIl^ 

TPWIGXYVOQCAIIJTORVTIOOiSaOmi^^ 
Pgg U ^HSCH YP^^ 

vDiSAwuvEPvsNSsacnmvcyaiasv^^ 

TICCNOC2IIAFTtl«nmKEM0RSCCCAI XMI5SVNXTJtMACDZXX»NnOSYCSArrV 

CSLVASKlSNPim.TrrCNSCmrLFAJWSTOTAASLEElCr^^ 

V8PyGNRAfSaACVOXAa30T^CXXAfan)XLPB2NI^ 

VQDSani^DAITYEPfflRC^PqPVSPX^APSUFNSKP ^ 

KXf^XAAXOBTTlJaSaNAilMJtfXJOOCroSSX^^ 

EETLVSAGVQINKSSPTPraCSIUlVinTVa^ 

C TgJl CTa j OtJritDPTTIVCYPWAU^SMmiP^ 

vswmATYoricvwsEsmDcsLvvwpTm^ 

LK0E27AKHnAORNEXI3F871«W5GtCV\^XCONXCEriX^^ 
OFtJarcnOFTQCTESOSYKAffVZArKSYKSAAYAC XIJ^PWUXSAPVYtV XMCtttTTD 

YonaxncsiaaGBFXAOTSxxmrxvimuirxsAiv^ 

IJAVKilUAlR rP^IPTCrAIJaUYSRCSIUUVKS^ 
AYSMCSYGVDXPaCAMUUUAOrrOmSYLST^^ 

CPn.0964 1104812 1103301 

no robust hamoloa present in OenebanJc/DOl. as of 11/7/98 
rfcrt jreTTirvrvT TMMgirMWWgMOTgT^coirT T^rvwT tpit^o ! y Pr ff r fiiTTTiyft r» 

O0m/TNVA0VXJ(PSSLP5KSJC]aLKX^CXSCCPLCm 

IVAflLRLntPIJPIOMVCPLStyrTgETyEVIQPFIFAL^^ 

JUUW>NPnjOtt.VAtSP0TOtAIOUIPCt£S0XJ0CVLKS 

EHDgypDKKTTPrLriaLIEALTCXSS U ' R r PS I HJAItJAALFI ASUJL 1ULP IHHYTT 
RSXNRLYSXANSC^LXJWVQEPnSlEXJfSXODCOCMEI^ 

PUOCYIMSGNQUmSVFK8Y«31CMPET^ 
DPCTYXPMKAC IAYLLYVUCI X EL 

CPn.0965 1106769 1104925 

IpxS- Lipid A Oi saccharide Synthase 

IttFSFSlCVlZJaUPSGLVYlXYFLCVLASX^^ 
ICAT^KrVKGTI0S0FPVTVU<VTNLirnJOa^m 



WX Q ,LT «r .TAI l ,VYrXRXCDPXNXLCYtXaLrPSIA|gJU^YK^^ 

EASCOX Uaaa^QSIXSLYPNIRFWCVaCPAMROECLQP lUWEETOVSGFAEWjCSLFR 

LYRNYRKIUCTILia«PATXiri0rP»Hl4XIItKUUCHC^ 

lOlIlXOHLflKIiLILPFEECIJKWrSLXITVYLCKPLVEEI SOYKEOASWXOCFUISORP I 

VAAFPCSRJUIDISRNUIICAWLNSSLSOTTO 

I X PTeVFTTYELMRSCDCALAKCCTIVLETALWTPTIVMCRIJPFOT 

SLPNXII«SVIFPEFIOGXXOFHPEEIATAU)L1JOHCSKEKOICI^ 

SEEFLKR I FOTLPAV 

CPn.0966 1108055 110<>748 

pcne.2-PolyA PolyfMrasa 

LLXTX XMVCOM t LCCRO.ELUQCKSN ITLTFTI YSVSNHNI KLKDFS PKALSVIItTtJOC 
ACYIAYIWCCIROIXL?nTPKDroi!7rSAKPEEIPCAIFPCNCtL\CKRPRlJU4IMSTtOI 
IEVSTFTlSCSTOEOV^rTKDNl>CTPEED^T-R RDFTINCLrYDPEHEEI IOYTCCVNDLR 
NRYLRT ICOPPT1IP100DFVRHU»LU(XLSRSPiTVET0TOEAX«IACTWELXR^ 
ELXIMLNSCAAICNFFOLLX EMILLEtl^YKOKAF^t/VALEECTATYLJUUXOKXLKXE 
AEYORWLMAt FUPLVNFWRYIClCKHPYl^Lrrmr/ira 

I LTAL X LOMOYRLTPL I PTKKALPTTnUCLLHMTRFLEAi^LLEIRSIWPKIJXCVYVAWI 
RKKOTLKCKKOdHSQK 

CPn.0l»67 UOH411 UO^SH^ 

m r n A f pen - Pt w>« ptwt I urraut .in*» 

PTAYKPAP tCACR^EK I RR I H I DFRRNM(X>JVP.r. L FCTt/^CnAW F ErKnTTTVLLCyC 

AVARVU£Y»:«:iniRVVV.:KDT1)UX7mFTlML(A<:illUfr;(m < VU:n 

AYRAOrV 1 1 N I *J IN PVRPt* S IK I FlTUJ 7FK 1 . 3//I.E0P t ETMV.'JFAOft SPLTEBHAVQK 

KX RV I OAHTWW t'PVKATF l*Kf 3>TI JCf «J< t VLLOMK JLTrtCVAl": VPEFJJUEV tCYOCE 

pro ik r NEJiaMLri vv' toKAV r WKjAi iu; i ai x/ » ;lp 1 1 HvnEKiai i vrxnm i l.tica 
: ^^tFLpYNrrAai: i v;:ai^/i jiim irr.R:M( j:ultai* t '/K::rorri.iNVAVRFjr i put 

I PLI ERTMfLVCnAI i \\ i\ M 1 1 j 9 7TEN 1 1 -I'/MyBrairKllOVt)! 'l-\KAU\tV I HA Eli 
CTlt.O'll.M I III'lHNM | | H 

iltWiMUtunNkmiitti tu+ifw : h Aim r.<H f .ui:;lor.in.- 
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ORMORrFnVUWODCVSXVLBCLWCLerRCYOW.x .VB0ELfWKTVCRVOELSW.r 
QERTI PTAj*V tCHTRWATKGVPTE tNAH PMVDECR5c' AWHNG X X ENFXELARILTAGC Z 
3FA3DT03Er t^U^LYYOtSOOLVfSrcm^I^C^ACAtlHlOJHBfTILCASOtS 
PLtLCLGKECTlASOSRAFrrmmSOUASGm 

VACGMYHACYLAKYI I t^VSTPVH IEVASCFHYRRPY IGXCTLC I LISOSGETACTTLA 

AUELRIW«AYU£ICWVPESAXAij3VBHCLFlXACW 

KUWirjVLIWEOCSFOOCmttPOLCgK^^ 

r.wivw .Kf # k i a . < kai ixn • ? ;K*Kir ;t ly. . : . ttti v : ap : jcbSTTCPM i- ammlvk 
;. , «r/iA!An , -*!«!%u{,vv;.:fv. tik: .wWiTrj^uww: as-we::*: 



CPn_0969 1111803 U12999 

cyrp^l • Tyres In ■ Trantporc.l 

VYVKSN1CV1JGCSLLI AGSAXGAGVWVPVLTAJCDGFF PATTLYTVSWLf SMASGLCLLCV 

HT*««ESIWPVW1LSMA£SII£HVaCISIC^^ 

USISWIPJiLGPLGFAILHSPIXKACTKVIXfl UW P HPaL ^ 

VRSBNLTTINAPPVFrTAFWOSIITOTTnfMOl^^ 

VLGAV5l*XI^QAKGCYTAV8JUJ©AIUlSNA« 

ADCLKWNXXSHPFSIFFLTFI XPLAMAVCYVStVLTCXJOTAGCFGAAVI XCVFPTLXVWK 
GRYCXQHKRCKQLVPCQKFALrUCPUXVZNVVS XYHSL 

CPn.O970 1113452 1114648 

cyrP_2-Tyrotina Transport _2 
VYVKSNKVl^LLXAGSAIGAGVUVPVLTA^ 
NTVI«X^PVNMLSIlAESILGHVei^ 
XI5WIRHU:PLCTAJLK»XIMACTICV20YCMtFPKrCLTVA^ 

wss^rrxNAWFrLArcrosiiPTLm»iowc^^ 

VLGAVSUXLSOAJCXGOTAVIJtfJCQAKRSHMYXACQJXV 

AixajcwMqcaprsjmTTxiPLAWvra 

GRYGKQHHPXKOLVX<XOTAIJXJ(FXXXVXNWS XYMEL 

CPH.0971 1114193 111541S 

yccA-Transport piimmi 

EGSMQLYZUU3YI0DSRV0GTTASHV i GNHTAGtXl/TSCVAI^YFSCLYIISLFSflMVMC 
PATMVSFFXNSKIOTXSVSAVOariiYSTlJXa^^ 

ALVPGLAAVYGAFT1CSDLTKI S KIMT FAL IGt^VTLVFAVVSKTVSMPLIYLLXCYLGL 
VIFWTAABAOAIIWrsSTXCIMffLSYia^LHFAIJOIYOI^^ 



CPIU0972 1116377 1115430 

f ciY-Ctll Oivition protain FtiY 

RCIMttLIJTSYLV5FTII£LTUJU^ 

GT£XJE£LCAlU*lrT1ClCAnACTI10XITVL^ 

SCRXTTAAJCLAKYYICWESVMLVATXyrnUM^^ 

ATOIQSAIARGYSmiDTSGIUitVHPIlJMCg^ 

GMMArBQVPVFHDWPLSCLrFTKVPSSAICOgrX^IA^ WW 1U HfiTCT JfTTf MET 



CPa.0973 1116346 1117527 

•tucC-Succtnyl-CoA Synthatata. Beta" 
EQGOCXaFMHLHHQAXDLUSYtrmPPYW 
GRGXHGGVTVAXS&AGXLOAVAJCXJaOirTO 
. IMD«aWCPVl*LS1tAGC»IEXWAHSSPIWLTO^ 
VMHOGVO L I KKLAKCFYDJDVS LLEINPLVLTLESELLVLDSKXTI DONAL YRHPKLEVL 
YDPSOENVRJ>VI^IGLSYXAJ^GNIGCIVI»ACXJUCTO 
ASQICOXGEAVSLVl^ESVlCVl^NIFaGXMDCSWASGtt 
NVEUSKeXVQQSGtPOQFVSSKEEGAJWAVELSM 

CPtu0974 1117523 1118422 

"•ucD-Succinyl-CoA Synth* cast. Alpha' 

YDSVIXAK0ATGCRATMIFV7PPYAAJEA2LEAEEAGXELIVC 
NSTSOLXGPNCPGIIICPGEClCXaiMpeYIHLPGNIGVTO 

XCVGXGCOFLNQTSF X OVLQALEEDPYTCLI LMXGEXGGSAEEEAAAWXOAHCTKPVVAF 
IACVTAPKCKRH5HACA1 1 SGNSGDAXSK IQVLR£SGVTWESPAH IGKTVDAVLRAKEL 

CPn_0975 1119038 1119637 

no robust hoenolog pratant in Ganabank/EMBL at of 11/7/98 
CXEEOVALSIAIKXLKI IlALXl^LVU^WVIRYOUiANnfCSWPTPGFSVNOAYKCS 
EAKIEEMU)U^IJrnXWSSRCUlOOKTTANRIX£X^ 

IXLTHFFKPPKRSRVESVCHEWFPVITOXJtREEEX ICOGPXTRSNXZLMALLDHCTARC 
(HKTLWFS X FFKYLTQXELF 

CPnJ>976 1120079 1121185 

No robust homo log praaanc in Ganabank/EMBL at of 11/7/98 

I LKLVYCFD PSVPTS PEHRLHAALOHWFFLGGHRAR X LTLEGNHYRAFQENXS ISTVEX X 

LKLXSYLLX P WLI AiXIRCFlJ<SRFRCNWKCOSL3DAAVPHWOPFNDF0LFNH0ERl-N 

tWXNRRYVSG IOVLMVPVOYLRS0FPCFKEX PEA1RCENYV50G0FSEESKTSYLRAHLT 

DlVGYXLSLOETYWrWrLKIRA>CITFESPPGKIADPW^ 

LCECf*«VNRLDEALIRTEXPCXEGECITKQFU^ 

RCPSXLNSAVCOVIORKCOEHLLKAI I KEANR RL FCKXNSJ FTMRGNgVLFYT X FS PPKL 
PPAAS3VYF 

CPn_0977 1121329 U22402 

No-roousc nonolog praaanc in Ganabank/EMBL at of U/7/9B 

LY If JO FAN I LK SSFLMEVYSFSPSVRTS FWP^UALDNWFF1IX3UUJCVVSU)SCNS00 

ACEEYVPI3TTEKVLKXLSYU.IPIVI IALLIRYL1«HSNPTAKV3QXPWLKTLQLGIDIK 

-PILPCSllVrtmiOSATLFlCAIRLIXWRVDW 

PEKEIIUCTEYVRHKI^IIVMCYLTWKEIUjO^ 

'JEPQRLLNEN ITTKCSEOKEV^SDI^WAFOCWWPOF ! IQCPTFSEELVHEHSOKLD 
LOT. r/PEOOEFEOKFUm^VLHHCFECXSVASMWt FL ICPXKLALQIPFLRIIOX 

i:ilC«'i7H U22b5< U2H«»s 

»*• r»rf»uat htMiltui t*f«««nc in «:<xn«4\utk/EMfiL oi U/7/'i« 
KY FFHEVYm H I PAVRTS FUI I R VHAALOAWF Fit JlHIR LKW: ' L Of 27NJGWAYQ EL V5 1 STT 
KKVI JXLilYLLVP IV 1 1 ALL r RCLLHSNFR I DVEKERWl JC tRELCI 0 1 ESCKLPS3YVN0 
VlWKlWFKKfllCttHPR rOVCmrrLHSXOWWFP f VFQK t PKTSRFnYWFSQKETR KRDYV 
KWB4JIIV U m.T.M JTIfcWLyY IZKr. rifO::AT::LrPERVI^:LTtJN0ELOCEVORLLNEE 
:5ATF:-*r»OKLVLCj:iIV{JO t CCOCUWPXFLEVigiTAP I PXLVEEVrSnKLNLOFLCLEXAN 

n j/jw*N::t j MWims.'HiVDi kfvhaol 1 1 ytea ioi joipfsr.^ 



her A -00 i*arin^7^lrt>e«*»«•• , ' M CT t h , . 

GWrnPLilSWCAVt1TO3LLAU>C360Xy- < J R£S ftV StW Ofr^lhSgdF^WfrlC 
ATPAWYX E5FPKSOAVTWPSPCRRCPYtNPFnYFfCEFFNRFFCLPSOREKP0Sl(EAVR 
CTSFLVSPOCYXVTMIHVVESTCX IHVTLHOCOKYPAr/ICiCPXTrUVXK XKSQHLPY 
LSFQ«IWL)CVCD^IAtCNPFnU>Ar.TVr7/ISAJCCRNCU*lAI3ro 
fTVWLLMXPWICWTA rVT^SCrTYICTflPA t P5LKANRI I DpLIROWTHCFtCVr^ 

•TJiSMA-.w.sr.'V *"*al v;:av:;——.:;.-mfw«v.; 

TKCIL: XSVEPCSVAASSCIAPGOLXLAVNROICV^J IXXLNRTLKnSMOWXLLMVSOCO 
VIRFXALKPCE 

CPn.0980 1126988 1125504 

*ii*ilarlcy co Saccnarooycaa taravitiat hypocnacical 52.9KD 
p roc a in 

FVMXJO^AKKHAICPYVLXFTSTKXMCLSYCOX IFWCSCICPWLOSKHFDINSAKFLtirAJC 
FXSFPSISAOSDKX^DCIJCAKFLVDI<VNKXFtnrEI^P» 

KLWOTtPVOPAOLSOOWCOPF XUIEOO^YAJ^DNXCQCFYTUCALONYYTSOOIF 
PLfnXWLXCCEXvCSSSLALrtVlJICICKEAUUO^ 

KXSX^EGMCXMH5GVLCCXAYVnmALSCXL5SWP0NS XAX ECFYtaLALPSOSOKPD 

LPl«DTXJttCXaWLCniPOGYEASYSPa»AlJlPTV^ 

YTJCKLVIWPOJCAAHOVXMHLICOOVPSSWSYtXIJ^ 

YSX2LYKESCLRL VM P ATX PXGFU£EAAQTSPX ICCTSYLSOOXKAAIIHFSKDQUOCCF 
LSXCOLLDXLPKXKE 

CPn_0981 1127019 1129952 

Zinc Matalloproxaaat (intulinata easily) 
VTZ*WCACarrYP^IIKSCKDLPTIES^^ 
POTSNQVAIrVLIHMVLOSSX^nriWPF^ 
FYNLl^VYIOAVFHPLLTTCOSFWtAWRYEITCEWir^ 

ALNAAX FPSVTYCVNSCCEOTIVrLSHEXTVRAFHOSGYS INRCLFYTYCHI KPfRXLOF 
L8SSIJWTXX^K0AVSVPL0ntrXEPVIQaLT^^ 

ELLALHVXXX IX^tJTDASPlJtSRUJtSCFCKOTEMSI ENDIRIX PMTLVOOCSPAOOK 
LEAUrASLEEZ XHBZSBttVEBkVHQLMLSSOKMXTGfSUf^^ 

£I£VSIXLTSZ3aCDUQQWRELT£^ 

OCEVUfKECPTNDXVriXTVVUXPPLSGfl&PttUUXVn^ 

Hra2VX7VSYPrSPHAWWSriJPSVSI I «aCAL SS K S1 2^ 

XJJOnqSALTMSVlWS PM5Y AV8MACSCMS XTOAWSYLTTCl»PYVKKIKXXTI0)iFB0WIP 

EAVVIWRLYTKCFSGTOIVISCSAIWYOCLIffiNKrr^^ 

TSRC&HIPAWUPI4ALAPPXGCZAYDHPDAAALWAAeiLXMV^^ 

AAMXA^rKYSYIffiPEXAmiCrFlJQmeZA 

C5IUfiWAFY1UJCS0RIPVtJI0AF1UtS\Wrrxm 

XJWHVLTLDKDFPZVPAI 

CPtt.0982 1131215 1129962 

yigN easily 

NX«SIWQEQLIZgFS»RLAXJSHKLXKZ»OCZ<EAOWYF^ 

LrrrriTHAgTCPJLJCEotsouAVEKn.EHiTH^ ,rt n itt 

AGNLKYCXffDSQTTSAOSAF1UU}XXZIU.POORCLXZl)AJCAPXSD^^ 

IKXWIKTXJCSKSYTOFHOSPEYWUXreESLFKDAI 

XJJVLXJCnAYtfUKOXWKOXQEVSLX^EXmUU^ 

ssroYRVLiroucrsGmsssKOXEEpTPXES 

CPH.0983 1132045 1131206 

paaA-Glycarol -Sarina Photphat ioVUrant t trata 
KNPIXTEQXKlJOIDKAGLniXA^ 

FMR10CLSIiI.IS AKX ADFSDGAIARIMKAESAFOAOFOS LS DAVTFC XAPPLXAIKSLO 
GIYVOfFFSSIXLrrSirrSirCVUlLVRYNLFSOrrvtJVSKPY^ 
LAX^tASOmOX^AOLRVtSi^FAX^J'XGCa^SPWKFPGVKHF^ 
AACX/TSC&VX3KFVEVFFLVSHLYTLVGFPIFSX IYRXKS 

CPn_0984 1132370 1135510 

•nrdA-Ribonuclaotida Rtduccata. Larga Chain* 
GRMfVEVEElC>0nrXVKRICMrsrPFNQORXF0ALEAA 
TKJCVVKXa^XSEGOWTVEAXOOL^ 

IAIIRJltXSGSAXFNPMXXSAALEJCAJ^TI^XIOfrTPATLSEINOLT^ 
EEAIKl^IOOIVEKOLMVACYYDVAJCWILYREARARARA 

OXEDGTTnXRXTOLEXRFSWACKRFPKTTOSOXXACKAFM^YSG IKEOEVTTACIMAA 
RANXEREPOYAFIAAELLT5SLYEETLGCSS00PNLSEXHKKHFXEY IUCEEYRLMPOL 
KDYDLI3ALSEVLOLSRIXX>FSYHGVONLYWYFIlLHEGRRLETAO X FMHRVSMSLALNEG 
EOKNFWArTFYNLLST FRYTPATPTLFNSGMRHSOLSSC Y LSTVKOOLSH XYKVI t OUAL 
UKMACC XCNOWTDVRATOAV I KCTNCKSOGVIPFIKVAfrDTAIAVNQGCICRKGANCVYL 
EMWLOYEOFLEUUCNTCDERRPTHOINTASWIPOIJ^ 

AYGLEFEKLYEEYERKVESGEXRLYKKVIAEVU^KMLSMLYETCHPWTTTXDPSNXRSN 

0OM\AWRCSNLCTEILLNCSE3rPAVCNU^XNLVEHXPi«DKLDEe3aJCCT 

ONVtDUf FYPTPEAKOAt^THRAVCLC'i/HGFODVLYELN I jYASOEAVEFSOECSE X XAY 

YAlLA53lXAKERCTYASYSC3KWDRG*aPLDTIEXXKETRGEHW 

OT IQKYGMRNSQVMAX APTAT X SN X ICVTQS [ EPHYKHLFVKSNL5G EFT I PNTYL XKKL 

KELCLWDAEMLODLX YFCSSLLE X ER I PNHLK KLFLTAFEI EPEWT XECTSRR0XWI0N3 

VSL^YLAEPCX5KKLSN>fYLTAWXKGLKTTYYLR30AATT7EKSFXD(NKRCIOPRiaa)(N 

KSASTS tWERKTTPVrSKEECCESCQ 

CPii>09«5 1135432 lUfi57| 

"nrdB-RlbonucltoBidw Radiicc.ir.i*. <:n.itn' 

ISVNKYrGRKKNNPRLFNnRRLR X UJ ITEKPa^KMEAD I U/ :KLKRVEV.**KKCLVNCNQV 
DVH0LVP 1KYKWAWENYLNOC ANNWLPTEVrM,\PO I ELWK.TDELJEOERRVr LLNLGFTS 
TA EJLVTMN I VLA C FKI I ITNrEAROY LLKOAKEEAVI CTl IT F LY I < '.VJllll •LDECEVPNAYN 

raiAi;i PAKD0f\wi.Tvnvi^>i«fiFrwQ^:Fi:t4 w murm 1 1 MnnrrtmnmiLS 
niRONFHn: t i;Eon»v 1 1 j\ oft iiilni ■» j r oi. i w ; \ keehmta/ttki^ke i valc ekavc 
t.E i eya*3x:lpR': f Uiui:i:*r r dyvpi i i aim< m .kk k;i y p i yh: ^Nmi#t;rrwLNK 

RKNFFETPVTEY(/rA» 

• *Pn.U*«Hl. 1 I U.7 1 * I I IV «-•*. 

yimtl f<r«xJiv'f ««l i KMA M»'t l«y I :« • 

ki .t >w rooi * t f f "Ki MKvm u i « ■ i vi < ivi .yvi mi 7KKi njfiFrrtz «vi >ijKFPt*n rn : I avixc 

:> MX*PWAOAUKUI%tVIM I AV»,4*H 4^/PK IW::KM INIFyCyMI i<i VI > TAfCrFH/YWP 

i^Ki4ji(LVVNmipwiiCMNUkRiitii4 m*::ki\4:i ::ic:lul:^vfa< jmrnrrvi j J'.:t fv\ 

1 4/P) II A PRHFTf "YY rKKmrpr'JI-.^HuwirrF. I FYTKF I WKAt 1 1 
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CPn.»l'>H7 111749 J IHH115 

ytoB-Hka predicted cwa «wtnyUse 

loc t fa rcrrMFAYKTuri nw^vshe I fkttwpcdtvx datcocnds lflarllo 

GECRLVVYDlQKEALSNALUJTTmSEOERSVIEH^ 
WaOfEnTUUnTEISLTfAl^IVRPXLrrVVCYPCHP 
^VSSPWANRCRAPRLrXFQROCSCSSVDKS 



....... • •■ Ii v ;vyi-|;» • . mtn» J^-Im-' »• • 

KFFINLIfU30CILKMK£AAMHFTF^VRR5V>^RYSTFRtOCt > ANYrKAXKTIECAR£ 

KATAYNCPrSGLEFAAOIPCSWGAIFKWGTOESDI^ 
Et^TOSRFHROOOTLSATPOLSiaOQVSAOHSISILC^^ 

TSAQCLIDMCLKGI>IQCA0ISPlJlANriUn , OCATS0EVTC0LIAII0STXJCT0GIB^ 
KEIRItPYQPXXKSPVSDC 

CPn_0989 1139552 1119016 

CT832 hypothetical protein 

LRTSLA VKCVliT IFWIXVKATLS PEXFSGSP IS I SKEF PQOXHRE I ZLQHLYALDKAPS 
AEDSLWU>(SCrAV^0KHVLVALN0TKS imCSOnXLI IGNAUMCSFBSLDLVEICNV 
LRLTLFEHFYSPP INKA I LI AEAIRLVKK FSYSEACPF IQAX LND X PTOSSLNENSLS X 

CPn_0990 113980b 1140440 

infc- Initiation Factor 3 

SVAI^ XMtOlRAl>KVRLIGSAG£O^II^IK^^ X 
KDVGXYR YCLTKKEKDS KXAQHQVRI KEVXLKPN IPEMDF STKIJCDARTFVEXGMCVX I T 
CMFKaiEXAYPQCFKVVQItM5QCLEDI(ffVSACFXXABII5UCW 
KA0OW0 

CPn.0991 1140394 1140612 

rl35-L35 Ribosooal Protein 
HOIOOilWSLKPniRItflCSVSARTXLTASGOUam 



CPn.0992 1140622 1140996 

rl20-L20 Riooaonal Protein 

GKLVMVKATC^ASRjUUUCRIUCQA^ 

LEATV rSLNR»ILSEIAIHKPBOrA£IAN0AJCICA 

CPn.0993 1140975 1142030 

•pheS-Phenylelanyl CRNA Synthetase, Alpha" 
KSFSSHS XgDUSHEHKEEItAVKQQf^ 
UCQCTDXAXLC^INDFXTmDUOCKSLV^^ 
HIUCSILDDNAfDirVHLCFCVREAPNIESEANNmiJjrTETO 
RTHTSNV OAR&KKQO PPIICVVAPCLCrRWn) ISAR3HVLFKQVEAFYVDHNVTFS OLTA 
XL8APYHSF FOKXTLlJlFRHSYFPFVEPCIEVDVSCECaJI^^ 
HPOVLRNCNVlVEZYSGYAVKZEIUAMLKyGVSOIRX^ 

CPn.0994 1142371 1144440 

CT637 hypotheti cal protein 

LFNnOtt3GRigCRSRRNPIOALQ«LSaAU^ 

8AIJaiVlMrIXgISCVSKSHA DKAUC£ BOFLIAfl^^ 

KAYISVlQCNIJlEVre¥DLiiTDEJrri^q«pp^ruq^ 

NDALVQIIYKONKLHEYVNIXaH^TICrLL^ 

UXEAVVKVWMVKALFTSRTCASMVFKSPlOQOillCT 

QUCQTKLLASALSLC I FES KLVTCEASRYLYTNIOTKLOIANCKKPLSPOOVLTDAyEEL 
HRLISKYl'f CPLF lCWimVIJHESRPYOPMILClLPSL^^ 

SS ILYAICNEETLGFLNAKAHRSEVTLVLNIONRI SRJCEKAftSRVX EEALEOSEKAPYVH 
AFSFPEPTFr.fOW FSIICDIFrrAPrFSIMEEFHKPLIASSFTLTICEIJCErVCSPIJCE 
KLTA 2£! FFAWXUTO KLliLHIiSYLIVFKUEimn»NSI^^ 
AGFAresaBAJWPEMSLjgJXTl^ 
LKS7TKXSXEGWEF7GYLHEZ.TEVSHXHNL 

CPn.0995 1115515 1144415 

CT838 hypothetical protein 

RMLIWKRMLLTRFWFALTS LLVLALX FVAS IKRSLHTLKGASTAASGASVKX5 1 LYYLAQ 
X5LXAEFIKPQLVAVATTSTLF AMQNKRE I ILLQASGLSLJCSLJOIPlXI*SGAVrMHVLYA 
U2!i?ES5L XSrTICO,MD * GTl »ETOIPALYLKOQT^YSSIEPKTLTLNNVFWI 
KDPKT I YTMEKLAFTTLSLP IGLNVTOFFAMJSENXXLKEFTDMKEFPEI EFNFYENPFS 
SIJiSE^^ ILSA 
Y^I^SRTPTVTlAYLrPLGTVNIFFmJCAGIVl^^ 
YAYAKLQ 

CPn_0996 1146593 1145519 

CT839 hypothetical protein 

^«H!S2:I F ! YLKTAAFcrLSLIC 151 issweivayi akdwdtvijilkayoipyl 

LPFILFGSCFVSAFSLFRKl£DMWKfFLRASGASQSX IMFPVUIVSCAICCLNFYTCSE 
LASICTYOTCKEIAWiAKTSPAIXUmOKKEKffllFIAVTOCA^ 

I» r^^JH^I?I^?^ LN ^^ SHU>Un ^ ^ IGFLCITLTYACM ILC I HKPR 
FRKSIALYFIFPlLDLIUIVCnCWIWI^LAFKlJ^ 

CPnJlW 1146699 1147664 

meaJ-PP- loop super taoi ly ATPaee 
AYKJf^SOLWtW^LFFASUJraRY 

HHURCT2 AOEAKEl<£El^\RECVPFVLyTLTAEECCCKIS^N0ARXKRY AFLY ESYRQLD 
AtJCIFLAHHANDCAETVUCRU»Ci^HLT^LKAMAER^/VnJVIXU PLLH IPKSSLKEAL 

^rL'^i^. , V?5 CELrCPDCL I WAFLCKWVMKKFFWNAC lAVrJRHFUOMVYDHLSPJ; 
. * . ATUWRNK I V 1 1 K PUVW I D 

1 1471111 u<io*R4 

t»!:ll ATi*-ihjp*tnii«fit 2inc pr»c*ioe 

L I ::i .V /KFK n? UXKMX PE PKKN F nVFFFLLP .WP rr/AFONPt ^KKARVCFiUIQ r EI I 
I tVNLKL t VPfEffc a IK I ALN0NLV:>F(.XiRFRDVQTC | f77vLR Yl IY LEL I DOl'.HRLOLOLOET S 

p ;m/xevni::![«r£A t.^wp tPEOCYAisvr:;e/ry^ vi.TicpLWTrjPAtPOLfNi. 
H:!U»i*YiTu;R:;rtttLirrYfi::DLYeLiCOT 

T?E Arr !? r WVI - :TLKR waswvawjnn ^.rm^RU 

UAOl «t[ M iFJi WWYF»«ELlKilSLEKODPEVP'aiWFAi VAKEWTAFKFM ISUYKA 



POO PRNLVLEJCTTKCOEP 5 P< uFTrLF::L\^n^vr;Rc«Rocc3AMsrcw 

PARKLLKCOWCVTFAOVAC I CWJXL I C rVDFLKNPWKFTS U3CR t PKSVLLZSPPCTG 

iffLiAJCAvrxxAi>nywwDrvi py^gawAj <^ 

RHRCAC XOOGHDgMPrWGtLV LKOCff U ' ffl lU V tOU A I HH HW Lfl PCJ»Ptt»fBMtV 
VMIX^lXCRFEXUft/HAKRtKLOITVDLMAVAJtSTIGASCAOLElU 
TAVTAVDVAEAADKVLYGXERRSLDlMER^ I 
PRCWLWTHF^PEWKl^flTOELYroW^ 

VR5PJVCO«MS PQLCN' < TYPERStCLTCYCCYHEXSYSErTArr:DTn 8HT f HMYORA 

..: : ::MjyA!::ELMr..n::-:rrr: .•>:rv!tE:!irirr.crn::::s vl» ee ^lfkkjjec:. 
s-i-mupp: — .;'".tn.v.r 

CPn.0999 U52B59 1150766 

pnp-Polyribonucleotioe Hue loot idyltrani (erase 

OCTFWFOT X 3 XNLTEOCI LVFTTCK IAROANCAVLVRSCriTWASACAVULOOKVDrL 

PL*VDY0KFS5TOCn^:KRECRPSTCILVSTU.tt^ 

WSYTX»Vt^i>iAICAASAALAISaiPO^ 

VIMTllUILNXEQtCOPTTEEOVI^IEFCaaCKXVTXCXlU^LW 

LPAEVLTAVKECAODKFTELFNI KDKXVH AATAHE X ED4 IXXXLQREDDOCFSSFMIKAA 

CICTXJCSimOUU.IIUSREIIUXXWSLT^V*PrriETSYT^Pm^CL^ 

GSIAMAORYEDLNCECLSKTYLOYTT PPFSV3EVCR ICSPGRREIOdCLAUCALEMALP 

OSATFPYTlRlESNXTESNCSSSMASVOXriALKDAC^ 

Xt^XSGtXDHbSCNPfTIACSSreiTAFQMPI 

MOAl^APXAin^YAPRIE^IKPTXIASVICPCCXQIROXIcrP^ 

XSASSASAIWUUCnimLVCEVEVCinYTCRVTSWAIXyk^^ 

QRIENIStA^CECCIIDVnaaSINEJCCOIJa^HXATXE 

CPn.1000 1153193 1152891 

rslS-SIS Riboeamel Pracein 

SAFAAXZUUWPHSXAJQOTXEErrKXrOLKEICXnCSAXTVQ XA ILTDtlAtLKOOJCMPK 
XXJMSRLiAlAJrf .VOQlUUCLIXYWSTITrCRYia^ XTRIJILRK 

CPn.1001 1153369 1153869 

yfhC'Cytosine deaainaae 
YYIja^XEXLXJO<EXDIFFM30AJTay<RJ^ 
DATAKAEXLCZGSAAOCLIXMRIIinVLYCT^ 
AaCSWVNIFTEEHPFH7VSCTOCVC5EEAEKX><XXrrVEJCRR£KSEX 

CPH.1002 1153844 1154089 

CT845 hypothetical protein 

KSAEfUCVKOTIVItJJOLYEXXJESPiC^EEIVPNLTPro 
EGVLSGIGEVRAAILAAi*£QSN 

CPn.1003 1154862 1154092 

CT846 hypothetical protein 

TSMCTIHPIXWPOTOIA<a5ASMRVIFPOIO«NFW<LSlClZ^ 
IXMXmXPGUII^XJVXCIOKHMrMQFt,TYPLITAX)Sl^ 
XUOrrTfYlUaOHLZRXX^F5VLWX500ALIIGAVUC7>^ 
. XTVQiri JPE lgFTO PTt^^IICWCfXFVIXyY^ ft^i^ni^n- 
FOOCEXIPNPYTTSXJtr 

CPH.1004 1155418 1154679 

CT847 hypothetical protein 

HLSZEZI^XOPVStnTnUDKVXPOSTKVXro^ 

GX8ZtAVLEX>rriVQ00RVX£LZiaJ»LLlCVPDX^iaEIXSro 

ANIOtXQOEUSAOORAOANOUVNSTriESMOIX^ATSSKLSTLI^ 

CPn.1005 1155957 1155415 

CT84B hypothetical protein 

NRXPVRLNMWI IDPLSAKKPtAAAXNVPCTP ITOGPNTATADDI XAXTSKSSNPUVTVY 
YVYOSVLVAODNLS 1 1 AQELCAN SS AQTYLNNO EALYOYVS I PKNKLNDKSS1YL0KIQ8 
OIOAlGA5ROAXONOXSSLSNAAOVISSNL»rrNNNX IOOSLOVGOALIOTFSOrvSLIAW 
I 

CPn.1006 1156493 1155990 

CT849 hypothetical protein 

TKVNFF IMS ITTLGTLPTVNTINSSRPPLEPLNTPK IGAVXFS X YELLLOAIEIR0OTVL 
TOSQQLfIDNTNIOOQLWETN0IRYAWSACAICEZ)EITRVON0N0NYSAC^ 
TRQN30 X I LSHASTNIN I IQQQSSQDSSF X KTTKS XGSTVNOI20CPLC 

CPn.1007 1156689 1156907 

CT849.1 hypothetical protein 

LWYKSIACEEKDVSCNECNDYPEVF^ IQVOfTYBCDPAVIS 
YU.TKAROSLDES 

CPn.1008 1156904 US6223 

CT84850 hypothec icjI protein 

VU<YSF XCWLKPHYVLSKR LYIWVTOLI KUIDL^ 
!^nWSLVPFU.LFSFLAFPLILCFRGKCYALU^W/rt.WAJCYV\«m 
TVSFUAFC LFLQGVWLAO EEEMVXGKEQIJILSEOLCJAORSAYEDUXTKSOEKEPUAR 
AOGLORELT ECOELLKAAYOKOEYLT IOLKX LAD0WI3WLCDYAELHNKY IELVSKW0OV 
VFPWAEPSVCESOCSERVOVSRWSAiODCEESLERUWEILVIJCORCSOTIMRCOEL^ 
LLLOMFTALERRCEELONUM3KCTO INELHQ LVCKSEEXVSVEPSAHAETSCVEEXQYX 
GLYCOWEOFLEKSETL^LVTiKKLFAVOEKYLTLKICKEELTKODISFDDISMIOCLLERI 
EILEEEYSHLEELVSRSL3L 

CPn_100'» 1159085 1156 186 

rndp-Monhioftinn Atnamipepc idase 

YRLLHR7 1 LHKRNDPO«CC^RKWKOCM YPOP PKHS PEALKQH YASOYN t UJCTPE0RAK 
t YNACO tTAn I LOELCKAdCKCVrrNELDELSOELHKir/OA I AAPFHYGSPPFPTTICTS 

f jiev tar. i nm i r-LKron ikn xovnc ivdcyycdccpm'/k igevpeikkk xcoaaucl 
ttur.i ai isriztnrzuirjitKABAtjro'sxwvQnjnirrfiu EniENPYvp«YR>msmp 

I AI«M I l-T I El 74 1 NW;KKE% WOPKMLWEAftTCOf (OP.-JAOWFJrr I A rTETCYEtLTLLMO 

'.■Hi.Mll'j Il*-H.7 r , If r «'MM.7 

'T»f*,;: »«yi»iti.<.f i.-.il |Mi*ti.|ii 

vm.i I jitijii .i yrj\ v. t i-vkvalucnp^rffv^pv n.Rft *i i lfvtfor 

::PPrjKIX> I : ri.VAK' I f. V ;p|.| t KhWtWI ^PEFAFDCrmifTRP I FFPl AFPV ITRPA 
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f T«M Hypothetical prv.^in 

CTWLKNYm X0F3FFLPCTC I lllaiog ltn I lau . . . ^nny-jvicqamlvllaes TThF I 

AMFAX/rraJO»JCLAVUfTPVCA I CWGC I AVTLACVRA V LRLiiKEESW ! PYKFNMSP5YS 

pcrnpiALPLMrrpsc 

CPn^lOU 1152220 1160421 

•/2»r0-A0C transporter perae«s* 

AIF.1L rTTJFHKFKF I FYFV WrrjLLrUrtKTSRHPPTf rFfrpPP.'W t ACTTLCS LPU*L 

. Afflmju;: : :w-»:t:- ;v;j , ,t:«u--.*k.'A/'. :« * ^;*:*ALAi:.:vL>" 
.w: w;:v^"M.Trw<.rr.rrv , .':::."?rf!R:.::v:-".". ■ :*t*tvj :>:;j»rHiALP 

H I FJGLKI A IC5ACFAA t ACXWVASOSCLC ILKLXSR RKYIMELAf ACLATLS ILTLSLF 
0ITLLlOaiFSl^RVKIWSLKHKSVW«Al^VLALIPtKLIP*^^ 
TUXDWTPWPNH I PLYAGVARGYPKOHGXWLCWrOSSSAVPHV^^ 
IMCTSIIgyPIQrVCRLIDSSLOCrLYllgDPmPCOaCVajCPCW 
LNRNGWPSEVIOWSSDUSWttANXXDrtYCAIWII^ 

TG POLZ vriTOCGTKASEPEIVEAFQKALQES I IFSKOHPEX1AFKLYAICETKS X PKNLYQE 
YLQWErrFVUAOSOOPLSKOLVOiaiXrZIKKmM 

CPfl_l013 1162209 1163624 

fuflC-Fuaarate Kydrsteit 
RJHSI/«IlG!rtDNROEraSI/JIVEVPEDtt 

KKCAAQANQDLG FLDSlOCDMWAAAOEILIOCPEEHFPtilCVWQTGSffrOSWflWIWA 
NXJUIUffiGGVraKDPXKPNmVNKSOSSNDVmAm^ 

rjwcvEznwovxiGJmuTfflAv^^ 

VCTGLNVPEGFVEXI XHYXJUGTrilEPFXPASftCTSAl^KX^VIlAHGSlATlACALTKX 
ATDL5FLCSGPRCGLC ELFFPENEPGSS IMPQKVKPTCCCALOHVCAQVUaiNQTVZ XGG ' 
SRGKFTUJVKXmrYl^SViaXSEGMW 
UtPVUHDKCSKAJUJCAFKCSXSUSACL^^ 

CPiulOU 1165456 1163732 

ychM-Sulfate Transporter 
AIASTURCXVKVPHAyiOTXPKLmXKaSYSFOT 
GVCVSPICGLIA«IGCLLASAI4KSNVUSGPSSAm 

CVFU AJt^TCXCnTIICYMPYPVVTCLTrotAI I IPSS0XKDFLCLQW3ANI PADFLPKW 

IAYWZWimroSKSPAVCSLrrLLlMItFI W tRP R yPCViaAW 

SWOTLmXPLPlCXPO«miL0X*Z'nALT» 

GtVWaWIGTSIJSGIPVTGSl^HTAASIKSEA^ 

KZ PLTCXAAVLXLXAWNKSeXHKFXKLrrAPnmrvVlXTWXLTVK^ 

AA«JT«OHSDI^D\aSTAKmKDSraT^KAJVPQ^ 

DlPCPPlCIFIlJOffTWPTIDASAKHAIJZFTLim^ 

EXXGVDHI FSNIKSAIXFAOALTNLESKTSTRKLV 

CPO.1015 1165550 1166693 

CT8S7 hypothetical protein (possible IM protein) 

KNMYKNFSFFTSVRVRSKVDHEI XLEVT}OJG£LCA£JLFGYlAXVFEHrVRVNXSAXAL 

AHO3>ft^VCFSHIF«ADHWLVriXM»tS0VIpTt^ 

XqSimiXWUCLSrTLSAAXJWLTSIX IIISXXJgtt.VKAHFTWT.T.Tr.AXCVIAVHAOC 

AWTPLOIWmHLWINNXITSWIIWttM 

LQSAFPKSUfI XPIOL08LlWFW«ACU^PfW3J«IfiUa.V^ 

KLWPHXLTXIOISSITFriGILIAVNALSFANXiTDFS^^ 

VL0NVPLVAATM9<ntJLJU7rX>nCLXAYAAGTOCS XLX IGSAAOVAFMCLElCVZfflWVT 
XRXSWAIASYTOSLrSYFVLESLNFPX 

CPIU1016 1167027 1168896 

CT858 hypothec ical protein 

KHEVTMXlQGiajGAlvrGI XrTSSVA G* SJUX l'AlJMAYQDI < NVlEHLXStJCYAPLWKELX. 
rCWDLSOCTQQARLO L VL £iW^TVYCOKVLSWYVRSLKUYHAC ITTYRTESAY I PYVLK 
LSEDGMVFWIA'CTSGCOIYLGreXIXVtt^ 
SAAFBXttVPSGIAMXJaJtRPSGURSTPVRIW^ 
Vl^RSCVNS05SS5St^SSYMVPYFWEELRVQNXQRFOSNm 

0KCTnfRSYIFIWa)S0GWPHRICFLRZS5YVV^r.Ff3 JTTHKDSPWELFGEI ZOKLPCE 
TDALI IIXTrHNPOCffmXYSLI^MLTOHPLOTPIWRMI FTQDEVS SALKMQOLlXXAfFT 
DEC*VAVZ£ETMEGtfOW«AVASLONrSOS^ 

HOYTKPLfMLIOg^SCCOlAPAILKDWCRATLrCKPTAGAiA ^FVF OVr F PNRSQ IXOL 
SLTCSLAVRKDGETXENLCVAPH ZOLCFTSRDIjOTSRFTDYVEAVKT ZVLTSLSENAJOCS 
EEQTSPOETPEVIRVSYPTTTSAS 

CPn_1017 1168997 1169935 

lytB-Metalloproteeie 

VI XKRKLXI£NPRGFCSGVVRAXaWEVALZnGAPZYVKHEI 

'/EELVI^BCERVIYSAHCIPPSVRAEAiCARXLIDIDATCCLVrKVHSAAKLYASKGYKI 
LLZGHKKHVEVIGXVGEVPEHITWiaCVADVEA^ 

ALLKRYPSZ ZTLPSSS ICYATTWIOKALRSVLSRVWYVYWCDVN S SNSNRLREVALRRG 
VPADL ZWNPEDZ DTNZVNHSCO Z AHTACASTPEDWOAC ZRKLSSLI PC LOVEND I FAVE 
OWFOLPKELRCS 

CPn_l0l8 1169895 117062? 

No robust homoloa present tn GeneDenk/EMBt as ot 11/7/96 
RMSYFNYOKNSWUlCLCa^JUCFFSRIAYRW^ 
VYSUUCPYAAXGIU^ASIAKEQflnfGETPW^^ 
CITpiTSHVVRCCVZGZONOPHFZRF^SNMHWCL^ 

SFSRRLLNEZ ttKUJE^PCSWISZSFPLDSFSRGXECFFTEKSCSVRFPVCKTZAYKN 
CRKC5 

f.'Fn^lon 1172146 1170636 

>T*f> hyporhot ical protein 

I H RRN I NTVOYQC t JTPPPGGEFOX FVDGNATEEAV7AAEVOVALPAGECYAMLRATSEL 
''PCI LTCSECAL1UALPPKEKPLOEEQFLVTNG I LHP3T3LPNLK PGQSCQT3LASH RNP 
LAOO:7T^r^5VTIWLTirrPSSSFPFFSCKAPECWC , /DirrFW 

: OAtiPl IVROT^'ST I K D LZAK £ KV^O bTtCTAETO K KTOTKS D ATL^ PMJ L YCTLMK EV PO 
AU2 rrrJJA'WiF^IKDL^ECYECEQK I 
f PDPIW.PAU:E; VliT/lALJKRVTHLOVLP ICTELHKl JCJCRANOTKTRLEERELMERE 
AIIEf^AJW.'RUAKYAkWU: I ATATLC IUW f ArMVHE Xl/GOt J Ui iFVOK I STIP FKDATAK 
TPFWII' JKVFr^UVLTKAAJKVIIELSE; lAVRAVAEYRKKVFWWJIlEVTRT I EEVKfJNW 

k::m*ifi j * r urr wi wvai<:3 a\ 

• 'tl|_|A;»ll ||/tt.'t.| l{7l!|Si» 

iTnm tiyt'ir i«4t it: il iuiWcih 

« »»t.TH:aWlJ ^A:-4-.VI4M.<IJI»YI ITIAC-K W.;.TfKf 1 TVAI VKAO^n UPKPFTOX 
PWV 't?i Sfl *p f Tl 111 KTP M ATPREW f I <RP> 5I.1KE; Vji J tf/ft JgAUf * £ :rVMLF3QKNSTEA 

: :kaj jej*-a.TMri<::i f y.TrMOMawNi^: w s ^rpr/rr Jii^n jclki ^/mlysosl 



DSE00ELLC3RPEEH3ETYA JE1«XETICVCZKDUncCLF5QDCOSW0F OR<S PPOC 

arS^KAAOAVPViPPP^ -^^^iH° r ^SSS^2^ 

LHEWIETIKV3IWDmE«M»«m^ 

LI SL^I 'TT *X ZIXrwte<lXKHtPrPtEtt>RXItTT ZQWVVYL! PV mi LlLAVrPC^ f 
SP X ZEGAX KG 10 PA Z tSTHAALRCAJ tTSOAC Z YXLNCKLTK ZOLO I EUCSTDMCKTEA 
*CIXWESSfTAl^ZlJIWEXJ0VYLH5LRC 



cpn.in:i 



U74270 U7J69H 



.••t.-"rt.— T'U>: "^ t^.vurti : :*"!.v*.i.*s:^:^TF'*' !*:x:ri"^ , 7Y"/r~rK: *ir 

TYOK ZFKi^EXLOr.-f KtAT/MAVLDKDYAKJ ITVFHWLVFKNPFVSKFWFSiCAiUOti 
EQYSOALMA YC"*T AVLADKDPY PMYY AY I CYTLTWEH E E AXXALEHAWRAOKKPLYMEl. 
KESXLDXRKMK 

CPIU1022 1175709 1174216 

CT863 hypothetical protein 

FS FFFY ALICL0 X PVTSAVP SANITI-KEnS STVST ASG 1 LXTATGTVLVSCTALXGSS 
STOALISLAU30 1 XtJ^TCOEXXL0STNW0IXflPPEV\^ZXXGV\^XLVCtAU^ 
EPOeTOTOSR5SZ7nJOOSSSKOSAUPflSWEXSX3SXCOG^GT 
PITGAJl^LSQASSSSQKSLPPOESAPE XTXXiaQK ASS FS PUQ FSAE3CQKIALTTCCS 
HEI.Y1CEAlX30RQOREOKDRXKI?QEIOAESKIOqnrirRG7 flVEAVAgfCEKLDXAAL XFjP 
OKRPPAETTSiaETrFTOaU^PHSVFSRFIPSJWLSVCSS^ 

CTitARJ[flOAEAI5AJ^V»WK^IUUUlJfLTVt.ISlCtli^^ TM RA 

IXXCVTZOXEXYTWnEE3»LLICIMVCKIUCIMH^ 

VUaUJCEXJCrrXYNLAF 

CPa.1023 1176008 1176331 

No robust hoaolog present in Genebank/EKBL as of 11/7/98 
CZJgLKPTMOCVVTXJXirTOrCA^ 

CPh.1024 1177317 1176334 

xaro- Znt aorai e / reconbinase 

IFITWFSirSIJtIAPLPIlJaJlSLASMTM>STOF 

YP^XSSfXTZSAXSSPQDXSQMSVYIFAXtLYWIUCEAJCTTLJUU^XAL^ 

LLJYTOIXEKPKIMOttJ'SVtTTOEVDAXXAVTW 

ww/^tjr^wvgnngrsvTcittfflrrRLVPLGSRARiA 

FLSTRGWGi£RSCVWRJlIHKYAIC0VTSXPV8PHS UWAFATKIiONXADLAVIOXKLGKA 
RIASTEVYTHVAADSLIEXTLAKHPRNL 

CPh.1025 1177266 1178879 

P0i^Stucose-6-P xsofiarese 

GAacy5 & r RE All ttW a^OCtOTCXZX)EXj(LIIPLZ&TAPGVX^ 
SFATEMin3AlLAAtXSXAElKGXJIESMIJIK3G^^ 
OSSTPGEAEDIAVRSRVEAORXJaJFLTICVltSOmiVOIGIG^ 
TDIOfWISMXPP OWG^vy XDCAJWLVV^ 
RXMf^VTCECSFHZJDTGlCfXjEVFHLMESXOQRFSS^ 
AjAjTOIAMPNARENLPKLSA^SjWM^^ 

NGKSIA 0DGRRVGF6TSP ~ 
FQOITSSQKliFAl 

EMKIVrgaOCXNSTOECVSIfflUUJWRVT,, l n r BOADASHFPCAA&tJiTXJMECTK 
CPn.1026 1178961 1179137 

ItuA 

CSRSFGiaCEX)RJ<FTZAVRSRGIXOIHCXLAAIUQGKC^^ 

CPIU1027 1X79172 1180755 

No robust homo log present in Genebenk/EMBL as of 11/7/98 

NMPOSVSSPPLSIVIVIOTVPSSSGSOUOPKAVLICXSXLXFA^ 

<uVPSLVLTV8XIAIA>ICLXClflIfcVTItt.rLSTriU^ BttMMf 

ENRtXRSRNIUkVEinCAHXJEllOaXiaSPEYLHGKrE^ 

NASLSRXFIUYKWG'PLGAIiPWICEDXACXMEO^^ 

CFOSLVNATAPRSRFTQTPKYTOSJUaNEDCKVAAVCAA^^ 

XCEWWALKETLPLI^VYtm.VOEmiLLTAULHXIWCrrSY 

VOL|TEELC£ja-PTICSPEOGALVIu^SY«WI 

LU<SOXEVlATRYIJCr7rFVR>ISiWTCSFEI4MrSYl^^ 

PPSPLSEXCECEETl^PCSElIVSVZ^POIXVtJSMMVWPin;^^ 

CPn.1028 1160995 1181999 

ndhC-Malace Oehyrogenase 

FFUCCVRKAFKEVVRVAVTGCKGOIAYNFLFAIJU«DVTGVDRCV^ 

GVRMZ2J30GAYPLU{RUlVTTStJOAFOGZDAAFLXGAVPROPO<£^ 

OCAAI/nAAKROAXXFVVGNPVNTIOfXAXMAPRi.^^ 

EVPLEEVSRWnOMSAXOVPOFTOARISCKpAAEVICDRD^ENIUVHS^^ 

EARCXSSAASASIUU^EAAASZFCPKSOEWSSCVCSDHNPYGZPEXXJFGF 

OYEZZPCLPWEPFZRNXZQZSLOEZAOEXASVSSL 

CPn.1027 1181987 1182844 

No robust homo log present In Genebenk/EMBL as of 11 -7/78 
RVFVISTMX>CVSMROSFDELS0NAFKf I Z PNXQAFCF Z FCSLCCFGFVFALFLXLCSRXjA 
PEZ5LSTLGLCAFFCAFSVZCA5AZ IVOFIXHKESOCETSKLCCAZ KNTVSSLNLStiVS 
HPFFIAMVAVWAMI^FLCSLPWVCKLFHTVLI F Z PYLSATALZ LLFLCSFSCLFFCI 
PVU<N0E3IOYRKLL£CFRCNILROFtGWIALVPt>I^SWl>LM 
TWSFLAQMFVLIVPIALILTPAVSFFFNFSFSFYLAKOEEEXALVK 

CPn_1010 U83901 1192843 

predicted O-amino acid danyrogenase 

FKVHFNR Z A VUiAC Y ACL5VTWHLLLK3CCT AT I OL FOP 1 PLCECASCKSAILLHAFTCK 
KALKPrLAOO*. INATHAL ITEASXALNVP I V t -W (LRPAI DEDCvNOLFTERVEEFPKEV 
WWEXAffCK 1 5 [ PflMV [ PPNLGALF tK.7WTU»ICLY ICOLAOAt'MKUTrw^FYDEt.ZEDL 

ADiEEr/oir fvm^\j^;r[.pELAt»ir^KVF^LEir>iPKOLAML:{rj inamkywa 

NryfCKP; U * «\TFEIINyPEETPDPA I AYOE [MPP'/UXFPCLKrM^VUK'YAiWUlSiaiS 
HI 4V (SP. I K FJCfJWIl a ULCKXGLLYriC I TTTTMLAQAVLN K.',TA Y I AKKFLFT I 

••rii.i'Mi hh'.w iiRiin'#<{ 

.irt:D*Ar>f tin iMt'iintir inn* a/i(. imh ' **t 

t k »-:rFKT:,'RTK:;r4(Nii n" i aix Mwr.r. r in vi t f^lcvimaatai wV i umi Lix.Tr; 
mkfiaiitfr i RroLKwi [YNrcfcGM* n"f i';KTifa#r/wz*viM wv.nrAviTMOA 

I *IYFFM-YFX> > :NTI J^A I U X t: f LIWVFTIF I VIJ«; I RUA!« II MV liJT I PK I r PT.f t F II L 

takffxlavfktdfv* :iiAVTKAyp:;u;:r;::: vxfrrHts/TiMM' n : I w:aw>o;kaicW' 

JWAT/U IFUkTLT I Y [ UttMJf M W4 1 ttlP.TTAf.VU) l lM*4vWifcVUt<V 
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XIlAVUi.'.'WU.VrtrMEtPFSAAKNfTTFPEIf . .OC3PSV3LYrT33VM0l>»ttX 
VYFGGJJAWrfTKLo rTGVMVLPAYLAJAAPLPKLGK^V fYPKXCSlKAPLAHrTGILGWY 
:ILWLI YALCLXYLWALVLLALCIPrY I0*CKKKKNAICTTrAKItErVC3KTf ICUALTAI 
FLPLTRRtKl 

■tPn_l0J2 1186153 1195566 

CTJ7J nypotncticai protein ri 

• v» - •.T.>nr»v;s.r.v , !i* ywu-Ui ww: at sii»3W» ;sma 

ALGPLNFENAEPAKVN 

CPn.1033 1187656 1186187 

CT372 hypothetical protein 
NM0ODr»irtTrOTVDSXJtfLPS£DffCYX!ra 

LSGLCALS ISSYAETPRCTtOfYliRYKARIOEKHPES tKESAPSTTPHHNSLLSPVTKI F 
COTPWKDOISVSI^TSVEICATVTOISISrSXW 

snwrLYOSPTATOrvorsvTUHwerw^mwOA^ 

OTFWartTIAIOqYSLYArtSTLYCWDQYSOri SYALSOMASATySLSSTSAYLQFTPW 
SEXKVQLGfODSYN ZOGTHFS I YftTKSKYNFYCYASWrPRPSCQDGQYSVLLYSTWVP 
EQNSQVTCWSLWAAQH IHEXLYLFGRINGATGTALP INRSYVtCLVSlMPIinWSODLLC 
tCFATNKVNAMISWMCLRRyESVMlArATXC 
VYGLRAWLSL 

CPn.1034 1188589 1187732 

Predicted OKP (CT37D lludir U8J peptide) 

KTSWOKYXKYWSILVQKlARYVMinVLF^ 
YDCEMFW.XI£LD0TXX0OCEAX^X>flC5KAX0GUKOG^ 
GTVOPIESAIFliXZKIOICOCaCTTTVYTERPirrAKDLTXJB^^ 
PKl^rrSCIIJ'SCDYMKSPCLOLrtEICTPLPAKI IYIDNQKIWLWGBLCOXYCIAY 
FGrrYKAQELHPPrm«IAOVOmSXKXXSy F AAA f . T .r. HH Q KH g 

CPn_103S 1190081 1188570 

aroE-Shikinate 5-Oehyroaenase 
VVOLPU4VPr/KLQIWPTSHmCVSVHLC^^ 

LDmJSDOEUnxmA<>a»aTFTOHKEMSTAl>nOKLYSlJgaXPWKDI 
. LQTIMSHPKIKLILSYHTDiarEDLDAIYNDIlATPAEIY^ 
UOWSTVl£WGTlCLPSRVl5PLISNN*^^ 
KSHXYSUCOPVDIUXSKLSHNFtXSIO^XilATYXXFIVrtGSVVOT 
VTMPUCTAI rTJHVPALDASAQLCES IWTLVFWCXXIilY W TD GB SVAKLIJOQXKISVWK 
HIAI\AWOGAAJCAIAATtA«GJttI^ 

: iNCLPPEVTrPWRFPPIVMDIMnCPHPSPYLSRAOJWGSLI TWrWIEOAtLOFALW 
FPOPLTPESC08FBNYVXNPMMCV 

CPiV.1036 1191180 1189984 

aroB-Dehy roqu inace Synthase 
GTOFCSCRSCIIPTMLCTOKSETirrTPHWiajSlI^^ 

. CnVliMTGFLMTYa^PLYLXPTriTAMVlTrSXOCaOCXlU^ 
MCPQFlJTLPRriWYHCIAEAllO^IArUYlJWErLNSHSJW^ 
KAAXVAEBPYBASUWtUirCHSIAHMETLMCG^^ 

TPOUDOLEWiJCRFNLPSTUCr^SIVPEHIJfliSLYSPEKI XTTUSYDCTWr.SOKFTJM 



RFFLtUCVLWPUafTLYVl 



CPn_1037 1192286 1191123 

aroC-Chorismate Synthase 

LHFSRSSIUlSFlfEXXRTSVSRSHYLVKVMlWSFCSLFSFTIW^ 
GLEX*ESDrVPAM«*RPGNPGTSSJ«iNDrV0^ 



AYI^SLCSLTLPHYLKXSPELIHKIHTSPrYSPLPWEKIQEILTSLHDOSDSLOCVISFI 
TSPXHDITfiEPUaCVHALLASAIJiSir 
TLKSNNCOCTLCCITICVPIECRIAFKPTSSIKRPCAT 
AXRAVPVVeAMXMLVLAOLVLYORCSKL 

CPn_l038 1192750 1192199 

aroL-Shikinate Kinase IX 
WKLEXJWVHTIIU^FT9GKSSl«AX^ 
AYCDQKFSECEARXLeTLPPEOALXSLCSaenUfYEASYIUXOTK^ 



LUG 

CPn_1039 1194011 1192885 

<jroA-Pho«phoshikimate vinyltrans Cerate 

yCFTMLTYKVSPSSV\'GNAFIPSSKSHTU^ILWASVA£GKSrrY^DSPOTEAI<ICAC 
KOMGASIKKFFOlUlVCtlPUIFPKmiDAGN^ 

LOR R PMAP LLOALRNFGAS FH FSS DK SVL PFTHSG P LRS AYSDVEGSDSOFAS ALAVAC2 
LAEGPCSF? 1 1 EPKOlPWFOLSL*WL£KU<I^SCSDTTYSFPGSSHPOGrSYKVTCOFS 
£AAPIAAAALLJKinjOPlRUWUtU>XOCDKlFFSU«NLCASXOYIMEEILVFPSSFS 
tyiSIOMDCC I DAL P I LTVLCCF ADSPSHUYNARSAXDX ESDR ILA ITEELOKHCAC XQPT 
HOCLLVNPsTLYOAVLDSHDCHR IAMALT lAALYASGDSRIHffTACVRKTFPNTVOTLNI 
M EAR tEECHDNYiMWSTHKRKVF AR£S FG 

':PH.1040 1 194876 11*4071 

No robu.it homo log present in Oenebank/EKSL as of 11/7/98 
RPr^SLFl^TVCPSSSFRDnVCAAPLLYPRRRSPDYLFSPTCCPMSTTTVKMFIKTASR 
WR PVLKE IV AJNYWH AQW I frrLSFLENSCAXK X SASEHPTEVKEEVLXH AAEEFRHGHY L 
KTQ tSH ISETSLPDYTSXNIXCCU.TKYYLHLUXPTCT I LV 

TV A I ELRAJ ELY PLYH0I LKEAQSK ITVKS X I LEEOGHLOEMERELKOLPHCEELLCYAC 
vFECEUTIjOFVERLEOM I FDPSSTFTKF 

•:f1i_|tl4l irttJlH II-M72* 

•liinA -Atfatitwy |th«r uinitifiii -m - AmnoW-Ox'^nofMnoat* 

«VkiihA c.MMI<ir*i::« a 

1 4-HR I K f 1141 1 KFT.i '.W VFFL\»HFU)F:;EP5 ISHKWJ3L ILFHCft:»ALTF.POYLnYODK 

k i vi- ;» lAavAEarrnYLnAi 55m:NUMiRiirY itxklceoaoklehv 

I FANFT1 lEr.M .V: <K MP t tLT'ECiLEAFFPGDNTC: TTH t E I AMK f AVQYYYMQtJKAKSHFV 

' ;t ^.nayi* ;i vn* m : i h tt::i ttvpfiidl«.p:j.';t i aapytt ikkela r aoaktvfsesn i 

AAF I YK1 1 4 o -V * SKI JIYNPB ;UCF. I LKLAKHYi /; t AUK 1 1 .T :F» artCPLFASEPTD I 

Mil h*i;:k. u.i*v:vMiAr:ivmeiinMFvr.CKv^ 

1 1 .n *i *vpvm i ki« irjefvkmp u;i iWRVi/nvi AU/YrAiwjYFSQYRDHLH 



CPn.1042 Or*Tp«IIOD5914C 
•btoD-oethtobiottn iyntMtasY 
NRSPrTYTiuwrnwxiivcxincvarrxv^xx^i^ 

HEI^YCHPEAYW^Pt^PHJCAAOICNVSri^ICAPKTrS^ 

KRl«IX^SWSCSWTLVWVt^XNKTClTW«S«^ 

0ETKU»X XCTtAKEKEtTRTI ISCYAECMtlWTSKHOC tOWSnTKLMW 

oiof^J-Oxononanoata 5ynth*se.J 

LCAraJUXTCKSOLCORIEEOUUYKWESCLXFt^^ 
YIKASTrTCIW^KAOST^FWNDLNKlJatWJG 
SD£IRTSAYtIVDEAKAVCVnn»^rXVSAL^ 
SXXJam^NPOlPrxmAOPPKALTAXXlAYEKNORA^IORENX^A^ 

LOLMwrrrPios icvsoshraroaalo iokscywrp rvs PTvxouELXJtxcLiuni 

TIOmDKXXKTLEOXrLCNVSSL 

CPIU1044 1198700 1197899 

•biol-ilotin Synthase 

AKHKRJETVSWSLEOIRXIWTPVFELIHKANAXIJGKrUiSE 
CAYOOSSRYHTHVTPEWiqCIVPWPttlttAVE^TR 

HVXS ITDIi3AEVCCAXia i fl-5EZ0AKKLYtlACLY AYNHNLOS 3 P EFYCTI XTTRSYCD8XM 
TXOVVNXSGISTXCffiZVQffSESElSRIXXXHVl^ 
PPISTVEVtJn-rATARVVFPRSMVRIJUCRAFLTATO^ 
ND XOEDAEM I KLLCLI PRPSFG IERQIPCYANNS 

CPn.1045 1199802 1198901 

•conserved hypothetical bacterial nembrane protein 

GTLIWrSHJim.VFSYI^STrriXLVlJNLVU 

SDWHEIFOPICXARVMrFSAF lAKlXASSIVOIFKFTWASPEWTAWCtJOtJPLWX 
ASUJkTrVSOOnjrVLYTFTWRTPNSSUiLfl^STWrM 

FPOTWiMrmTYicrrrcvLTTPiJYXAw^^ 

CPn.1046 1200675 1199590 

•Tryptophan Kyroxylase 

VHYCERTLDPKY IUUAlJCLKJSt^FONSOSLORAYSTPYSYYRI IL0KZNKEX0AIA 

RKjtc is nzrrroLiJVHLLSi^i040Rixx:sTD^ 

YCPRmi^LEAFClXSDFlJKOAVIKPTEIXrKFSYYPV^ 

AsviomjOTirstTPDUKiaxcwvwLLHPsrsEr^ 

RIQTWSMOAIVICIWFTVESCLIENHECRXAYCAVLISSPOEiC^ 

I»IIIu^PFlJTSTPOEWSIRKrDEX,mTSXLEl«tti30CX^ 

CO 

CPn_1047 1200537 1201343 

dapR-Dihydrodipicolinate Reductase 
rx»Rie4css?owcviGCsamncvmAi4^s 

DFSHPLLTXEVVAHU-ISPKPL I IGTT^TGXCKEAHDSIXIXTHIVPVVVCPNAILOAY 

XJOCUVKLLSObCNranXlUReiTlKRYIClSSX^ACO 

RX»5ncrXEVQSSRVCOXP<BCEWXSSSIQILVRKTVnR>^ 

PQpctysifitmJLVLRHEirajxrroH 

CPn_1048 1201588 1202604 

asd-Aapartate Oehydrooenase 

DOTUCOWIAVLOVICLVCOXFVAIXKICWYR^^ 
PMPEtlVIUTIJXWtXEEVOSDIVVS^PSSAESMX^YCl^CqCVVPS ^ 
IPEVKSSHFOLLEZOPYICXZITSPICCVSCITLALAPLRXrSLXX^ 
PWSLOLXJ^nVWIVCEEEXIUlETVKIU^STO 

VTFSKDVDLOEI LY5YQEKMC EFTTffTrOLYIBfPWS POARIOXUKSDMRVNLOPlT^rCXZOF 

CPn.1049 1202586 1203914 

lysC-Aspartokinaae XXX 

EG3JVSXIVYXFCCTSIATA£NICLVCDIIC3CDKPSFVVV^ 
CEVLIUlXECXMEEXVKNXAIPFPVSTWrSRLLPYI^LEXSDWARXUUSEDX 
RAVCSTRCWDI£FLIARSVILTDDSYTUIASP^ XCCFXCSNC 
LCETVU^ROCSDYS ATLI A£1>RATEVRI YTOVM3 1 YTKDPKVI SOAQRI PELSFEIMO 
rnJGFCAKVLYPPMLFPCHRACIPIFVTSTFCPEXOTrVWAV^^ 
Y0SrCSVOYT\njCCCCI£EILSXLESHCIDPElMXAO»aWVQF^ ISOEAQEHLVD 
VLSI^SVTRLHHSVALITHICCNl^PKVVSTITEXtJ^ 
ELASSX X EELKN0YVKQKAXVAT 

CPn.1050 1203884 1204798 

depA-Oihydrodipicolinate Synthase 

LCKTKSYSRHVCR IHH U.TATVTPFFTNCT I0FASLERZXSFQ0AVGNC7VVLti38TCISL 

SLTXKEXOALICFACOLOLKVTLFVCTSCTLiEEVLaviMrCKDLPISGFLKTTP 

KLCO0XIOTEAVL1MAKHPAILYNXPSRAATFLYLDTVKALAHHF0F 

OSYKSlAPHIOLYCCDI^FWSEMAACGAKCLrSVL^NAWPEEARrrVW 

ETCRWVYTTTNP IG IKAILAYKKAITKAOLRL PL5 1 EDFDLENVS PAVESKLAWPKLRTS 

VFSYS 

CPn.1051 1204956 1205270 

No robust nomolog present in Genebank/EMBL as of 11/7/98 
FFMTPKS X OOLH L I KT ID PVRK I S PVTTFKSS PFRQSLLRF LELFVH F LYC I RS I RPHCV 
HIATFICI^ULFLTTLFLSMICILHFITLPWICKEOPR I IRKNK 

CPn.1052 12C5402 1206 16'» 

No robust homnlot) ptes«>nc in '>neUini(/EMDL •»* i»t 1 1/7/98 
FFI0RMK'/N?REKIKC:aLR tC J^/CtTVFRNMF*X2CY0K IFVULJCYVFMJPNS ICRCR 
;!FTFFR< JKKTEVETKEVK ( KCE I R P:ILF/«NDT VKVAE.1F PFR KAALC >LJ20:<S ICNLCA 

trMru>'./jnu:twr'AiF.iHr.rr rpmnK:?n *nABr5«EPFRYTAOJVi*w ua;klacsycl 

fTVTACLUA^LKOV;:a^ IRTRAr^: r Ur/tlf i»WTRPL:y.TFY [VlIKARPI J.PFFRLT5D 
VRROIJfKlfFRL>:H^CP 

•I^IOM I .Mr. I IN I.!0u70l 

tin r«ilMi;;r tw«i«>i«H| m>^:*mr in 'hiM4»iiik/HNril. i:*. n| 11/7/*^ 
vk:suu4jkak r en ilylti *u h4/;vai i uni i iJnr{::rYKAtWiw*yHi : i.toep 

. :kLATT> WOTS-Y; : I V;U.t •!« U 4< YP/Ti:i'i 7 3« II WH I nHIA/APF ti * rrURAKE I PC 

:r;rnYAK>*.';i.TAnK:ii * i ;;eki jsktm^v.* ;i v hjm !Y : v tfvtktni JCFyYRiiuaiNTnF 

EUS^AK 
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:Pn_IOVt 1207010 120146* 

no mount honolog present m Oenebank /EMBL as of 1 1/7/08 
::m*tKRr IKQVU^POLPPPPOHSVCSISSPSWJW^IT^VFaCLLilSCALr^ 

Aorrx^owEiJoxjTDxoEWAcijucxjra 

IFELRO XVA0ECWDI/lFUI*2C»SLMHrAESKIi)I^^ 

r/AP IXVHT.Z IWIfVAVAPDRNTf WDtAFAF AtrJO££3VYRSLT^SYW)ir7LCSC 

•• . w. ;n iTX!4iLW«'« i •AEVKLi'WrAitWKn- /V^wked ? :f- UMTT 

YVROKYDOC7t3KACERI^XUfALyPEVSV5ZRO«lOr^ 

OCOVMCICEiaiCACFItSOOCILSPCEtJCSSUOPD^^ 

TA£VBmZUO«SW^rvrEDVTOPCIlIttI»rUWAJLP 

CAO tLPCWPYC KESUMffSIPtt.VStP^ 

RERI0Crrr0C3J)LVOCILLCVSSRLWITtCI3CVSCS^^ 

. j -: e?wM7«Mgfp;ji inVQ AfTftM JWit J jnfCIY w LY B rgtt**«M r.vmTrvAT 

O0RVMPGSZSVPEZPCAPCIXPSZJJ>KAItSLPTlUOKT 

CPtulOSS 1209583 1210521 

No roeust honolog pnint in Qonebenk/EMBL as of 11/7/98 
aCYLYKHSYPPPPWSVGAmLSKniV^^ 

FT?/;TQ.S ALOBVLVVSCLLCI LAXRg VPTV RPgl f^^j^ ^^^^ ^^^^y ^ 

WECWYiJccuoDroiCBTLrKSovsiJTQ^^ 

RR ICXTTRKVAJfTTORNAYGVAJCTAJriXAFTZAXXr^ 
DCXLRZCYLEUW 

CPn.1056 1210482 1211228 

No robust homo log present in Cenebenk/EMBL «s of 11/7/98 
<»XKIK£RVEER>0<LftVXQJU*IKOAX^^ 

TSFTBT QSLDQTLERXYTCEyQKRFQCPSRLES EVSGCREHLitEQVKQFCTOCT » TCTTL 
irVSDVLFIWIVSCLVST VHVP y H I V i i L*FU.HRLPXRJtq»ft4ANAErreiCVPJ0u^D& 
KgTtJXAICAPWggYWUJCEBlUCSTCO0U>IIinCItAA00R^^ 

ysrrxBUcs 

CPn_10S7 1211467 1213596 

CT356 hypothetical protein 

I 1 HFYFFNP AMP EPLYTOKL XTDCS P YI^X Y AHTPVNWYPWGAEAFH I AA I DOCIVFLS I 
OCIQtSSMCOVMLOESYTNPEXAAiaNCy^^^ 

Eivswpijwn.TPDLVFFr*VNYuaraa^ 

KVUIASTL&KVPJCEIUESSIJanVAAI^OIDPH^ 

ixy^esp^LFrvcitsuiMaxscMasix^^ 

tXAWCLCXmMXaCOZI^LSIXYSPEMZArrSSS0A82^^ 

Njuzi3>raFamrcxs*ECFFK^ 

UCaZKAfftSHRSiaCI^TrtOTWTYTrAYACm^^^ tiuwamf VWCLYIWH 

C.YRRWRECIAKYRASLZZJVGALILSVIALYtSOCCSFV^FA 

PYSVIXna)STLLIK0SPI^0atrXSGNALXCQCIX5Ua.ZTnC^^ 

AHTHKPSStJaUAS0NVPSPJOIVICVUALCD0I2»^^ 

OEKLETVL PPfOOJ PKGDCTATT ZYVt*EVIWKRFXr3Z^LrRRYX#ZSti 

CPIU1058 1213742 1214836 

CT355 hypothetical protein 

EVWXYOTZJtOrVLVSTOZFLOOOWAASVPVTSSOr^^ 

FKVDEENVVTAIXVIHXX/a^YNSYPKL IDS FPAR^ 

A»ICIUATOTAVN0OTZ3ffCPJ)Z^PW^ 

VMLKVTPaXftffrifPJCZXEEWIUCVXW 

K0RLTALVXSCOBCLVCSEEFSFJX«EX^0SHXQI^ 

UACTSaSlEPt^VNeSXIROHX^AUAESVSXOYJCDPXm 

LFSLL 

CPn.1059 1214848 1215678 

fcgsA-Diaethyladenosine Transferase 

VTOSPAQLSRn^EIQMUTWSIJQNFLVDQNIVK^ 

TaXrAAOAOVIAIEJCPPKrAPSLEELP I RLEI ZZ^CKYPZJOLQgmLaQCIlWAWLP 
WITTPIATKI^EAPDFWirrVTVMVQDEVAPJir^^ 
KVSASCTYPltPOVOSA^HMCVKroPLSOEEXPVFrrLTRTAllQOW^^ 
KXCT^XMUOXCLLLNVR PFVl^UiDYLALFHKMOAC 

CPn_1060 1217694 1215727 

dxs /t tec -Transke tolas • 

YKRFWZHZTKVKTSSSCPIiOLZLSPADLICKliZSOLPClA^ 
SSNX£ZVELTZAXJWTSSPKOKFZFTM3KQTYPHKIXTG^ 

PTESPHOLrrSCHA CT Af . ST AT A M A QTTPLCS RTHV I P I LCDAAf SCCLTLEALNNXSTD 
USKFWILNX*ftraiS»IVQAKSRIPSRWI4WPATNK^ PRYGDSUUWS 
RRLSOCVKNLFCPTPLrtOnSLAYVC P IDGKNVKXL I P I LOSVRNLPFPILVH VL1 1AU C 
^LOOAC^PaKYH^ANFWWESAKHI^AIKP^ 

AKSIGSRl^rKOKrPERrn>VCIAECHA\TrSACIAXA(»PVICSIYSTrU<RALIW 
HIMnCOLPVIFAIDiWCI^YCISRSHH^ 

LHWSSPSAIRYPNIPAPWIDPLTCDPNrLRSPCNArrLSOCEDVLI IALCTLCFTALSIK 
H0UAYCI SATWDP I F IK PFDNOLFSLLLMSHSXV IT I EEHS I RGGLASEFKNFVATFN 
FKVDILWFAI PDTFLSHCSKEALTKS ZCLOC5SMTNRZLTKFNFRSKKQTVCX3VRV 

CPn_106l 1217932 1217666 

CT330 hypothetical protein 

FCSLMVEIHHKDPSLJCKLFALQOSLETLNSLSDIVATYEAM rSLIYECLNKALRKDQLCY 
U^VNSKCfiLUCSPSCOPIVOTrplHPHH 

CPn.1062 1219835 121815«> 

xRttA-exorJoxyribonuc lease vtt 

WJFPVM^PrQAVASLTER IKTIXESNPCOI IVKnEI^VOjOPrraLYFCIKDSOAFLN 
• ^rFlirr:;P(YYDPjrpKtJROAVt tltTKlAVYAPRGOYQ I VAI tALVY AfiEGDLLOKFEETKR 
HI .TAIEY P ATEF.KJC PLP FAPQC I CV ITS PTt^AV I QD I LR VL2R R AfiNYK I LVY PVTVJCN 
^AJICinitAI EVJCIAENIAOVLI t AROGCJ X EDLMAFNEE X L VKA I HAiT I P rVSAWHE 
TDYTU -I jFA;;UVRAPTPdAAAE C VCKS.CEEOVQVFEGYLRl ILL^ l^ROLLTSKXOULLPW 
1M<KIXttA£FYTTAOWLaStE!AI<*LVra^ 
I^KMI^uyiKALO^RaiOWKLT™^ 

f r.-FIIKirnM Xl lAllhM.EWLR^IlVOKt^LUMRl^WtlllUwOK IAVANVKETLATIL 

WKYmrr/ARYrJAXJtEUUUWWCNVIJ^ 

USEAILTVTNtRf'.TCLIKi: 



FCREWIF^DflCERKKTPCv . ICtMKtOtr? rCEAJCEYVCTt-AJLLOCEFUTTT ISI A 

sprrgwxHjviwrrwtiwusaanwff!^^ 



Ft. I AY EPVVAIGTCXVAEASCVOO IW«FCREWAE*FSEATAEEI J I UG^T.TVEMAOR 
FC0C5Z7VOCLi.VCaASl4WSFrtVA»IFW 



CPn_io«*4 



122371* 



122WS 



230928 



CPn.1065 1221140 

No robust hamolog present in Cenebenk.'DaL as of 11/7/98 

juounwiurroPcrtxirrszpEiSLPPosc^^ 

YEKAIFVTCLP 

CPn.1066 1221132 1221488 

No robust honolog present in Geneoank/DSL as of 11/7/98 

SMSOKEIQfnOJTfAFIJIPLFTXVItX^ 
0ZLKXVTSHCAVArCI0Oii4rSTNLXA«ZJlAJ(Er^ 

CPn.1067 1221675 1222292 

del -Polypeptide Oeforeylase 

icjvtvvwyrriicosjfvcjTw 

KRCVCX-WOWCI0W5lXV^CVI»ETE» 

SX PCLRCEVFRPOK ZTVTAMDtNQCirTEKLEOFTAR Z Z MKITOMtNCT/LYIDLKeiFKO 

vnmviSUBiimrtmiisxxsLvs 

CPIV.1068 1223267 1222365 

rnhB-Ribonuclease Hit 

KSC»PPPVVTLTTSAOI4»LIWLKEnTirSOP(3NTVr 

WaggyXlTTTJPEItJnTTTURVIODWPXPrtai aa^ f OPL CIAAVYASmPUC 
KLYIMXVQOSXNUCZTITCZASIARZ I RSI£VC0VX ZLYPtXYNELYOOXJKLNTIIAKAHA 
TVTJWUU^PAaDSTAZSWAASEYTUJ^^ 



CPn.1069 1223507 1223941 

y foA-KTH Transcriptional Regulator 
VIJC2IHIHX£LLHtC£Xr-" "~ ' " 

&rxiocyATYXjG&xxroxi4KXPYVMxxric^^ 
NijaocLiiiacrKvwwLCSLrszF 

CPO.107G 1225523 1224144 

No robust honolog present in Cenebsnk/IHBL as of 11/7/98 

BjtsxjcrriCGHotrYYjrepppwpqopiPiiB 
oaaivuMwoRjoaLiiiwLLD^^ rxrmmiHCKTumaQCCNXL 

ETVt«PIWSAOn«LI00YCPICVGMSTa0WHCTCKIEX»EPlOT 

HRgiJCAAjQpRcicESLv yT .r agjrjf.rjHj crr p iw 

UX^SOYiaaPIfiOAAIQIVDLXXa^XJCIu^VASiaXA 
SIOSYtPIJPWgqTVLDPRVOCP^MUUCVLVmAC 



CPn.1071 1227336 1225885 

No robust honolog present in Genebank/EMSL as of 11/7/98 
W*l IMVLPNNSWFWfCOJFNCKWVEV 1 1 IkUSASDISEEACSSOCAAPITTOPTlCr 
TKVElCRVOFNrAOaJESTIHNXOEACELVTiSIlJHRJnW 
CPXICCTYXACCLnwaa«2VWLVHEClOTICPIW 



lESEEAmSSASNSTRXPACYZI^SPtTTSPXSOSSP^AIUlPSSVGAE^^ 
NtatWLgCZQVTOIXTCTFSA^PTX^^^ 

YSYRTCKTSA5FXJQCX 
ILXCSGHL0T1TLCIK 

CPn.1072 1227924 122883S 

No robust honolog present in Genebsnk/DOL as of 11/7/98 

LAIOIFTO^EXaES8XPZ50hXAZXJkNSSJ^VLCZSSrcZTGZ PATXJiSLLRQNLSFQKRS 
X ASESFtiK XDSAPSOA^FYKCV7JTlCirrA XVakLSOLTAOLDX-S PKK 1 1 FLCEOPEW 
OAVCSACXCWaOffl^WPAOESLFSYVMPYSTATEWEAOCtX^XSDEVAOtT^^ 
PKPtl 

CPn.1073 1229011 1229832 

Predicted OMP CCT371) 

MRRYLFMVlAU:LYRAAPLI^WIKrTt»CMVLKrARErrLVCFNIE2?n^ 
AWLYNRELDLKTTLSEEOAREOAFLEWC I SFLVDYELVCANUUWLTCLSLKRSWVLQ I 
SORFVMLIKNTLR ILRSFNIDFTSCPA ICECCWLSHPTKOTTFDOAHAX EKNILPVCSUt 
NC0PKXMALEVLLSCISSPP50I I YVDODAERLRS XCAFCXXANX YF ICMLYTP AKQRVE 
^YHPKLTAIOWSOIRKNUDEYYESLL^YVKSK 



RNA SECTION 



r.mHWA I WH I »«074 

RlUum. I.-.i:-.- |« RNA 1.117 M2 

rl<WA UMliiSM 10(1211.: 

is i rna iiMM4l'i |oi»v:7m 

*.;: iKNA h»i". fi; I iiir t'lir* • 
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CRNAl 



CKNA 1 


B*gtn 


End 


TVP« 


codon 


------ 

I 


89657 


69726 


Thr 


GOT 


2 




91070 


TTD 


CCA 








!*.*. 


■ ."AT 




• ,., . i - 




*•»»• 


' "A T 


} 


296075 


296147 


V*l 


TAC 


6 . 


296151 


296224 


ASp 


GTC 


7 


409846 


409922 


Pro 


TOO 


8 


462141 


462214 


ATff 


CCT 


9 


672236 


672318 


Uu 


CAA 


10 


677264 


677337 


ATff 


TOG 


11 


739403 


739486 


Uu 


CAG 


12 


781610 


781680 


Gly 


TCC 


13 


784822 


784896 


Glu 


TTC 


14 


784922 


784994 


Lys 


TTT 


15 


836119 


836191 


Al* 


GGC 


U 


843926 


843999 


Pro 


GGG 


17 


877400 


877473 


Aro 


AGO 


V0 


108S60S 108S676 Gin 


TTC 


19 


1142034 


1142116 S«r 


TCA 


20 


1175863 


1175944 


Uu 


TAG 


21 


1230028 


1229942 Str 


CCA 


22 


1137462 


1137389 Val 


GAC 


23 


1030603 1030533 CVt 


CCA 


24 


1000022 


999949 


Hit 


GTG 


2S 


961607 


961536 


Gly 


GCC 


26 


807413 


807341 


AC? 


TCT 


27 


786780 


786708 


Thr 


CCT 


28 


715971 


715889 


Uu 


TAA 


29 


708441 


708354 


Ser 


OCT 


30 


680259 


680178 


Uu 


GAG 


31 


631445 


631373 


Phe 


CAA 


32 


626987 


626901 


Ser 


CCA 


33 


293477 


293405 


Thr 


TOT 


34 


293399 


293317 


Tyr 


OTA 


35 


269142 


269070 


Ala 


TCC 


36 


269065 


268992 


Zl« 


GAT 


37 


164389 


164318 


Asn 


GTT 


38 


87S22 


87450 


Hec 


CAT 
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What is Claimed is: 

1 . An isolated nucleic acid encoding a C. pneumoniae protein as set 
forth in Table 3. 

5 

2. The isolated nucleic acid of Claim 1 , wherein said nucleic acid has 
a nucleotide sequence of an open reading frame in SEQ ID NO: 1 . 

3. A probe comprising a hybridizing fragment of an isolated nucleic 
10 acid according to Claim 2. 

5. An isolated nucleic acid that hybridizes under stringent conditions 
to the nucleic acid sequ ;nce of Claim 2. 

15 6. / ji expression cassette comprising a transcriptional initiation 

region functional in an expression host, a nucleic acid having a sequence of the isolated 
nucleic acid according to Claim 1 under the transcriptional regulation of said 
transcriptional initiation region, and a transcriptional termination region functional in said 
expression host. 



20 



25 



7. A cell comprising an expression cassette according to Claim 6 as 
part of an extrachromosomal element or integrated into the genome of a host cell as a 
result of introduction of said expression cassette into said host cell, and the cellular 
progeny of said host cell. 



8. A method for producing a C pneumoniae protein, said method 

comprising: 

growing a cell according to Claim 7, whereby said C. pneumoniae protein 
is expressed; and 

30 isolating said C. pneumoniae protein free of other proteins. 
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9. A purified polypeptide composition comprising at least 50 weight 
% of the protein present as a C. pneumoniae protein comprising an amino acid sequence 
of claim 1. 

5 10. A monoclonal antibody binding specifically to the polypeptide of 

Claim 9. 
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Contig463 
Length: 273254.. 



1 


ATTGTTCCTG TAAGAACACT TCCAAAGCGC ATTTAATCAT 


TTTTAGTAAA 


51 


AAATAAAAAT ATACTTTTAA ATGTTGAGAA AATTTTTAGC 


TAAACTTTAT 


101 


AAAGGGTTGT 


TGGTGAAACC TTTGGGTTAC 


TCCTCAGAAC 


GACTTTGTGA 


151 


TTCTATAGTA 


TTAAAAGGAT CTTGGAGTAT 


AACAAGTAAA 


GATCTTTGAG 


201 


GATAGCGTAG 


GGCCGTATTT TGAATAGCGT 


CCAATAAAGC 


GCGTTTGCAA 


251 


AACGCTTGAG 


TTTGGTTGTC CCAATAGAAA 


GTGCCTTCTT 


TAGGAAGAAT 


301 


CTCTTCTGGA 


GGCACTTCAT AGACCGAAGT 


AAAGAGAGGA 


AGAGCAACGA 


351 


TTGCTGCATG 


ACTTTCTATA GCTGCTTTAA 


GGCAGTTCTC 


GTACGCTAGT 


401 


AAAGCTTGGC GATAATATTC TTGCTGATTT GGTAACTCTT 


CAGATTTAGG 


451 


GCCGCATACG 


TGGCCTAAAA AGGTCGGAAG 


AATACTTTTC 


TTTTCTGCAG 


501 


AGCTTAAATT 


TAGATTAAAC GTTTGATCTA 


GAGCTTCGTT 


TGGAAGTTTT 


551 


ACTACTCTCA 


CTTCGGTAGG GGAAAAGGGG 


TCTTCTCCTT 


TTGCGGGACC 


601 


CCCTTCGCGT 


TGCTTGCATG TATCCCACAC 


GCTTTTATCT 


TTTAGGGTGG 


651 


AGTAAAGGAT 


AGTAGAGAGG TTCGTTGCAG 


TGTTGTCGAT 


CAGATTCGTT 


701 


GGGCCTACGG GATTAAAGAT GATCCCTGTG GATTGATTTT TTTCGATCAC 


751 


TCTAAGTCCA 


GTTAAGAAAG TAGGCTGAAA 


TGGTTGAGAC 


GCATCTGTTT 


801 


GTATCGCTAC 


CTTGAACTTA GGGTTCAGGT 


GATTATTGTA 


AAATTGCATC 


851 


TCGTTTGAGT 


AGCAGTCTAC GTTTTTTTCT 


TGCCACGCTT 


TTCCCAAAGG 


901 


CTTGAAGTTT 


TGCTCTAGAA CTTTCTGCCA 


GTTAGAAGAT 


ACCTTTGAGG 


951 


TCATTTGGTG 


GTAGACTAAG AAGGTTACAA 


CTGAGAAGAG 


GGCCGTGGTA 


1001 


ATGAGAAGAG 


CCAAAAATAC AGGGTTCCCT 


AATACTATCG 


TTAAAGAGAT 


1051 


TCCAGCCACC AAAGCTCCTA AAGCTAAAGA AGCTAGGATT GCAAGAGTGG 


1101 


ATATTTTTGC 


TATGGTAAAC TGTTTTTTAG 


GAGCAATTTC 


TTTATCCCGA 


1151 


GGCACATAGG 


ATAGTACAGA AACTTGAGAG 


CTCTCAGTAC 


GTGAGGGTCC 


1201 


TGACATAACA 


TTTTTTTTGT AAAATACTTT 


CTATAATTTT 


AACATATTTG 


1251 


TGTTTATCGA 


TCCGAGAAAA TTGGAGAGTG 


AGAGCGCATG 


TCTTGCAATT 
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1301 TAGAATGATC GGGGACGACA TCTAGAGCTA TGTAGACATT GCGTGCGTAG 

13 51 TGGGAGCAAA TATAGCGAGA TATAAAGTAT AAGGGAATTG CTGTTAGGAA 

14 01 GATAAAGGAG CACAAAGGGT GGATACATAG CCCAATAGCT ATGGTGGTAG 
14 51 CAATCAGAGC TATCCAGACG AGTGCAATCG CAATAGTAAC GAAGAGGGCA 
1501 AGCTTGAAAT TATAGCGAGG ACGAGTAGCT GGGGGAAATA GAGAGGGAGC 
1551 CGTTCCATCA AAACCGGGAG TAGCTGAAGA AGCCATAAAC TATTAAAAAT 
1601 TAAGTTTTTT TCGGAGCATA AAGCATTTTA AAGTAGTGGG GTCTTTTTTG 
1651 TCACGGAGAT GTCCTGGACT TCCCAAGCGT TTCTAACAAA GATACCTGCT 
1701 TTTGAGAGGA GAACTTTTGA AACTCCTGCA AGGTCATCCT TCCTTGGCAC 
17 51 CAGTAGGTTT TTTCAGGAAA TCGCGGAAAG ATTTTGGCGA AAGCTCTTAC 
1801 AGTTGAAGGG CTTGTGAAGA TAATTTTTTT GTATTTAGAT AAAATATTTT 
1851 TTTTAAGTTT TCGCGGCTTC ACTGTGTAGT GAGGGTAAGA GAAAAAAGTA 
1901 AATCGATTGT AAAGAAATTC TCTGATCACA GGTCTTGCGA GGGAGGAGTG 
1951 GGGGTAGAGA ATGCGGGCTG AAGAGGGCAG TGCCTGTAGC AATGGGAAGA 
2001 TGGCTTCAGC GATTTCTTGA GTTGCTACTA CGTACTTCAC TTGTCCAAGG 
2051 AAAGAGAGAA GTCTTTCTTT GGTGGACTCT CCTATACAGA GGTAGGTCTT 
2101 TGTTTTTAGA GTGGCCTTAG AAAGAAGAGA AGTCATTCTG GAAAGGAATA 
2151 GGTGAGTGGA TGAGGGACTT GTGAGAATCA CATGGGTTGC TTGTGGAAGG 
2201 AATTGAAGAG CACGCTTATT TTGTGGAGTG CTTTTTGCAT AGGGGAAGAG 
2251 AGTTAGAATA GGCAAATAAT GAGCTTGGTA TTTACGAGCG GTTTTTTGAT 
2301 TCAATCCTAA GTAGAGGGTC ATGAGTCTTT TCGGGTAAAA GGAAGGCTGC 
23 51 CTAAGTTTTT GTACCTTCAA AGGGATATAT TGAAAATAAT TTTTCTTTTT 
2401 CCCTTGGTTC TTCTTGATCA TGCGTTGATT GACATTTTTC ACTTTGAAGG 
2451 CTAGGCTGGT TTTTTCTGGA CTTAGAGGTT CTCTCTATTA AGGCTTCGTC 
2501 TTTAGAAGTC CTTGCTAAAA GTTTTTGAGA AATTTAAGAA ATTCGCAATA 
2551 GTGGAAATAT TTACAAAGGT GGTTGCGGTG GTTTCGTTGT TGCATAAGTT 
2601 TTTAGAAAAT GCTTCGGGGA AAAAGGGACA AAGTTTAGCT TCGACAGCGT 
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2651 ATTTAGCAGC TCTTGACCAT CTCTTAAATG CGTTTCCTTC CATTGGGGAG 

2701 AGAATCATTG ATGAGTTGAA GAGCCAGCGT TCCCATTTAA AGATGATTGC 

2751 TTCTGAAAAC TATTCTTCAC TTTCAGTGCA GTTGGCTATG GGGAACTTGC 

2801 TCACAGATAA GTATTGTGAA GGAAGTCCCT TTAAGCGTTT CTATTCCTGT 

2851 TGTGAAAATG TAGATGCTAT TGAGTGGGAG TGTGTAGAGA CAGCGAAAGA 

2901 ACTTTTTGCT GCGGATTGCG CTTGTGTTCA GCCTCATTCT GGGGCTGATG 

2951 CTAATTTACT GGCAGTAATG GCCATTCTCA CGCACAAAGT CCAAGGCCCA 

3001 GCTGTCAGTA AGTTAGGTTA TAAAACTGTA AACGAATTAA CAGAAGAAGA 

3051 ATACACTCTA CTTAAGGCTG AAATGTCTTC TTGTGTTTGC TTAGGACCTT 

3101 CATTAAATTC TGGAGGCCAT TTGACCCATG GGAACGTACG TTTAAATGTG 

3151 ATGTCTAAGC TTATGCGTTG CTTCCCCTAT GATGTCAATC CGGATACGGA 

3201 GTGTTTTGAT TATGCAGAGA TCTCCCGGTT AGCTAAGGAG TATAAACCTA 

3251 AGGTACTGAT CGCAGGATAT TCTTCCTATT CTCGAAGATT AAACTTTGCA 

3301 GTTTTAAAAC AGATTGCAGA GGATTGTGGA TCTGTCTTGT GGGTAGATAT 

33 51 GGCGCATTTT GCAGGCCTAG TTG CTGGGGG AGTGTTTGTT GATGAAGAAA 

34 01 ATCCTATTCC TTATGCAGAT ATAGTGACAA CAACAACGCA TAAGACATTA 
34 51 CGCGGTCCTC GCGGGGGATT AGTTTTGGCA ACTCGAGAGT ATGAAAGCAC 
3 501 TCTCAATAAG GCGTGTCCTT TGATGATGGG AGGTCCTCTA CCTCACGTGA 
3551 TAGCTGCTAA AACAGTGGCT TTGAAGGAAG CTCTCTCTGT GGATTTCAAG 
3601 AAATACGCTC ATCAGGTTGT AAATAATGCT CGTCGATTAG CAGAGAGATT 
3651 TTTAAGTCAT GGGCTACGTC TTTTGACGGG AGGAACAGAC AACCACATGA 
3701 TGGTGATTGA TTTAGGTTCT TTGGGCATTT CTGGAAAAAT TGCTGAAGAT 
3751 ATCTTGAGTT CCGTAGGAAT TGCTGTGAAT CGGAATTCAT TACCTTCAGA 
3801 TGCTATTGGT AAGTGGGACA CTTCAGGTAT ACGTTTAGGA ACCCCTGCAC 
3851 TAACGACTTT GGGTATGGGT ATCGATGAAA TGGAAGAAGT TGCAGATATT 
3901 ATTGTGAAAG TATTGCGAAA TATTCGTTTA AGTTGCCATG TTGAAGGGAG 
3951 TTCTAAGAAA AATAAAGGGG AACTTCCTGA AGCCATAGCG CAGGAAGCTA 
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4 001 GAGATCGTGT TCGCAACTTG TTGCTGCGTT TCCCGCTCTA CCCTGAAATT 

4 051 GATTTAGAAG CTTTAGTTTA GTTAGGAGAG ACATTATTTT ATGGCAGACG 

4101 GGGAAGTTCA TAAATTACGT GATATTATAG AAAAAGAGTT ATTGGAAGCG 

4151 CGCAGAGTAT TTTTCTCAGA GCCTGTAACA GAGAAAAGTG CTTCCGATGC 

4201 AATTAAAAAG CTTTGGTATT TGGAATTAAA AGATCCTGGA AAGCCTATAG 

42 51 TTTTTGTGAT CAATAGTCCT GGGGGATCTG TGGACGCAGG TTTTGCTGTT 

4301 TGGGATCAAA TTAAAATGTT AACCTCACCC GTCACTACTG TTGTGACAGG 

4351 GTTGGCAGCT TCTATGGGCT CGGTATTGAG TTTATGTGCA GCTCCTGGAA 

4401 GGAGATTTGC AACTCCTCAT TCTAGAATTA TGATTCATCA ACCTTCAATA 

4451 GGTGGACCGA TTACCGGTCA GGCAACCGAT TTAGACATTC ATGCGAGAGA 

4501 GATTTTAAAA ACAAAAGCTC GCATTATAGA TGTCTATGTA GAGGCGACAA 

4 551 ATCAACCTCG AGATATCATA GAAAAGGCTA TCGATAGAGA TATGTGGATG 

4601 ACAGCCAACG AAGCTAAGGA TTTTGGTTTA TTGGATGGCA TTTTATTCTC 

4651 CTTCAACGAT CTCTAAATAT TTTATCTATT CTGGAGCAGG AAATCGTTTC 

4701 CTTCTTGGTG AAACACTTCC TGAGGTTGAA GATGTTCGGT TCTTATGCCA 

47 51 AGAGACGAGG GTTGATGGTT TTTTATATTT AAAGCCCTCT TCTTGTGCTG 

4 801 ATGCGCAACT CATTATTTTT AATTCCGATG GATCACGTCC AACGATGTGT 

4 851 GGTAACGGCT TGCGTTGTGC GATTGCTCAC TTAGCTTCTC AGAAGGGAAA 

4901 ATCGGACATC TCTGTATCTA CGGATAGTGG TCTATATTCA GGATATTTTT 

4951 ATTCTTGGGA TCGTGTGCTT GTAGATATGA CTCTCGCAGA TTGGAGAGCT 

5001 TCTGTTCATC GATTGGAGTC GCGTCCTGAT CCTCTTCCCA AAGAGGTCGT 

5051 TTGTATCCAT ACGGGAGTGC CTCATGCTGT CGTAATTCTT CCTGAGATTT 

5101 CTACTTTAGA TCTTTCTATC TTAGGTCCTT TTCTTCGCTA TCATCAGACC 

5151 TTCTCTCCAG ATGGGGTGAA TGTCAATTTT GTTCAGATAC TGGGACATTG 

5201 CCAGTTGCGC GTTCGTACTT ACGAACGTGG AGTCGAAGGG GAAACTGCAG 

5251 CTTGTGGAAC AGGGGCTCTA GCTTCTGCTC TTGTTGTGTC AAACTCCTAT 

5301 GGATGGAAGG AGTCGATCCA AATCCATACT TGGGGTGGAG AGCTTATGAC 
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5351 


TGTGAGTCAA AATAGGGGAC GGGTATATCT TCAGGGCTCT 


fiTAACTAGAG 


5401 


ATTTATAATT 


AGATGTGATT 


TTTGATTTTG 


TCATGCAAGG 


» TTTT AAAAT 


5451 


CTTGTTTAGG 


GATAGATCTT 


GCTCTCTAAC 


TGGGATTTTT 


PTATAATCGT 


5501 


AATTTATGAT 


GACGTATCCT 


GTACCACAAA 


ACCCACTTCT 


TTTAAGAATC 


5551 


CTTCGTCTTA 


TGGATGCATT 


CTCTAAGTCT 


GACGATGAGA 


ri GG A C TTTT A 


5601 


TTTAGATCGT 


GTTGAAGGGT 


TTATTCTCTA 


CATAGATTTA 


C5ATAAAGACC 


5651 


AAGAGGATCT 


AAATAAGATT 


TACCAAGAAT 


TAGAAGAGAA 


TGrrGAGCGG 


5701 


TATTGTTTGA 


TTCCGAAGTT 


GACGTTTTAT 


GAAGTAAAAA 


AAZVTPATfJflA 


5751 


AACGTTTATC 


AATGAAAAGA 


TTTATGATAT 


CG AT AC C AAA 


a a h a^TTPP 

bAAAhljl 


5801 


TTGAGATTTT 


GCAATCCAAG 


AATGCCCGTG 


AGCAGTTTTT 


AljAo 1 i 1 A 1 1 


5851 


TATGATCACG 


AGGCAGAGTT AGAAAAGTGG 


CAGCAATTTT 




5901 


TTCTCGAATT 


CGAATTATAG 


AATGGCTTCG 


CAATAATAAG 


1 i LLA1 1 1 lo 


5951 


TCTTTGAAGA 


AGATCTAGAT 


TTCACAAAGA 


ATGTTTTGGA 




6001 


ATACATTTGT 


TTGATGCCAA 


GGTGGGGAAA 


GAAATCACTC 




6051 


GTTGTTGTCG 


AACAAAGCTA 


AGATTTACTA 


TTCCAATGAA 




6101 


CTCGTCCGAA 


ACGAGGCCGT 


CCTCCGAAGC 


AATCTGCTAA 


GGTAGAAACA 


6151 


GAAACAACAA 


TTTCGAGTGA 


TATTTATACA 


AAAGTCCCTC 


AGGCTGCTCG 


6201 


TCGTTTCCTT 


TTCTTACCCG 


AGATTACTTC 


ACCCTCTTCA 


ATTACTTTCT 


6251 


CAGAAAAATT 


TGATACGGAA 


GAAGAATTTC 


TTGCTAACTT 


GCGCGGTTCG 


6301 


ACTCGTGTTG AAGACCAGCT GAATCTTACC AATCTTTCAG AGAGGTTTGC 


6351 


TTCTCTTAAA 


GAGCTTTCGG 


CTAAGCTTGG 


TTACGACTCT 


GTTTCTACTG 


6401 


GAGATTTCTT 


TGGTGATGAT 


GATGAGAAAG 


TGGTCACTAA 


GACGAAGGGG 


6451 


AGCAAGCGAG 


GCCGCAAAAA 


ATCTTCTTAA 


TCTTCTATTT 


TGTGAAGTAG 


6501 


TTTATTTTTA 


GACGCTGTTC 


TTATTGCTTC 


TTTACATGAT 


CTTATTACAA 


6551 


ATCTTTCTTA 


TTTCTATTTA 


TTGTTTTGTT 


AAAATTTTAA 


CAATAGCTAT 


6601 


TTATTATTAG 


TCATTTTTTT 


AATTAAAAAA 


CTGTTAAAAT 


TTTTAAAGCT 


6651 


AATTTAAGAA 


ACAGTGAATA 


GTTCATCATG 


TCATCACTAC 


TGAGCTGCGG 
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6701 AAGAATAGAG 

67 51 ATACGAGTCA 

6801 TTTTTATGCG 

6851 GATTCCAAGC 

6901 TTCTTTTTGT 

6951 ACTTATAGCA 

7001 TCACAAGCCC 

7051 TAGGACGATC 

7101 TTCTCTCCAT 

7151 TTTCCTATTT 

7201 TGGATTGGGC 

7251 TGCTTTGTTA 

73 01 TACGATTTGT 

73 51 TAGATTGCTA 

74 01 TTGAAGAATA 
74 51 TAGCCGTCCT 
7501 GCAAGGGGGC 
7551 CATAGGGCCT 
7601 AGTTTGCTAA 
7651 AAATTATGTA 
7701 CTCTGTGAAA 
77 51 CTCTTGAGCC 
7801 AGTACTTTGG 
7851 TTGGTTAGAA 
7901 GTAAGGACGC 
7951 CCTACTCAAG 
8001 AGCTCCAGGG 



PCI7US99/26923 

CCGACTCGGG TTACCTGTAG CTTAAAGACG TATCTTGAGG 
GAATCAGTTG AGCACACGTC TAGTTCGGGC AAGTGTCATC 
CATTGTTGAT CATTTTGGTT TGTGTGGCCC TCTCTAGTTT 
ATTATGGCCT TGGCGACCTC TTTTACGGTA ATGGGGTTAA 
GATGTCACTT CTTGGTGACG TTGCAATTAT AAGTTATCTT 
CTGTTACGAG TTACCGGCAA AATAAGAGAG CTTTTGAGAT 
GCTCGCTCCG TTTACTACGA GGGGGTCCGC CATTGGGATT 
ATCTTTAGGC ACAGGCGAGA TTCCTATAGT AAGGACGTTA 
TTCAGAACCA TGGTCTTAAC CATGCCTTAG CToCTAAAAT 
ATGGAGCATT TCAGCCCTGA GCCACCGAAC GAGCCTTTGG 
CTGTTTGATT CGGGATTTTA GGCCTCACGT CAGTTCTTTG 
TTGAAAAACA AGGGTCATCG CTGAGGACTA AGGAAGGCAA 
GAGGCTTTCC GCTCTGATTA CGACGCCCAT TTTGCTATGG 
CCGGTTGATC CACTCTAAGT TGATTATAGA GAAAATGGGA 
TCGATATCAT TCCGAGTGTC ATGGTTCGTG AAGATTATCC 
GGGGAGGGCT ATCGCGAAGG CCTATTACGT ATGTATGGTG 
TCTGTGACTT CCCTACTTTA GTTCCTAATG AGCGCTTGCC 
TTCTTTGTCC CGCAGCACAC TTCCGGTGCG AAGGGTAAGG 
AAGGAATTTT TCTATAATTT CGGGATTGGA TGACATATTA 
TTCTTCAAAG GCGTCCTTTT GCTTTGCAGT GGG AT AACCT 
AGTGATTATG AGGAGGCTGG GCCCGCTATT GGGATACGTT 
ACAAGTTTCT CAAATTTCTC CAGCCCACGG CCGGCTATGT 
TCCAGTGGGC CCCTATCCTT GGTTCTGAGG AGCAGCTAGT 
GAAACAATGA AGCGCCTAAA GTTTCCTAAA AGTTTAGGTA 
TGTTATTGTG GATTCGGAAA TGGTTCCTGT GAACGCCAAT 
AGATACCTGC AGCTTCCGAG ACTGTAGAGT CTTCACCTGT 
AATACAACAG ATACCATGCC TGCAGCTTCG GGAACTACAG 
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8051 ACACCACATC TGGGGTTTCA GAGGCTGCGG CGGCTGAGGC TGCCGTGGAT 

8101 TCTACACCAG GGACAGAGGA GGAGCCGAGT TTTTCTCTGA GGTATGCGCT 

8151 TGTAGTTCAA AATGTTCCCT ATCCAGAGCC GCCTAAAGAA CCTGAGGTGA 

8201 TGTTTACAGA TGAAGAAAAA AGTCTGATTT TAGAAGCTAC TCGTGCGCGT 

8251 CGTATGGAGT TGGACTTGTA TAATGGCTAT TTAGCTGATT ATGAACTTTC 

8301 TAAGGATGAA ATACAGAAAC ACGTTCCTGA TTTACCTGAG AATTGGCGTA 

83 51 CGAATTGGCG TTGGTCGGAG AGGCTCTATA AATTTTTCTT TAAAACAAAG 

84 01 AAAGAAGGAT TAGAAGAAAT TTTCTTAAAC AAAGAGTTAG GGAATATGAT 
84 51 TCTTGCCCGA GGGCTGGCGG CAACTCAGTC ACAAGCACGT ATTAAAGTAT 
8501 TCAATTCTTT AGTGGCATGG CTCTTGCAAA GCTTTAACGT AGGGAGGAGC 
8551 TGTACAGCTA AACCTCTTCC TACGTCAAAA CTAGACCTCT TTAAATCGGA 
8601 ATTCGAGTCT AAGCCTAAAA ATAACATCTT AACGGAATTT TTGGTGGCCT 
8651 CTGATGAGGA GATTCTCTTT AAGGGGCTAC GGGTCCTAGA GCCTGGAATC 
8701 GAAGGTTGGT ATGACCATCC TGATCAAGCT GGAGAGATTC GGTCGGTACT 
8751 CGAGGGTCTG GTGCAGGCTG GACGTATTTC TGGATATTGG GAGAATCAGC 
8801 CGTTTGGGAG ATTTGTCCTT AGAGGAGTTG GTGAAAGACG TACCGAGCTT 
8851 GTAGAGCTTT TGGAGAGTTT AGTTGCTTCT GGTGAGATTA TGCAGTTCTT 
8901 TGAGTCTTCG GATGAAGAGG GTGCTTTTAT TATCGATAAC GAACCTAGCA 
8951 AGACTGCTAT GCTAAAACAG CGATTTAAGA GTTGTGTCAG GACGAAGCTT 
9001 GTCGGGAGTT TTGCTGATGA GAGTCTTCCC AGAGGTAGGT TTACCATTTT 
9051 AGTTTAGCGT GGGGTAGAGC ACTCCACGAA TCTTAGGGAG CTCCTTGCGA 
9101 CCAAGCTTGG AGATCCTCCA TGTTTTATTG TTTCTCTAGT AGCCAAATCG 
9151 TAGCCGCTCC TAGGAACAAT TTTTTCTTTT TCGCAATATA AAATCCTGAT 
9201 TTAGAGAATA GGTCTTCAAG ATCGTGGTCC TTTGGAAGTT GCTGGATACT 
9251 TTTGCTGAGA TAGCTATAGG CGTCGGGATC TTTAGAAACA GACTTTCCAA 
9301 TCCAGGGGAC GACAGCACGC AAATAGAGCT TATGGGCACT ATAGGTAGGG 
9351 TGTGTTTTTT TTGGAGGTGT GAGCTCTAGA ATGCCCAGTT TTCCAGAAGG 
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94 01 .CATAAGCACT CGGGAGATTT CTTGTAGGGC TTTATGTGGA TCCGAGAGGT 

94 51 TCCTGAGGCC ATAGGCCATC GCTGCTAGGG GATAAGAATG ATTCTCCAAG 

9501 GGCAGTTGAT TAATATCGCT ATGAATAAAA GAGCAAGAGC CCTGGGGAAG 

9551 GTGTTGTTTT GCAATGTCGA GCATTGCTGA GGAAAAGTCG A CG AG AGTTA 

9601 CTGATGCTTG AGGGTGTGCG GCAATATAAC GCTTCGCGAC TTTTCCTGTT 

9651 CCTGCGCAGA GATCCAGGAG AGAGTATCCC GACCCTAGGA TCTGGATCAA 

9701 AGAGCGATTC CAGAAATGGT GCATTCCTAA AGAGAGTATT GTATTTGTGC 

97 51 GATCATACTT ACTCGCTATG GAATCGAAGA TCTTTTTACA GTCGGGCTTG 

9801 TTGGTAGAGG GTTCCATAAT ATTCCCGGAA TTTTTCAAAG CTTTCGTAGT 

9851 GTTCTTCTCC TAGACGGTAC TGGCATAGGG CATAGTATTC TTGAAGAAGA 

9901 GAAGGGGGCA GACCTGTATG TTGATGAGCT TCTTTAAGGA CTTCTTCGGG 

9951 TGAAGATTCG AACTGTTGGA GGGCTTCTTC CATCGCAAGG TTGGGTAGGG 

10001 GATGTTCTTT CCAAGAGGTG CTGTGTAGAA GAAGAGCAAA TACAAAAGGT 

10051 AGCTTTGTAA GATCATACCA CCCCGAGGCA AGGTCATAGG TTACAAATCC 

10101 AGGAAGTACA GGATGTTGTA GCGCTGCATC TCCGATTAGG AGGAGGCCAT 

10151 CATAATTTTC AGGGGTTTGT CTGAGTACTT TTGTAGTTAT GAATCTTAGG 

10201 ATATGAGGAG TTGGGATGCG CCAGAGATGA CGACAAAGCA CTTTTAAGAG 

10251 TCCTATAGAG GAGCGACTTT CTAAAGTTGC GGCAATCCGA GGTTGCGGTG 

10301 AGTTAAAGAA AGTGGGAGCT GCATAGAGGT TTACACTGAG GATACGTTGG 

10351 TTTGCTGCAA TTCCAAAGCC GGGGACATAC CCCAAGTTAT GAGAGATAGC 

10401 TCCTAGGGAT GAGGTCAAAG CAACATCGAG TTTCCCTTCG ATTAGCAAGT 

10451 TGAGGAGGTC TGCAGGGGGA GCAAGAACAC AGCGAATATC GTTTCTTTTT 

10501 ATGAGTTGTA GGGACAGCGG AAAGGAATTA ATATAACTTA CGCAGCCTAA 

10551 GCTTATACAT GGCTGGAGTT GGTTAGACAT GGCGTTCTCC CTTGTTGTGT 

10601 GATGAGGGCC GCCATTCCCT CAGCGTCCAT TTTAATAGGT TCTTTAGATG 

10651 AGGCCATCTG GAAAACCTTT TCCCCCATAT GTGTTGAAGA AAGGTCATTA 

10701 GCACCACAGG AAAGGAGGTC TAGAGCTGCC TCAATACCTA GGTAATTCCA 
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10751 


TAAGGCTTTC 


ATATTGGAAA AGTTGTCTAA GAAGATTCGG 


p pt a r*rc r n zv 


10801 


TTAAAGATTT 


TAGAGGGATG GCATGACCCT GGCCTGATTT 


1L1 1AA1 LI i 


10851 


TTTCCTAGGA 


CATTATTTTC TTGGGCGAAT TTTAGAAGTA 


1 (jMb I I 1 I 1 


10901 


AAAGCCCTGA GTTTCGTCTT GTAAGTCGCG GACTTTTACC 


A 1 Vj i 1 OA 


10951 


CGAGGTCTTC 


AGGTCCTTCT TTATGATAGC AGAGCATGGT 


TATATTGCTA 


11001 


TGGATTCCCA 


GTTGATGAGC CATCTTATGG ATGTTGAGAA 


AATCAGAAGA 


11051 


AGAAAGGCGT 


TTGGGAGCTA AGAAATTACG TATTTTGTCG 


ACGAGGATTT 


11101 


CAGCTCCTCC 


TCCGGGGATG GAATCAAGAC CCGCATCTTT 


TAATGTGAGA 


11151 


AGAACATCGC GAATAGAAAG GTTATCAAGA TCTGAGAGAT AGGCATATTC 


11201 


AATGGCAGTA 


AGAGCTTTGA TATGGATCTG AGGATCGTAC 


TCTTTGATTT 


11251 


TAGTAAATAG 


ATCGGAATAG TATTGCAGAT TGCAGGAGGG 


GAAACAGCCT 


11301 


CCCACGATAT 


GTACTTCTGT AATTGGAGTT TTTATATTTT 


GGATTTGCTG 


11351 


TAGAAGATCA 


TCTGGGGAGT AGAGCCATCC TTTAGGGTCT 


CCAGGTTTTG 


11401 


CATAGAAAGA 


GCAAAATTTG CAGCTGAAGT CACAGAAATT 


TGTAGGATAG 


11451 


AGGTACAAGG 


TTGAGGAGTA GTATACAGTG TCGCCAACCC 


GTTGTTTGCG 


11501 


AACTTGGTCT 


GCAAAATTCC AGAGTGTGCG TTGATCTTCT 


TTATTCGTGA 


11551 


GGAGGAGGAG 


ATGAAGAGCG TCTTCACTGC TTAATCGTTC 


TTGGGCATCC 


11601 


AGTTTTTCGA 


ATATGGAGTA GAGGGGGGAA GTTTTAGGGG 


GCTGTGGGAG 


11651 


GCACGTCGTC 


ATTTGATGAA CACTTTGATG TACTATTCTC 


TCGAGATTTT 


11701 


GTAGCACAGT 


GCTCTGTTTT GTCACATGTT TTTTTTTGGC 


AGCAATCTGG 


11751 


TTTTTGACAC 


CCTTTAGAGA GGGGCCTGCC AAGCAAATGG 


GAACCTACAA 


11801 


GTAGAATACC 


CATCCCTAGA CCTAACAATA CTGTGGCACA 


GCAGATTACG 


11851 


AGAAAAAGTG TCATCATAAG AAATCCTTAG ATAGGATAGT 


TTCTTAATTT 


11901 


AAATCCACCC 


AGATTGGGGA ACTCCAGGCC ATAGCATTGT 


CTGCCTGAGT 


11951 


GACCCTGAGA 


TAGTAGAATA CAAAAGGTGC TTTACCGTTT 


GGATCTTTTA 


12001 


GGGTCACTGA ACTTAGGGGT ACCATAfCAT CGTATTCATA GTCCAGGTTA 


12051 


TTGCTATCGG 


GGAAGAAGGT ATGGAGAACT TCGCCATTGC 


GGATGATTTC 
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12101 TACAGTCTTG AGTAGGGCAG TGCCTGCCAC ATGACCAGAG ATGTGACGGT 

12151 TGACGTTGAG TCCAGGTTTC GACCCTGTGG AGAGTTCGGA GCCCATAGGG 

12201 GCTGAAGTGA TGTTGAAGCT TAAGACGATC CTAGGTCCTG TTGTAGCGTA 

122 51 GCAATGACGT GCGAATAAAG CTTCAACAAG AGACTCTCGG GTATATTTAT 

123 01 TACAAATGAT AGCCGTCAAC CCTGGGGAAT ATTGCACTTG CGGAGAGTCA 

123 51 AAGTAGTCTT TATAAATTCC TCGATCGTCG AGACCCCCAG CAACAAATCC 

124 01 GAAGCGGAGA TTCTTCTTTA ATCCTTCAAT TACTGTACCT CGAGGATCTT 
124 51 CGCTATCTTT ACCTTGGATA GGGAAGGGGT TGTTTAGAGC GGCTGTGGTT 
12 501 TCTGAAGATC CCCAGGCATT ATAAATTTCT ACAACTCTTT CGAACTCGGG 
12551 GTAGAAATTC TCAAAGTCAA AACCATGTTC TTTAGAAGCT GTGAACGAAG 
12 601 GAATAGAAAT CATGTCGTGG TTGACAGTGC TTTTATAGAG CTTGGCGAGG 
12 651 GGAATATGTT TGTATTCTTT GTGTTTCGAG TGGGACTTTG TTTCCTTGGT 
12701 ATGAAGGATG TGACGCACTC CCTCGAGATG AGGTTCTCCG CTATATTGGA 
127 51 ATCCGGATAG TGTGATGAAG CGATCTTCTT CATTAAAGTC GGAGACAGTT 
12801 TGATTGATGA GCTTCCAAAT ATCTGGAGAG AGGTTCTCTT GATTTTCGAA 
12851 TGATGAAGAA GCATAGAAAT TCAGAGCGCG GTCATCTCGG AAATAACGCA 
12 901 TACAAGTTTC AATATTTTCT TCAGAGTCGA CGCGTTCGGA TTCGCCGTGG 
12951 AGGAGACCCC ACATAAGATT CGGGGCGGAG TCAGCGAAAC ATTTGATAGG 
13001 GGCAGAGATG AAAATTTCTT GTGTAGAGAG GTTTTTCAAT TGGATGCGAT 
13051 AAATTCCAGG CTCATTGAAA TAGAGATTAG GAAGAATAAC AAAGCCTGTT 
13101 TCTGGGATGA AGAGCTGCCA ATTTAAATTT TCTCTAAQAT GCTCGTAGGA 
13151 AAGCTCGATT CGGGTCTCTT CAGGAGAGAA GTTGGTGAGG TTCCCGAATT 
13201 CGTCTTCAAA TCGCACGGTG ATATCGAAGC GTTTGTTTTT AACGACATAG 
13251 GAGGGAGTAA AGATCTCTAT TTTTTTTAGG ACGTTTCCGC GGATATCCAT 
133 01 AGAGAAGACA TCGGGTTCAT CATAGTTTCC TTCTCCTGTA GGATCGATGT 
13351 AGAGGTAAAA GGGTTTGCGA CGTTGTGCGA AAAGTTGGGC TCCGTTCCCA 
13401 GCATCATCGA CTTGAGGATG GTTTGGAGAG GCTCCCATGA CAATAGTGAG 
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134 51 GGTTTCTCCT ACTTGAAGTT CGTAGGGGAG AGTAAACTCG AATTGTGGAA 

13501 CGGGATTGTC TTTTACAGGA ATGGCGGTTG CTTCGATGAT TTCGCCTTCT 

13551 GGCATTTCTG CGTAGATTAC GTTTCTAGTT TGGGAGAGAT CTGTCGCGGG 

13601 GGCTTCCCAA TCTGTGGGTT TCCCACTTCC TGCTAAGTCA AATTTACATT 

13651 TGGTTCCAGC TGGTAGTGGT GTGGCAAGGG AATAAAGAAA TTTCCAAGTA 

13701 GAAATTTGCC CTGCTCGAGC TATCGAAGGG TTAACGTAAC AAACAGATCG 

13751 TCGCATAGTA AGAGGGAGGC TTTATATGAC TTAAAAGCGC CATCATATAC 

13801 TAACAGTGAG GTTTTTCTCA ATCCCCGTCT TTGTTTAGTG TTTGTATCGC 

13851 TTCATCCACA GTATTGAATA TTTTAAAGTA AGAAAGGAAT CCTGTAACAT 

13901 AGAGAGTTTG TTCTATGGTT TTTGGGACTG TAGTCAGGAC AATTTTCCCA 

13951 GAATGTTGTC CTACTTGATG GTAGCTTTGC AGTAGGACTC GGATACCTGC 

14001 ACTGGACATG TAATCGAGGT GAGCACAGTC GAGAATGATA TTTTTGGATC 

14051 CAGCTGCTAG GGATTGGGAA ATATTTTCTT GTACTTCTGG AGAAGAAATT 

14101 CCATCAAGTT TTCCGTGGAG ATGAAAGATT GTTGTTGAGC CGTGTTCTTC 

14151 TTTTTGGATA TCACTCATCT AGATAGTTCT CCTAACTATA CGGGAGCTTA 

14201 AGTTTTCACT CTGATAAATC TTTAGCTTTT TTGCAAAGAG ATTTTTATTG 

14251 GTGATGTTTG AGAATTTCGA TTGGGGGGAG GCAGGATGGG ATCGTGGAGT 

14301 GCAAGGAAAT CGGTCCTAGG ATGTTTACAT TCTTAGGAGA TATTGAATTT 

14351 ACGTTTTCTT GGTGTGATTT TTAGTTTTCC GACATTTCGT TCTGTGCAGG 

14401 TAATGATTTC TATATCGAAG TTTTCGTGAT GGATACGCAT TCCTTTTTGG 

144 51 GGAACAGCAC CCACTTTATG GAAGACATGT CCTCCTAGTG TATCGTAGCT 

14501 ATTTTCATGA TCGATTTTCA AATTGAAGTA CTCTTCAGCG TCGGAGATAT 

14551 TCATTCTTCC ATCTACAATC CAAGAGCTTC CGATTTTCTT ATAAGGAGTA 

14601 TTTTCTTGTA CGTCGTGCTC GTCTGCGATC TCTCCTATAA TTTCTTCGAT 

14 651 AATATCTTCC ATGGTAGCGA TGCCTTCTGT GAATCCGTAT TCATTGACTA 

14701 TGATGGCTAG ATGGCGATGT TTTTGTCGGA ACTCTTGGAG AAGAGAGGAG 

14751 GCTTTTTTTA TTTCTGGGGC ATAGAATGGG GGTTTGCTAC TGAGGATATG 
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14801 


GGTTGGCTGA GGTCGTGGCT GCTTGTATAG 


AGCAGTAAGA 


GATCTTTAAC 


14851 


AAGAAGGATT 


CCTGTGATGT TGTCTAAGTT 


TTTTTTATAA 


r^r*r*r* ft ft rT^c 
ACGGGAAC GL 


14901 


GACTGTAGCC 


TTCTTCGCTT ACGAGAACCA GAGCTTCTTG 


«T» ft OTr*T> ft C f V r T l 

TAGTGTAb 1 I 


14951 


TCTTCGGGAA GTGCGAAAAT ATCTACTTTT GGGATCATGA 


frnTTsr* \ r*r*f* ftp 
LI 1 L,AL.LjoAL. 


15001 


AATGAGGTTA 


TCAAAAGCGG AGAGGGCTTC GGAGAGCTGG 


OTTTP ft ft ATP 


15051 


ATGTTGAAGA 


TCGTACTTGT TGGTTAGGGC 


GGCGTCTGTA 


ft ft x^RpriPT 
AAAGAGCAG i 


15101 


TGCAGTGGGA AGAGACCGAG TTGGAATACC 


GAAGCTAGAA 


AACGGAGGTG 


15151 


GGCGGTGGTT 


TCTTTAGGGA CTTTTGTAGA 


GATCCATGGG 


GGGAGGAATC 


15201 


CGTAAGCTAT 


CAGGGCGCTT AGAGAGTATA 


GGGGCCAGAA 


TAGGAGATCT 


15251 


TTGTGAGCTG 


TTTTTGGAGG GAGGAGGGTA 


TAGAGTTTTG 


TCCCGAGAGC 


15301 


TCCATAGAGG 


ATGCAGAGCA GCGTGGCGAG 


AATTGTAGGA GCACTGGGGA 


15351 


AGGGGGGATA 


CTCTCTTCCT TTATCTTTGA 


AGAAGCGTTG 


GTTTAGGGTT 


15401 


TTTAGGAATT 


TTGAGGATCC GTGACAGGAC 


GGTTGCGTAA 


GCCCGAAGGC 


15451 


TAGGAATAGA AGAATACAGA ATATGGCTAA AAGAATATGG AGCATGTTAA 


15501 


GCTGTTAGCA 


AAGCATGTTT TTTTCTTAAC 


ATACACAGGA 


TTTGATTTTC 


15551 


TTTAACTCTC 


ATTTTTCTCT TTTCTTCTGA 


TGAGGTGTCG 


TCGTATCCGA 


15601 


GCATATGGAG 


AATAGAGTGG ACGAGGTATC 


TCGAGATTTC 


TTCGTAGATA 


15651 


TCCTCTTGGT 


TTGGGGATGT GTTCTCTAAA 


AACCTAAGAG 


CGGCCTGTGG 


15701 


GCTAATGAAT 


GCTTCTCCTA AAACATGAGG 


ATAAGCGGGA 


TCTCCGGGAG 


15751 


CATCAATAGG 


CAGAGTGATC GTATCTGTTA 


GAGAAGGATC 


AGCAAATACC 


15801 


TTATCATGGA 


GTTCTGCAAG AGCTTTATCT 


TCTAGGAAGT 


AGATAAAAAT 


15851 


TTCATTAGTT 


GTTACTTTTA AGTGCTCTAA 


GAGCGTAAGA 


ACCAGCTTCT 


15901 


CTACAGAAAC 


CAAATGAATA GGAATACATG 


TTTGCTCATT 


GGAAACATGT 


15951 


ATTTTGATCT 


TTTCTTGCGT CACGCGAATG 


AAATTCCCAT 


AGAGAAAAAC 


16001 


AAACTTATTT 


TAAAATAGGG GTCTTAGGTA 


AACCTGTGAC 


TTTTTTCGCT 


16051 


GTACTATCAT 


TCCAACGGCC CAACTTACGC 


AAGACTTCTA 


CTCGCTCAAA 


16101 


ACGCTTCAAA 


ACATTTCTTT TGGTAACCCC 


TTTGACAGAT 


TTACCATAAC 
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16151 TACGATGTCG AGACATAATC CTGCTCTAGA AATAAACCTA TTTTCGGGAT 

16201 AAAAATGCTA TCACTGGTGC TCAATGCATT CGTATGCAAT TTATATACAA 

16251 TTCTTGGAGC TGGCGCTGGA TGCACAATGG CATACTTAGG TTTACGTTTT 

16301 TTAGGACTTT TCGCTCTGCG CCGAGCTTGT TTACTCATTC GTGTCATAAA 

16351 TAACTCGATA TTGAATTTTT TAATTTCTCC AGACAACGGA AATTGAGGAT 

16401 ACGGAGTACT TTATAAGAAA AAGGATAGTA AAAGAAGAGT TTTTTTTCAA 

16451 GAAGATGACG TCTTTTAGCT GCCTTGATCT TGGTGTAGCT CGTCAGGGAG 

16501 GGGAGACGAG GGGCATGCCA TGGGGGTTGA GAGGACTCCA CAGGAGATAA 

16551 TATATTTGAA AGCATCTTCG ATTTTCATAT CTAGGAATAC GATATCAGAT 

16601 TTTCTAAATA GGGTAAGAAA CCCTGAGGTG GGGTTGGGTG TTGTTGGGAT 

16651 GAAGACCGTG ACGAGGGGGT CGTCTTCCTT TTCTCCTGTG CAGCATACTG 

16701 TGGGTGCGTC TCCAGCGACG AGACCGATGC ATTGAACATT TGCGTTAGGG 

16751 AAAGGAACCA TAACTACTTG TTTGAAGGAT CCTGATTTTG ATCCAAATAT 

16801 GGTAGTCATG ACTTGTTGCG CAGCTTTATA CACTGTTTTA ATGATGGGAA 

16851 TTCGGTGTAA GATTTTGTCG TAGATAGAGA GTAGGGATTT AAAAATCATA 

16901 ATTCTCGTGA GGAAACCTAG GAGCACTGTG GCGAAAAAGA GACCGAAGAG 

16951 TAAAATGATT TGCAATACGA ATTTTAGAAG AGCTCTATGT TTAGTATAAA 

170C1 AGCTAAATTT CTCAAAGAAT TCCGAAGCCA AGCCTACGAA GGGTTGGGTT 

17051 AGGAAGTTCA TGATCATAGT AACAATAGCA ATAGTAATTG CTAGAGGAAG 

17101 GAGAATAACA AGTCCTGTAA TAAAGTATTT TTTCATGATT CTCCTGCAAG 

17151 ATATGAGGAA ATGGGCATTT GTTTCTTTAC TATACAGCTT AAGATTATTT 

17201 AAGATAAAAC TTTTCCCGAA TCTTCTGGGG ATAGGAGAAA TCTCCATGGG 

17251 ACATCACGAT ACTCTTGAGC ATAATCGATG CCGATCCGGG CAGTTGCTGT 

17301 TAGAGTCCCA GAGATTTTTT CTTTGCTGAT ATAGAGAGCT GGGGTATTTA 

17351 GGCGTTGCCT ATTGTTTTCC AAAGAGATTC CTAGAGCTTG GCACACTTTT 

17401 CCGGGTCCAT TGGTGAGAAG GTGTGGGGGT TTATCTCTCC ATTGGCGGCG 

17451 TTGGATCATA AGTTCTTTGC CTTGATCAGG AAGGATGGCC CGGATCAGGA 
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17 501 CGGCATGGGG AATGTCCTCA GGTCCAGTGA CAACATTCAA TAGGTGATGC 

17 551 ATGCCATAGC AACGGTAGAG GTAAGCAGAG CCTCCTTTCA GGTACATCGC 

17 601 TCTGTTCCTC TGAGTTTTTC TGTAGTTGTA GGCGTGGCAT GCTTTGTCAT 

17651 CAGGGCCACG ATACGCTTCG GTTTCTACAA TGTAACCTGA AGTTATCAGA 

17701 CCCTCATGTG TTGTGATGAG TTTATGTCCT AAAAGCTGTT GCGCT AGTGT 

17751 AATTACATCT TCCGATAGAA AAAAATGTTC TTGTAGCACG TTACGAGGCT 

17801 CTTTTTTTCG TTCCTTTTTT CTTAGAAGGC GTTTTCTTTA TTTTCTTAGG 

17851 TTTATCTTCT GTGGTTGTCG CTATAGACCA GACGATTTTT TGCGTAAGGA 

17901 GATTCACGGA ATCAATAGTG ACTTTTATAG AAGCTCCAGG TTTCATTTTA 

17951 TCTGGGATAG ATTCTGGAAG AGCGTTTTTC TTTAGGGAAT ATTCTTTAGG 

18001 GAGTTCTGCT GCTGCAATGA ACCCTTCATG GCAGAATTCG GTCACTACAA 

18051 ATGAGAGTCC TTCATGATTT GCAGTGATGA TATACGCATG GTATGTAGTT 

18101 TTAGGTTGCT CTTGCAAAAA TTTATTTATG AACCGAGTTT TTTTGAGGTT 

18151 TTCGAAAGAA TTTTCTGCTT TTGCGGATAC TCGTTCTTTT GTAGAGCATG 

18201 CTCTTACGAT AATTTCGAGG TGCGTTTGGT CTATAGATAG GGGGTTGAAG 

18251 AGAAGCCTGT GAACAATAAG ATCGATATAT CTACGTATGG GACTCGTAAA 

18301 GTGGGTGTAG TAGTCGAGCT TAAGTCCGTA ATGACCTTTA TTTTCTGTAG 

18351 AGTAGGAGGC TGTTTTCATA CTTCGGACAA ACTGCGAGTG TAGAACTTGC 

18401 TCTAGGGGAT GTCCTGCTGA CGTAGTTTGC AAAAGGTATT GGTAATCAGG 

18451 TTCTTGTGTG GGAGTGAACG TGATATCAAA GCCCATGTTT TTTGCCAATT 

18501 CTTGGAAGGC GAGTAGGTTT TCATCATTGG GAGGTTCGTG ACTACGAAAA 

18551 GGTAGAGAAA CGCCTTGATG GGAGATATGA TAGGCGACCA CTTCGTTTGC 

18601 TTTAAGCATA AACTCTTCGA TGAGTTTATG GGAGAAGGTC TGGTGGTTTT 

18651 CTATCAGAGC TACGGGTTCT TGAAGATTAT CCAAGGACAT AGTGACTGAG 

18701 GGGAGGACAA AGCGAATGCA ACCACGTTCT TCACGGATAT CGGAAAACTT 

187 51 TTTACTTAGA GTGGCCATCT CATTGAGGAT TTTTGAGAGG GGGTGGGAGT 

18801 GTTTCTTTTC AATGATGTTA TCGACTTCAT CGTAGGTCAT ACGATATTTG 
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18851 CTTCGAATGA CGCTACGGAA AATCTGGTAA TCTGAAAGAT GACCTGATTT 

18901 TGTAAACGTC ATAAATACGG ATACAGCGAG TCTATCAACG TTTGGTTTTA 

18951 AGCTGCAGAG ATTATCAGAG AGTGCTGATG GCAACATGGG AATGACTTTC 

19001 CCTGGGAAAT ATGTAGAGTT ACAGCGTTTA GCAGCTTCTT TGTCTAGGTG 

19051 AGAATGTGGG GTAACGTAGT GGGAGACGTC TGCGATGTGT ACACCAAGAA 

19101 TGTAATTGTT ATTATGATCG TAGGTGAGGG AGATGGCATC GTCGAAGTCT 

19151 CTGGCTGTGG AAGAGTCTAT GGTGAAACAG AGGAGATCAC GGAGATCTTT 

19201 GCGAGAGTGG AGAACTTGGG TAATGTGTTT TTGAGAGAAA AGGCTTGCTT 

19251 CTTCAATGAC CTCTGGGGGG AATTCTTCGG CAAGGTTATA TTCGGCTTGA 

19301 ATTGCCTGAA AGTCCGCTTT AGCGTTGGTG ATGTGGCCAA TAAATTCGAG 

19351 CATTTGTAAG GCTGGAGAGG CTCCTTCTTG GGGTTTATCT ACCCAGGGAG 

19401 GAGTGCTCAG AAGAATGCGA TCGCCGATTT TGTAAGTGCG TCCGGGAAGG 

19451 AGTTCTACTG GAATTAAAGA TTGGGATCCC GACATGCTTG TGTAGGCAAG 

19 501 TGCTGATGTG GGACTGACTA GTGAGGTGAT CGTTCCTACG AGTGTTGTTT 

19 551 TTCCTCTTGC GAGTACTTCG CTGATAGTGC CTTTGAGTTT TTGTCCGTCT 

19601 CTTGGATAGG GAAGCACGGA GACAATCACG TGGTCACCAT CTAGAGCCCC 

19651 GCGTAAATCT CGGGCGGGAA CAAAAATATC AAATGGGTAT TCTTCGGGGT 

197 01 TGTCGGGAGA AACAAAACCG AAACCTTTTC TAGCATGAAC AAATAGGGTT 
19751 CCTGGAATAA AAATCTTCAA GGATTTACCG TATGTTCTTC TCCCTGGTTT 

198 01 TCTTTTTGGT TTTTTCAACA ATTGGGCTCC GCCTGTAAGT TTAGGACATT 
19851 GAACGAGAAA TCCCGTAGTT TGACTCGTGA ACTGAGTGGC TCAACAAAAT 
19901 TTTCCCTTTT AAGTGGGTTT TGTGATTATG AAAGGAGCAG TCTCAAATTC 
19951 AAAGCCAATT "GTACTAAAGA AGCTCTGTTA TTGCAACTCC TTGATCAGAG 
20001 AATAAGAGAA TGGAACTGTT TTCTGTTTAT AAGGAAGTTT CCCATCCTCT 
20051 TATGAGGATG GGAATAGAGA AACTTAAATT GAAAATTTTG ATTACTTATC 
20101 GTCGTTATCA ATAATTTCTA CATCAGCTTC TTCGATATGG TCTTCTGAAG 
20151 AACCGTTATT TGAAGGAGGC TTCGTACTGA AACTATGTTT TTTCAAATCT 
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20201 TCTGTATTGA TGTTAGGTCC ACCTTTAGCA TTGGCTGCCG ATGATGCTGC 

20251 TGCTGATGCA GACTGCGATT GCATAGACTC TCCAATTTTT TGCATATGCT 

20301 TGCTTAGGTC TTCAGTAACC TCTTTAATTT TTTCAATAGG AGCGTCATCT 

20351 TTGAGTGCGT TGCGCACGTT TTCGATTCGC TCTTCGATTT CTTTAACTAA 

20401 AGTTTCAGGA ATTTGCTCCT TATAATCTTT AATAGCTTTT TCGGCTCTGA 

20451 AGATCATGCT ATCGGCTTCA TTTTTAGCAT CTGAAGCTTC ACGACGTTTT 

20501 TTATCTTCTT CCTTATTAAT TTCGGCATCT CGAACCATTC TTTGGATTTC 

20551 ATCTTCTTGA AGTCCTGAGC TTGCTTCGAT ACGAATTTTC TGTTCTTTAC 

20601 CGCTGGCAAC ATCTTTAGCT GAGACATGGA AAATTCCGTT TGCATCGATA 

20651 TCGAAGGAGA CTTCGATTTG' AGGATGGCCT CGAGGAGCCG GAGGGATATC 

20701 TGTAAGATCG AATCTTCCGA TTTCCTTGTT ATCTTTGGCC ATGGGACGCT 

20751 CTCCTTGGAG AACTACGATG GTAACCGCAG CTGGTTATCA GCAGCTGTGG 

20801 AGAAGATTTG TTTTTTCTGT GTAGGGATTG TAGTATTTCT CTCTACCAGA 

20851 GTCGTCATGA CGCCTCCTAG AGTTTCGATA CCCAGAGATA GGGGGATAAC 

20901 GTCTAGAAGT AGAACATCCT TAACTTCTCC GCCAAGAACA CCACCTTGAA 

20951 TTGCGGCTCC AATAGCAACA ACTTCGTCGG GGTTGACTCC TTTATTAGGC 

21001 TCTTTGCCGA AGAGTTCTTT TACAGTTTCT TGCACTGCGG GCATTCTTGA 

21051 CATACCTCCA ACTAAGAGAA CATCATCGAT ATCCTTAGCG GAAAGTTTTG 

21101 CGTCACTGAG TGCTTTGATG CATGGAGATT TTGTTCTTTC GATTAGAGAG 

21151 GCTGCGAGTT TCTCGAATTG CGCACGTGTG AGTGTCAATG CAAGGTGTTT 

21201 AGGTCCTTGT GCATCCATTG TGATGAATGG CTGATTGATT TCTGTGGAAG 

21251 AGACTCCTGA AAGTTCTATT TTTGCTTTCT CAGCAGCATC TTTAAGTCTT 

21301 TGTAAGGCCA TATTATCTTT GCTAAGATCA ATGCCTTCTT GTTTTTTGAA 

21351 TTCTTCGATC ATCCATTTGA TAATGACTTC ATCAAAGTCG TCTCCACCGA 

214 01 GGAGAGTATC TCCATTTGTA GATAGAACTT CGAAGACGCC ATCACCGATT 

214 51 TCTAGGATGG AGATATCAAA AGTTCCTCCA CCAAGGTCGA AG AC AG CG AT 

21501 TTTTTTATCA CCGACTTTAT CGATTCCGTA GGCAAGAGCT GCTGCGGTAG 
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21551 GTTCTGGAAT GATACGTTTT ACATCTAGAC CTGCAATGCG TCCAGCATCT 

21601 TTTGTGGATG CTCGTTGAGA ATCATTGAAG TATGCGGGGA CGGTGATCAC 

21651 TGCTTCTGTG ACAGTTTCGC CTAGATAAGC ATCAGCTGTC TCTTTCATTT 

21701 TCATTAAGAT TTGTGCGCCA ATTTCTTCTG GAGTGTATTG TTTGCCATCA 

21751 ACTTCGAAAA CGGCATCACC TTTAGATCCG GAGGTGACTG TATAAGGAAC 

21801 GGTTTGGATT TCCGAAGCTA CTTCAGAGTA CTTACGGCCA ATAAAGCGTT 

21851 TTGTAGAGCC GAGAGTTTTT TCTGGATTTG TCACTGCTTG ACGTTTTGCT 

21901 GGAATCCCCA CTAATTTCTC ATTACCTTTG AAGGCAACGA TCGATGGCGT 

21951 GGTTCTTGTT CCTTCGGATG ATGTAATTAC TTTAGCTTGT CCTCCTTCCA 

22001 TAACAGATAC GCAGGAGTTT GTTGTGCCTA AGTCTATACC TATAATTTTG 

22051 CTTGATTTTT TGTGTTCACT CATGTTTGGT ACCTAATCTC TAGGGGTTAT 

22101 TTCTATTCTT TATTTTCTTT GGGAGTAGGA GCTTTAGCGA CTTTAACTTT 

22151 AGCTACCCGA ATCGGGCGTT CTCCTATTTT ATATCCCTTT GCAAACTCTT 

22201 CTAAAATCGT CCCCTCAGGA ACTTCAGAAG TCTCTTCTGT TTGCACCGCT 

22251 TCGTGTAGGA AGGGGTTAAA CTTTTGGCCT ATTGAAGAAT ATTCAATAAT 

22301 ACCTTTTTCC TCGAAGATTT GTTTGAATTG GTTGAGAATC ATGTTGAATC 

22351 CGAGGGCCCA ATTTTTTACA TCGTCGGACA TTTGTGTAGC AAATCCGAGG 

224 01 GCTTTCTCCA TGCTTTCTAT GGGATTGAGA AAGTCTATTA AAGTATTTTC 

22451 TAAAGCATAC TGCATAAGTT CTTGGCGTTC TTTTTGTAAG CGTTTTCTAG 

22 501 AATTCTCAGA TTCTGCTAGA GCCATGAGAT ACTTATCGTT TTTTTCTTTT 

22551 AATTCGGTTT TTAGGGTGAC GATTTCCTGT TGCAAATGTT CAACTTCATT 

22 601 TTCGTTTTGA ACATTGCTTT CGTGTTGTTC CTCATTTTCA GGTGGGGTAT 

22651 CTGTCATAAC GTCTCCTTAG AGGGTAATAG TTTTATAGAA GAGTACTCCG 

22701 TTCTTAAAAT AGGCTCATTC GAAAGCTTAC AGTTAGAGGT GAGTGGTCTT 

22751 CTGAAGGATA GTTTAAATTT GTAGAAACTT TGTGTCAGGG TTTCATTTAT 

22801 TTTATTCGCA AATAGTTTGA GCAAAGGAAG AGCTTCCTTA TAAGGAAGAT 

22851 TGATCGGGCC TAGGATACCT AAAGCTCCGA GTGGAGAGCG ATTCATATAA 
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22901 TAGGGAATAG TAATTACAGA ACATCCTGGA TTCGAGGTCC CTAAAATATC 

22951 AGAAAGCTCC TTCCCTATGA ACGCTGTAGC TCTTCCTTTA TGCATTCCTA 

23001 TATTTAGAAG CTCACACATT TGTCTGCGAT TTTCAAAAAG AGAGAGTCCT 

23051 AGAGCTAGAA CTTCAGGATC TTTAAACGCT TCGTATTTCA GTAGTTTCGA 

23101 CATTCCTGTT TGATAGAGAT CTTCTTCACT AAAGTTGCAG TAGCGTGTTA 

23151 GATAGCGGAC AACCACCTCA TTATAGAGGG ACATGCTCAG GTGTTCTTCT 

23201 TTTTTCGAAA GTTCCTCATT TGTGGGGAGC TTTCGGATGT AGTTCTGCAG 

23251 GAATTTTTCT ATACGTTTGA TAGAAAGAGT ATCGCAAGCT TCAGGCAGCC 

23301 ATAGGGTGTC TGTGAAGATC TGACCAAACT CCGTAGAGAG GATGGTGACA 

233 51 GCTCTTTGCT TATCGACCTG TGTAATTTGA ATATTGGTTA CGGAATCATT 

23401 TTCAAAGCGT GGGGAAGAAA AAAACGTAGG CAGGTCTAGG ATTTCTCCAA 

23451 GAAGTTCCGT AGCTTTTTGT AGATCCTTGA TAATATTGCG ACTTTCGCTA 

23501 GGAAGCTGAC TGATCTTATC AAAAATGGGG GCAGAAATCT CAGCTTCTGG 

23551 GCATTCTTCT TGGTGATCTA CATAGTGACG TAATGCTAGG TCTGTAGGGA 

23601 TTCTTCCTCC GGAAGTATGA TTTTTTTTTA AGAATCCTTC AGCTTCAAGT 

23 651 TCTGCAAAGT AATTTCTTAT AGTTGCCGTA CTCAAATCAG AGCAAAAACT 

23701 TTCCTTTAAA GTTTTAGACC CTACAGGCTG CCCTGTTTTT AGGTACAACT 

23751 CTGTTGTAGC AAACAGGATA TCAAGGATTT TTGAATCTCG CTTTGAGACT 

23801 TTGGATCTAG CCATCTCGAG TCCTACTAGA ACAATCGTAA CTGAGAGCAT 

23851 TATAGGAAAA GAACCCGAGA AGGTCAAGAG AATTTAGCAC TCGAATTGAA 

23901 TGAATGCTAA CTTTTTTACG AGGAGGGCAG CGATCAAAGA GCTAGGCTAA 

23951 GTGATTCTGA CACCAAGTAG GGAAGGCCTC CGGGGAGACT GTATACTTTT 

24001 CTCCAGATCG GGATTCAATT TCGAATATTC CCGAAGATTG GTAGGACTTT 

24051 CCTAAAATAA GCTTATAAGG AATGCCGATA AGGTCACTGT CTTTAAGTTT 

24101 AAATCCGAGT CTTTCATCTC GATCATCAAG AAGGGGCTCA TAGCCTTGAC 

24151 TTTGTAGCTC ATGATAAATA GTTTCCGCAA GCTCTTGAGA TACAGTGTCT 

24201 CCTCCGTTAA AGGCGATAGT GATAGAGAAG GGAGCGAGTG CTTTTGGCCA 
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24251 AACAATACCA CGGTCGTCGG CAAGCTGTTC TACACAAGCG GCTAATGTTC 

24301 TTCCGACTCC AATGCCGTAG GTCCCCATCC AGCACTGCTG GGTTTGCCCG 

243 51 TGTTCATCTT GGAAGTTTAC CTCAAAACTA TCGGTATAGC GTGTCCCGAG 

24401 ATTGAAAATA TGAGCAACTT CTATGCCTTG ATAAATGCGG TAAGGATGGC 

24451 CAGGATTTTC AGGACATGTG TCTCCCTCTT CAGCGAGTAG AAAGTCACCG 

24 501 TATTGGGGGG GGAGGAGGTC GCGATCCCAG TTTACATTTA CGTAGTGCTT 

24551 ATCTTTAGCA TTGCCCGCAC AAACAAAGTT CGTCATTGGG GACGTTGTTT 

24601 CGTCTGCGAA AAAGTCTATG GGACAGTTTA GGGGACCGAT GAATCCTTTT 

24651 TCTGTGCCTA GAACGCGTTC GATTTCTTCA TCAGAAGCTA GAGCAATATC 

24701 ATCGGCATTC AGTTTGGAAG CGACCTTCAC TAGGTTGACT TGCCGATCTC 

24751 CTCTCATTCC AATGGCAATG AATTTTTCTT CATTTGAGTA GGAGAGTTTT 

24801 ACGACAAGGG TTTTTAAAAT TTTATGTAAG GGGATAGAGA AGAAGTTTGC 

24 851 TAGAGCTTCT ATTGTTGTAA TCCCAGGGGT GGCCACTTCT TCGACGGGAA 

24 901 GAAACTCGCG ATCGTAGGCA TGCTGTGGAG GAATGGAGAC AGCAGCCTCA 

24951 ATATTAGCTC CATAGGAACC GCTGACGCAG ATCGTGTCCT CGCCTAGAGA 

25001 GCAAAGGACC TGAAATTCCT CAGACTTTCC TTTGCCGATT TTCCCTCCAT 

25051 CAGCTGTAAC GATGACATAG GCAAGACCGA GACGATCAAA GATCTTACTA 

25101 TACGCAGAGC GGAGTTTTTC ATATTGCTCG TTCATTTGTT CGGGAGAGTC 

25151 TGAGAAGGTA TAGCTGTCTT CCATAAGGAG CTCTCGAGAG CGAATGAGAC 

25201 CGAATCGAGG GCGAATCTCG TCTCGGAATT TTGTAGCAAT TTGGTAAAGG 

2 5251 TGGAGAGGAA GTTGTCTTTT TGAGGAGAGC CATTGTGCAA CAAAAGAGCA 

25301 GATGACCTCT TCATGTGTAG GAGCTAGGCA ATGAGATTTT CCTTCGCGGT 

25351 CTTTGAGAGT GTAGAGCAGT CCTTCCGAAG TAAATGCCTC CCATCTCCCT 

25401 GTATGTTGCC AAAGTTCAGC ATTGTGGAGA AGTGGGAGTA GAAGTTCTTG 

25451 ACCTCCAATC GCATTAAGTT CCTCTCTAAT GATGTTCATC ATCTTGGAGA 

25501 CCACGCGCCA TAACAGGGGT GTATAGGTAT AGACTCCTTT ACTTACTTTA 

25551 AATAGGTATC CTGCCTTTTC TAGGAGCT CG TTTGAGAGCA CAGCAGCGCT 
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2 5601 TTTATTTGCA TTTTTTGAAG TCTTATAAAA GAGTTGAGAC GTTTTCATAG 

25651 AGTGGGGCTG TTATCTTCAG TGATTGATCG TCAAGAGTCT ATAGAAAAAA 

25701 TTACAGGCTA TTGCGATTTT ATATCGAGAG TCTTTTAAGC TAGTAGATTT 

25751 AGTTTAGCTA TGAAGGAGGT ATCTTAACAC AGGGATTTTT CTAGATCAAT 

25801 GTTTTGCGTT GGGTTCTTAA CTTAACAATT GGGGTCAGGA TTATGTATTA 

25851 TTTTAATTAT ATTTTTGTTT TTATTTAGAA ATTTAATAAT CTCAACTTAT 

25901 AATTAAGTGT ATACTTATTA ATCTTTTATT TTTGTAATTG TAGTACTATG 

25951 TCGTCAGTAA ATCAAAGCTC TGGAACCCCG AATCCAGAAG AGGTAACTTC 

2 6001 TCCTGAATCT ACGGAAGAAA ACAAAAATGT TGTTTCTTCA GATGAGGCGC 

2 6051 AAGCCACGCA TGCTGTGGCT CTTCCTATAG TCACTCAACT TTCTCTTCCT 

2 6101 GAAGGTGTGG GGACCTCATC TGAAGAAACG GCGAGTAATC CGAGGGTAGA 

26151 CGAGATTGTA GCTGAAGTTT CTTCGAGTCG GGCGGTTGCT GATCAGATCT 

26201 CATCACTTGT AGAGCGTGTT GGAGAGCTTT TAGACGACCT TAAGGGTGCC 

26251 CAGTCCCTTT TCACTAGCTT TCAGTCAGAG TTGAAAAACT GTCTTCCGGC 

26301 ATGGAAATCT TCAACGAGAA GACTCGAAAC TCGAGGTGCT GGGGATAATG 

2 6351 CGGATATAGC GAGGCTGGAA TTATTTCGTA GCGATTACGA GGCTGTCTTA 

2 6401 GGCCATGCGA ACCAGTTTCA TGGGAAGGCT CATCTCATTT TAAGTAAGTT 

2 6451 AACAGATGTA CATCACAAGC TACAGGGACT CAGTCGTGAA GATCTTTCCC 

2 6501 TGGCGTTTGA CAATAATGAT AGGGTTCTTG AGCATCTGGG TTCGTTAGGG 

2 6551 CTTGATGTAG ACGCTGAAGG TAATTGGTCT CTTTCTTGTG AGAGGGGGAT 

2 6601 TCCGCGACTG GTGCTTACTG CTGACAGTAT GCTTGTCCAG ATCAAGAAAG 

26651 TGAATCTACC TACTGTAGAA GAATTGCGGA CTCTTCAGGG AACAACGGAA 

26701 TCTTCGTCTG ATCCTAGGGT TGAAGAAAGT TTGTCTTGCT GTGAAAGATT 

26751 GCTCAATGAA TTACGTCGTC TTTGGGCGAA TTTTGTAGGT TTTATTTCGA 

26801 GTTGCTATGA CAACATCGTG TTTGTTTTGA TGTGGATAGT GAGACGGATT 

26851 AACCTTTTGC CTGGGCTGGG GTGTTTGCCT TTCCATAATC CCGATGCTTC 

26901 TCAAGAAGAC CAGAGGTCTT CTTCCGGAGA GCGTTCTACA AGGAGAGAAC 
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26951 GCCTTTCTCG GCGATCTGAC TTATCTGAGG AAGAGATGAT TGTGAGAGCT 

27001 GAGGGAGAGT CTATACATCC TGAATCTCCC CATGGAGATG GCCGTAACCA 

27051 ACCTAGTCGA GGTGATAAGC AAGACTCTGA TAGTGAGGAA GAGACGGAGT 

27101 TATAATAAAG AATGATCTCT TCTGAGTAGA AACTATCGAA TCAGAATTGC 

27151 CAGCTTTTAC TCTCACCCAG TCCTATTTTT ATAGTTCAAA TTGCGGTGGT 

27201 TTGTTTATTG TTTTTTTGAA TTTTTGTTAT ATGGTGGCGT CTAATATTGC 

27251 CGATTAAGGA GGGCGCCCTT TATGAATAGA AGAAAAGCAA GATGGGTAGT 

27301 GGCATTGTTC GCAATGACGG CGCTCATTTC TGTTGGGTGT TGTCCTTGGT 

273 51 CACAAGCGAA ATCAAGATGT TCTATTGATA AGTATATTCC TGTAGTCAAT 

274 01 CGTTTACTAG AAGTTTGTGG ACTTCCTGAA GCTGAGAATG TTGAGGATTT 
274 51 AATCGAGTCC TCGTCTGCTT GGGTACTGAC TCCTGAAGAA CGTTTTTCTG 
27501 GAGAGTTAGT CTCTATCTGT CAGGTTAAAG ATGAGCATGC TTTCTATAAC 
27551 GATTTGTCTT TATTACATAT GACTCAGGCT GTGCCTTCGT ATTCTGCAAC 
27601 GTATGATTGT GCTGTAGTTT TTGGCGGGCC TTTGCCAGCG CTACGTCAGC 
27651 GCTTAGATTT TTTGGTGCGA GAGTGGCAGC GTGGCGTGCG CTTTAAGAAA 
277 01 ATCGTTTTTC TATGTGGAGA GCGAGGGCGC TATCAGTCTA TTGAAGAACA 
27751 AGAGCATTTC TTTGATTCTC GGTACAATCC TTTCCCTACT GAAGAGAACT 
27801 GGGAATCTGG TAACCGAGTT ACTCCCTCTT CTGAAGAAGA GATTGCCAAA 
27851 TTTGTTTGGA TGCAAATGCT TTTACCTAGA GCATGGCGAG ATAGTACTTC 
27901 AGGAGTCAGA GTGACATTTC TTCTAGCAAA GCCAGAGGAA AATCGTGTGG 
27951 TTGCGAATCG TAAGGACACC TTACTTTTAT TCCGTTCTTA TCAAGAAGCG 
28001 TTTCCGGGAC GCGTGTTATT TGTAAGTAGT CAACCCTTTA TCGGTTTAGA 
28051 TGCTTGCAGG 'GTCGGGCAGT TTTTCAAAGG GGAAAGCTAT GATCTTGCTG 
28101 GACCTGGATT TGCTCAAGGA GTCTTGAAGT ATCATTGGGC TCCAAGGATT 
28151 TGTCTACATA CTTTAGCGGA ATGGTTAAAG GAAACGAACG GCTGCTTAAA 
28201 TATTTCAGAG GGTTGTTTTG GATGATfCAT GGATCTTAGA GGTTAAAGTC 
28251 ACTCCAAAAG CCAAAGAGAA CAAAATTGTA GGCTTTGATG GACAAGCTTT 
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28301 


GAAGGTCCGT GTTACCGAAC CCCCAGAAAA GGGTAAGGCC 


AATGATGCTG 


28351 


TAATTTCTTT 


ATTAGCAAAA GCTTTATCCT TACCGAAGCG 


TGATGTCACT 


28401 


TTAATTGCAG 


GAGAAACTTC TCGAAAGAAA AAGTTTCTTC 


TTCCTAACAG 


28451 


AGTTCAAGAC 


ATTATTTTTT CTTTGCATAT AGACGTATAG 


CCTAACTCAC 


28501 


AGCTACGTTT 


TTCCCTTTCT CAGATTTTTC CAATTTTTTG 


CTTTTAAAAG 


28551 


ATAAGAACTG 


GTTGCGTTCT GTTTTATGAA GCTTGATTCC TAAGTACTTG 


28601 


ATGATATCTT 


CATTAAAGGT GGTTGTAGGT GACAGGCGTT 


GAGCAATGAT 


28651 


TTTACGCAGG 


CTGTCCACAT CGTGATTGTT ATAGAGTAGG 


TGGTGCACAA 


28701 


TTTTTGCGAT 


TTGTTTTCCT GATTTTTTGT AATCCACGCT 


ACAGGCAATG 


28751 


CAGGCTCCTT 


CGGAAATTAA GGAGGTATCG TCGGTAATGA 


TAGGGATTTT 


28801 


CTCTTTGAGG 


ATTTCCTGAA GGAATGCGGT GCCTTCTTTA 


TGAGAAAGTG 


28851 


GGGAGAGGGG 


AATGAAGATA GCTGAGGGGC GCTTGTCGAT 


AGCCTGGCGT 


28901 


ATCCGGGTTT 


TGAATGTACT GCTTGTAATA GAGATCTCAA 


TGACCTCAAT 


28951 


TCCTGAAGCA 


TGGAGTTTCT TAACAATTTC TTTTTGGAGA 


TCTGAGGGGA 


29001 


AAGGTTCGGA 


GGGTTTTAAA TACACGATAG ATTGTGCATT 


GGTAGCTACG 


29051 


GCTTGTATAG 


CAAAGCAGTA TTGATTGATG TCTAGAGTGT 


CATTCACTCC 


29101 


GTAGATATTC 


ATTGTGTTTT TAGGAAGGGT TAGGCTTTCG 


CGATCAGGAA 


29151 


CAGCGGCATA 


GATCACAGGT TTCTGTGTTT CAATGTGGCT 


CATGACCTTC 


29201 


GTAGCAATAG 


TTCCTAAGGT GACAATCGCC ACGACATTTT 


TATCGGTATG 


29251 


TAAGGAGCGA 


GCAATTTTCC TAGCCTTTAC GATACTGTCT 


TCAGCATTTA 


29301 


GGACAACAAT 


TTCAGGAAGG TTCTCAAAAT CTTTCAAGGT 


TTCTATACAG 


29351 


CTTTTACTGC 


AATCTTCTAA TAGGGGATGG GGAAAGGATA AGAAAATTGC 


29401 


GATTTTAGGA 


GAGGAGACGC TATCTGGT7G AGAACCACAA 


GTGGCTACAT 


29451 


AGATGAAAGA 


GCAAAACAGA GAAAAGAAAA ATAAGTACTG 


AGAGAGTTTA 


29501 


CGTAGAATTG 


TCATGCAAGA ACCATCGTTT CTTTTAGTGA 


AGCGGTACAG 


29551 


AGGCGGTCAC 


AGGCTAAAGC GATATTTTGT GGTTGTGTCA 


GAGCGGAGAA 


29601 


ACGAACAAAT 


CCTTGTCCAC AGGAACCAAA ACCGTGGCCG 


GGAGTCACTG 
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29701 CCTTCAGGGA GTTCTACCCA AAGGTAAGGG 

29751 TGAGAATCCT GCAGTTTCTA AGCTTTTTTT 

29801 GATATAAAGA GATGGCGGGA GGTGTCGGAA 

29851 CCTGCTTCTT GCATGAGGAG AGATGCTCCG 

29901 CCGTTTCCAA TCGTTGATCA TAGGTTCGTT 

29951 TAGGGATCAC GTTCCAGGCA AGGCGCATGC 

3 0001 GAGAAAGAGT TGATTTCTAT AGCACAATAT 

30051 GATGCTTTTA GGTAGGCTAG GATCTGAGAC 

30101 CAAAAATAAG AACGGTTCCG TGCTGATTCG 

30151 AGTTGTTGAA AGGTTAGAAC TGTTCCTGTG 

30201 ACAAAGAATG TCTAGGGATT GTTGGTTCGG 

30251 TTTCTTTTCT GCATGCTAGG GGGATAATGT 

30301 GCAATGTCTC TATAAGCTGG ATAGACAGGA 

30351 TTCTGAGCCA AAAAAAGAAA AGAGACGGAA 

3 0401 CCGAAATAAA AATCTCTTCA GGGGAGATTC 

30451 GCAATTTTTG TGCGTAATTT TTCTAATCCG 

30501 ATAGGTCTCT TGTTTCTCTT GAGAAACGCA 

30551 TAATAGAGCG GCAGAGAGGT TGTGTCGTAT 

30601 ACAGAGATTT CTGGATTCTC CTTGCGAAAC 

30651 TTCAGAAAAT AGATACTGAG GCTTGAGAAG 

30701 GCATAGCGTG CTCCAGGTAA GGGCGTGATT 

30751 AAAGCCCCAG CTTTTTGGAT AGCCATTTTC 

3 0801 CTATAAAGGC AAGGAGTGCT CTGTAGTTTC 

3 0851 TTATAATTCT GTTTATTAAA AGTTTTTTAA 

30901 TATAATTAAA GGATAAGATT TGAAAAAATT 

3 0951 GGTCAATCCT TATGGAAGTT ATAGGGGTAG 
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GCATGATCGC CACCATGAAC 
AAGTTTCTGA GCATTGGTTA 
ATAAATCTAG GCCGTAATAC 
TTAAATGTAG TCGCAAAGAG 
ATTGTCATAG GTGAGTTCTT 
CAGTAAAGCC TAATGATTTA 
TTTGCTTCAG GGATTTCGAA 
AAAGGCGCTA TAGGCCGCAT 
CGTAGTTCAC AAGTGCTTGG 
GGGTTGTTAG GATAGCATAG 
AAGTTCTGGA ATAAACCCAG 
CGCGGATTCC TGTAATGTGG 
TCCTGTAGAC CTAGAGTCTT 
GATATCAGGT TTGGCACCAT 
TATTTTCATA GACTTCAGAG 
GTTTCTGGGC CGTACCCACG 
GAACTCTTTG ATTGCCTGAG 
CTCCGATAGA AAGATCTATG 
TGAGCAAGCT TTTTACTAAT 
AGAAAAGTGG GGATTTCTAC 
CACTTGGAAG ACATCTTAAG 
TGATTTTTCT TTAACCGCCT 
GGTTAATTAT TAAGATATAT 
ATCTTTTCTA ATTTGCTCAC 
TTTTAGGTAA TTATGATAAG 
GAATCCTTCT CCAGAAGATG 
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31001 GGAAAAAGGA TGTACCCCTT TCAGGGAACT CTCGCTTGCA TCGTCGTGGT 

31051 GGGATTCGTA GAAAGCATAA GAGTGCTTCA GTTGGGGTGA CCTCGGGTTC 

31101 TAAGACGGGG AAAGCTTCTT TAGAGAAGAA GGTCAAAGGC ATTTCAGAAG 

31151 CCCATTTCAA ATAATCCAAG ACAGAAGGTT CTCATTCGAA GACAAGCAAA 

31201 GGATTCGTTG GCAGATTTGT TCAATGGATT AGAACGTTTA CAGGACGTGG 

31251 AAGCAAGAAG CGTTCTCCCT CAAGTTTTTC TCCAACGCAC CCTTACATAC 

313 01 GTTTGCGAAC TTACACACGC AGTCCAAAAC AGAGTGGTGT AGAGAGAAAA 

313 51 CAAGAAGATG CTGAGACCTC ATTTATAGAG ACACCCAAAG GGATCTTGAA 
31401 AAAGCCTGGA AACAAAGACC CCAAAGGCAA GCACGTCCAT TGGAAAGACA 

314 51 GCTAATCCGG ATCAGAGCTG AGATCCCTCC TTCTCTTCAT TAAGAGGGTT 
31501 CAGAGTTTTA GGTCTCTGCG GTAGAAGGGA GGGAATCCAT AAGGAGAAGA 
31551 CCAGTGGATT TTCAACGGAG ATTCTAGAAG GAAACAAGCT GATTCTACAT 
31601 GGAGATGTTT GTAGGTCTGT TGTGAATTCC TCTCTTCACA GATCGGAAAG 
31651 ATCAATGAAA AGAGTTAGGG TCTCTTAGTT TTAAGAATTG CTATACAATA 
31701 TTGCGAAATA AATACTTTCT TCTCTATACA CTGTCTTTAC GATGAAGACA 
31751 GCTTTTCACT CTTGCTATTC TTGGTTTTGT TGGCTCTTTA GCTTCTTGGT 
31801 ACTCTTTGTG GGTGGCATCG CTGGGGGAGA GCCTTTGTGC CCCGATTGCA . 
31851 AATACGAAAC TAAGTCTGTT TTACGTTCGG ATCAGCTGCC GGATCATCTC 
31901 TGGAACTATG AAAACGACTG TTATCTTACA GG TTATGTGC AGTCTCTTTT 
31951 GGACATGCAT TTTTTAGATA GCCGT ACGCA AGTTGTTATT GAGAAGAATA 
32001 GAGCGTATCT TTTCTCTTTG CCTGTAGATT CGAGTTTATC AGAAGCCATT 
32051 ACCAACTTTG TTAGGGATCT TCCCTTCATA TGTGCTGTGG AGATTTGCGA 
32101 GCGTCCTTAT GGTGAATGCA TAACGAGATC TTCTGCGGAG CGTCCCTTAC 
32151 TCCCTAAAGA GAAAACTTTA GGAATGCCAA TTTTCTGCGG CAAAGAAGGG 
32201 GTATGGTTAC CTCAAAATAC CATTTTGTTT TCTCCTTTGA TTGCAGATCC 
32251 TCGTCAGGTT ACCAACAGTG CTGGCATTCG TTTTAATGAG AAGGTCGTGG 
32301 GGAATCGGGT AGGTGCTACC ATCTTTGGGG GAGATTTTAT TCTCCTGCGT 
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32351 


CTTTTTGATG 


TTTCTLVjAI I 


PP ATP.T A P. A T 


TPTP ATTTPG 


GAATTCAAGG 


32401 


AGGAGTCTTC 


TCAGTTTI iv? 


21 TTT A P. A TP A 


TPPTP. 24 ATP G 

iLLi IV.VJ 


TGCATGGTAA 


32451 


ATTCAGATTT 


CTTTb 1 i oCC 




PAGGGGCTAT 


AGATAAATGG 


32501 


AG TTTT AG G T 


II CvsA 1 i v» i\3 


pp a p ptptpp 


TPPPATTTAG 


GAGATGAGTT 


32551 


TATTCTTACG 


CATCCAAA 1 1 


ICCCAAoAl I 


TAATTTGAGT 
1 nn ill Va/ivj 1 


GATGAGGGCG 


32601 


TCGATCTCTT 


CATTTCGTTT 


CGTTACACAC 


CACAGATCCG 


CTTGTATGGC 


32651 


GGCTGCGGTT 


ATATTGTAAG 


TAGGGATCTT 


ACTTTTCCTG 


AGCGGCCGTT 


32701 


TTACTGTGAA 


TGGGGTGCGG 


AACTCAGACC 


TTTTGGTCTG 


AGAGAAGGAA 


32751 


ATCTCCACGC 


ACAACCGATT 


TTCGCGATGC 


ATTTCCGTTG 


TTGGGAAGAA 


32801 


CAGAAATTTG 


GCTTGGATCA AAGCTATATT TTAGGCATGG AGTGGGCCAA 


32851 


ATTTCAAGAA 


ATCGGAAGGA 


AAATCCGTGC 


TGTTTTAGAA 


TATCATCAGG 


32901 


GATTTTCTAA 


AGAAGGCCAA 


TTCATTCGTG 


AACCGTGTAA 


TTACTACGGT 


32951 


TTCCGTCTTA 


CCTATGGATT 


CTAAACGATA 


AcAAAACC A I 


PTTP &PPP AP 


33001 


AGGGTGTTCA 


GGTCCTATGG 


GCAGCGTATG 


ATTGAGATAT 


CTGCGGAAGA 


33051 


TTGCAATGCC 


TGCCTTTGCT 


GAGGATAGGC 


AGAATAGGCA 


GTTGTGTACC 


33101 


CATTCAGAAC 


CTCGAATGGT 


TCCTACAGCA 


TGATTGGAAT 


TATACAAAGC 


33151 


TGTGATCTTG 


GATTTCCAAT 


AGTCTACAGG 


TCCGATTAGG 


AAGACGGGAA 


33201 


CAAGAGCTTT 


TTTCCCTGTT 


TTGAGACTAA 


TAAGCTCCAG 


AAGGAGTTCG 


33251 


AAATCGGTTC 


CCATGCCTCC 


GATAACAAAT 


ACAGCAAGGT 


CGACATGGAA 


33301 


GTCGGCCTGA 


CGTTCTAAAA 


GATCAGGAAT 


AGCATAGCTC 


ATTTTAGCTT 


33351 


CTACATAGGC 


ATTCGTGGTA 


TGCAAGCTAA 


TTAGATTCCC 


ACAAGAGAGT 


33401 


ATGGAGAGTT 


CTGTAGCTAC 


ACGATTCGCG 


AGTTCCATAG 


CTCCAGAACC 


33451 


CCCTCCTGTA 


AGGATTGCTA 


ACGGTGTCTG 


TGGTGGAAAT 


TCTGGGATCG 


33501 


TGAATTGCTG 


AGAAAGAGTA 


TGCATTCCTG 


TCAGGAGCTC 


ACGGAGAAAC 


33551 


TCATCATAAT 


CCCCAGCGAT 


TAGGCAGGAA 


CCATGAATTC 


CTATAAAGTA 


33601 


GGATTGAGCA 


AACTGTTCAG 


CTTGATGTTT 


AGGGACAAAC 


ATGCCCACAT 


33651 


CTTTATTTCT 


GCGTTTGATG 


TATTGTAAGA 


GTCGTTTCGA 


TTCTAAGTCT 
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33701 GCCCAAAATA CAGAAATTCC TGCAAAATAT AGATCGAGAA GG AAAGAGCG 

33751 ATCTCGATTC GAGAAAAACT CTCCAGAAGT GGGAGAGGGA ATCTGAAAAT 

33 801 AGATATGTTG CAGGTAATAG CGAGAGTAGT TAGAGAGGAA CATGCCCTTC 

33851 AGCGATGCTG AAGGGAAGTA GCGGGAAAAT AAAACTCCTT GGCTTGTGAT 

33901 ATGATCTGTT TCCATGGCTT TTAAAAAAGG GAAACAAGGT TGGTCTTCAA 

33951 TGTGCTTTTG AATTTCCCTA GCATGTCTTT CATCTGATGG GGAGATTCGA 

34001 GGTTTGATGA TCCAAGAGTC TTGGGAGAGC TCAAGCAGCT CACTACCTTT 

34051 GGAGATAAAC ATCGCAGCTT GATCTTCGCC TTCCGGTATG GATTCAAAAA 

34101 CACGAAATAC CTCTTGAGGA GATTCTAAGG TTTCCTGGAG CATATCTCTA 

34151 TAGAAGAAAA ACGAATGCTC TTTGTAAGGC TCAAGAGTAA AAAATTCTAA 

34201 AGGTATTCTC TCAATAGGTT CTGAAGTGCT GCCGTAGAAT TCATAAATAT 

34251 CTCCAGATTC TTGTGTGGTA GGTTCGAGAA TATCCGCTGC GGTGTGACGA 

34301 AGCCCTTGGG GGAGTAAGTC CTGAACGACT CTTGCAAATA CGGTTCGGAT 

34351 GTGCAGAGGC TCTGTCTTTA TGAGAAGAAT TTTATGATCT TCGGGAACGG 

34401 GAGGACGATC TGTTACCATT TGATACAAAG GAAGAAACTT ACGTATTTTT 

344 51 AAATGGGGAC GCGTGAGTGA TTTGCTCATT AAGGGAAGGA ACCCATAAAT 

34501 TGTCTCTTCG TAACAGATTG TTCCTGGAAG GATCGGAAGG AAGACAACAA 

34551 GCCGATCATT AATGATCTCT AGAGTGATGA AGTGCTCAAG TTTTTTCCCA 

34601 AAGCGTAGGA GCGGAGATCC TGTACGGTCT GTGTGCGTAA ACATCCTGTT 

34651 GAGATAACAA GGCGAACGTA CGAGTCGGCG ATCATCAGCA GCAAAGAGCT 

34701 TGCAGACAAA ACTTCCAGGC TCTAGGAGCT CCAACATAGC AGTGGCTATA 

34751 GGATCTTGGC TCATGAAGAG AACGTGTAGA CGAGCTTCTT TTCGGGCTTT 

34801 ATTTAGCTCC AAGTGGTTTA AAACGGCTTC GACACCTAGT TGGGCTAAGG 

34851 AACTTTTTAA ATTTACTTGT ATACACTGTT GAGGCAGATG AAATCCAAGA 

34901 AAGTACGCAG GAATATTCTC AATGAGGACC TCTCCTTCGT AAATGTGGGG 

34 951 CGAGAGTTTT TTCAAATGGG AAACGAGTCG TCCGTCTGGG GAGGCTGCAT 
35001 CATGATGCGC GTGGAGTAGG TTATACATAA TTCATCGTAT TTTAAAGTTA 
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35051 AAAGCAATAG GTAAGGTCCC TCTTTGCGCT TCTAAGACTA GCGTTTCCAG 

3 5101 AAGAATTTCC TACAAGGACT TCATTCTGCT TGTTGTTATA ATTTTGTGCA 

3 5151 ATTTGTCAAG GAGGGGGTCT TGGAACGAGA AAATTGTGTT GTGGAAGGGA 

35201 ATTTTAGGAA AAAAGTTTCT TAGAAAATAG GGCAGAAACT TTCTCTTGGA 

3 5251 GAGATTGAGT AAAATGTAAA GAATAGTCTT TGCAATTGAG AATTATTTCT 

3 5301 CTCTGGTTAC AATGGAGGAT TGGCTAAGGA GGATAGTAGG TATGCAGATT 

3 5351 CC AAGAAGCA TTGGTACTCA CGATGGTTCT TTCCATGCGG ATGAGGTCAC 

3 5401 AGCGTGTGCT CTCCTTATTA TTTTCGATCT TGTGGATGAA AATAAAATTA 

35451 TACGCTCTCG AGATCCTGTC GTATTATCGA AATGTGAATA TGTTTGTGAT 

35501 GTCGGTGGTG TTTATTCTAT AGAAAACAAG CGTTTTGATC ATCATCAAGT 

3 5551 CTCTTATGAT GGATCTTGGA GTAGTGCAGG TATGATTCTG CATTATCTTA 

3 5601 AAGAGTTTGG TTATATGGAT TGTGAAGAAT ATCATTTCCT TAACAACACT 

35651 TTGGTACATG GTGTGGATGA ACAAGATAAT GGCAGATTCT TCTCTAAGGA 

35701 GGGATTTTGT TCGTTTTCTG ATATTATTAA AATTTATAAT CCTCGCGAGG 

35751 AAGAAGAAAC TAATTCGGAT GCGGATTTTT CTTGTGCTTT GCATTTTACC 

3 5801 ATCGACTTTT TGTGTCGGCT AAGGAAGAAG TTTCAGTATG ATCGAGTTTG 

3 5851 TAGGGGGATT GTCAGAGAAG CCATGGAAAC CGAGGATATG TGTTTATATT 

3 5901 TTGATCGTCC TTTAGCATGG CAAGAAAATT TCTTTTTTTT AGGGGGAGAG 

3 5951 AAGCACCCTG CAGCTTTTGT TTGTTTTCCT TCCTGCGATC AATGGATTTT 

36001 ACGAGGGATT CCTCCGAATT TAGATCGCCG TATGGACGTT CGTGTTCCTT 

36051 TCCCTGAGAA TTGGGCAGGT TTGTTAGGTA AAGAGTTGTC CAAAGTATCA 

3 6101 GGGATTCCTG GGGCTGTGTT CTGCCATAAA GGTCTTTTCC TTTCTGTATG 

3 6151 GACAAATAGA "GAAAGTTGCC AACGTGCTTT GCGGTTAACG TTACAAGATC 

36201 GAGGGATCAT ATGACAGTAT TCAAACAAAT TATCGATGGA TTGATAGATT 

36251 GTGAAAAGGT ATTTGAAAAC GAAAATTTCA TAGCTATAAA AGATCGTTTT 

3 6301 CCTCAAGCTC CTGTTCATCT TCTTATCATT CCTAAAAAAC CTATACCACG 

3 6351 ATTTCAGGAT ATCCCAGGGG ATGAGATGAT TTTAATGGCA GAGGCTGGAA 
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36401 AGATCGTGCA AGAGCTTGCT GCAGAATTTG GAATTGCCGA TGGGTATCGT 

3 6451 GTGGTTATCA ACAACGGTGC TGAAGGAGGA CAGGCGGTAT TTCACTTACA 

3 6501 TATTCATCTT TTAGGTGGGC GTCCTTTAGG TGCTATAGCC TAATTTTCTT 

3 6551 TTGTTTCTGT GGATCCTTGT TCGGCTCGGA GTCCCTCCGT TATCAATTGT 

3 6601 TGATCCAAGA TTTTGCAAAA GTTTCAGAAG AGGGCATAGG CCTTTTGGAG 

36651 TCTAAAGAGT ATTCTTTACT TCAGGCTAAG CTAGTTTTAA GGGCTCTGGC 

3 6701 TCAAAATTCT TCTTTTGATG ATTGGTTTAG AAGTTTTAAG AAGTGTCAGA 

36751 TTTCCTATCC AGAGTTAGCT CATGATCGCG ATGTCTTAGA AGAATTTGGG 

36801 ATTCAAGTTC TGCGTGAGGG AATCGAAAAT CCTTCCGTGA CCGTTCGTGC 

36851 TGTGAGTGTC CTTGCTATTG GGCTTGCTAG AGATTTTCGC TTGGTCCCTC 

36901 TCCTGCTCCA AAGTTGTAAT GATGACAGTG CTATTGTTCG ATCTTTGGCT 

36951 CTTCAGGTTG CTGTGAACTA TGGCTCTGAA AGTTTAAAAA AGGCCATTGT 

37001 AGAGCTTGCC CGTAATGATG ATTCTATTCA TGTTCGGATT ACAGCATATC 

37051 AGGTGGTCGC TCTTTTACAG ATAGAGGAGC TATTGCCATT TTTAAGAGAG 

37101 CGTGCTGAGA ACAAACTTGT AGATAGTGTA GAACGTCGAG AGGCGTGGAA 

37151 GGCTTGCTTG GAACTCTCTT CTCAATTTCT AGAGACGGGT GTAGCTAAGG 

37201 ACGATATTGA TCAAGCGTTG TTCACTTGTG AAGTGTTGCG TAACGGTATG 

37251 TTGCCAGAGA CTACTGAGAT TTTTACAGAA CTCTTATCTG TAGAGCATCC 

37301 TGAAGTGCAG GAGTCTCTCT TACTTTCTGC TTTAGCTTGG AGTCATCAGC 

373 51 TACAGAATCA CAAAGAGTTT CTTAGTAAAG TGCGCCATGT GATGTGCACT 

374 01 TCTCCATTTC CAAAAGTACG TTTTCAAGCT GCTGCACTT.C TCCATCTGCA 
37451 TGGAGACCCT TTGGGCAGAG ACTCTCTGGT TGAGGGCTTG CGCTCTCCTC 
37 501 AACCTCTTGT GTGTGAGGCA GCTTCGGCGG CTCTCTGCTC TTTAGGAATC 

375 51 CATGGAGTCC CTTTGGCAAA GGAGCATTTG GAGAGCCTTT CTTCTCGAAA 
37601 GGCTGCTGCG AACCTCTCCA TTTTGCTTCT TGTGAGCCGT GAAGATATTG 
37651 AAAGAGCTGG AGATGTGATT GCTCGCTACC TCTCCAATCC TGAAATGTGC 
37701 TGGGCTATAG AGTATTTCTT ATGGGATGCA CAATGGAATT TACGTGGTGA 
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37751 TACCTTCCCT CTATATTCGG ATATGATTAA ACGTGAGATT GGTAGGAAGC 

37801 TCATTCGCCT TTTGGCAGTA GCTCGCTATA GCCAAGCCAA GGCTGTAACA 

37851 GCAACGTTCC TTTCAGGACA GCAAGCTCAG GGATGGAGCT TTTTTTCTGG 

37901 AATGTTCTGG GAAGAGGGAG ATGTGAAAAC TTCTGAGGAT TTGGTTACAG 

37951 ATGCTTGCTT TGCAGCAAAG TTGGAAGGAG CGTTAGCCTC GCTATGTCAG 

38001 AAAAAAGATC AAGCTTCCCT ACAGAGGGTC TCTCAACTTT ATAATGACAG 

38051 CCGTTGGCAA GATAAATTAG CAATCTTAGA GAGCGTTGCT TTTTCTGAGA 

38101 ATCTTGATGC TGTGCCTTTT CTTCTAGACT GCTGCCATCA CGAAGCTCCT 

38151 TCGCTGCGAA GTGCAGCAGC GGGTGCTCTT TTCTCTATTT TCAAATAAAT 

38201 ATTAATAAAA TTATTCAAGA TATAGAAGAA AAACCACGCA GTTGTAATTT 

38251 CTATTTCTTA AAAATAATTT TTCAGACTGA CTTTATTCTT TCATTTTTAA 

3B301 GTCTTTGGAG ATAGAAAACT TTGTTATAGA TTTTTATCTG GTAGCTTTTA 

38351 TAATTTATGA AGAGCGTAAG CTCAGAGCCT GTATGTCATG CACAACCTGT 

38401 ATATAAAATC AGATTTGTTT TTTGAATCTC TATTCTCGTT AAGATTTCGT 

38451 TATTCTGGAC GTTATCTCCA TCACCACTCC TAATTTTCCT AGCATTTCTA 

38501 TCTTTAAGCT CAGCACCGTA GCTTGCTTAA AGGAAATATT TTTCATTTAG 

38551 GTTGTGGAGT TCTTTATTTT ATGAATTTTT CATTATTTTT ATTTTTCCTG 

38601 ATAGCTATTC AGGGAATCTG CTTGTACGTG GGACGTCGTG GTAGCAAAAA 

38651 GGTAGAAGAT CGCGAGAGCT ATTTTCTTGC AGGAAGGAGT TTAAAAATCT 

38701 TTCCTTTGAT GATGACATTC ATTGCCACCC AAATCGGTGG CGGTGTACTT 

38751 CTTGGGGCTG CTGAAGAGGC CTTCTGTTAT GGTTATGGGG GG ATTCtTTA 

38801 TCCTTTAGGA GTCGCTTTAG GGTTGATTTT CTTAGGAATG GGGCCCGGGA 

38851 AGCGGTTGGC AGAGGGATCG TTAACGACCG TAGTCTCTAT CTTTGAAGTG 

38901 TTTTATGGTT CTAAAAAGCT CCGTAAGATC GCATTTTTAT TATCCGCAGG 

38951 TTCCTTATTT TTCATCCTGG TCGCTCAGGT GATTGCTTTA GATCGGTTGT 

39001 TTAGCAGCTT CCCTTTTGGC AAGTACGTAA CCGTAGCATT TTGGATTGTC 

39 051 TTAGCATCCT ATACCTCAAC AGGAGGGTTT CGCGGGGTCG TACGTACTGA 
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39101 TGTGATCCAA GCAGGATTTC TTCTTATTGC GGTGCTCGTC TGTGGTGTTT 

39151 CTGTATGGCT CTCTGTCCCT AAATCCTTGT CTGTGTTGGA TCCTTTCCAA 

39201 TCACTTCCTT GTGCGAAGCT TTCCAATTGG ATATTCATGC CTATGCTCTT 

39251 TATGCTTGTT GAGCAGGATA TGGTGCAAAG GTGTGTGGCT GCCTCCTCTC 

39301 CAAAACGCTT GCAATGGGCG GCTGTAGGCG CAGGCCTTGT TCTTCTTCTT 

39351 TTTAACTTTA TCCCTTTATT TTTAGGTTCT TTAGGAGCTA AAGCAGGCCT 

39401 TAAAGCAGGA TGCCCTCTGA TTGATACCAT TGCATATTTT TGCAATCCCT 

39451 CACTAGCAGC TGTGATGGCT GCTGCCATCG GCGTTGCGAT TCTCTCTACC 

39501 GCGGACTCTC TTATGAATGC TGTAAGCCAG CTAATCGCTG AAGAATACCC 

39551 TACGTTGAAA GCCCCTTATT ATCGTTATTT AGTATTGGGT TTGGCGGTTG 

39601 CAGCTCCTCT TGTTGCTATT GGTTTTACAA ACATCGTAGA TGTCTTGATT 

39651 TTAAGCTATA GCCTGTCAGT GTGTTGTCTT TCAGTCCCTG TGGGTTTCTA 

39701 TCTTCTAGCT CCTAAAGGTC GCCGTGTGAG CGGAGCTGCT GCTTGGGCAG 

39751 GAGTGCTCGT TGGTGCTCTG GGCTATGGAT GGGTTCAGAT AGTCTCTTTG 

39801 GGGATGTTTG GGGAGCTATT GGCTTGGGTA GGTTCTCTAG TCGCCTTTTC 

39851 CTTTGTAGGA TTTATTGAGA TCACTTGGAA AAACAAAGTC AAAACGCAAA 

39901 CTTAGATAAC CACTGCATGA GAAGATATAA CTAAAATAGA TCCTGAGTTG 

39951 TTTAGGTTTC TCTTAGATCT GATAGGTTGC GCTTAGTAAG AGATCGTCAG 

40001 TTTTTTAAGT TGTGTTTAGA ATCTGATACC TCTCCTTCTT TTCCAAGAAG 

40051 AAGAGGGGTT CGTTTTATTT TTTATTATTC ATTCGTAGGG GCGGGAAGCT 

40101 GTTTTAACTG ATAGAGCAGG TCGATAAGAG AGGAGAGCAG GGCTTGATAG 

40151 AGCGTCTCTT CAGAATGGTG AAGAGTGCCC TTAAGAAGCT TTCTGCAATA 

4Q201 CTTTAATAGC CAATACAGAG CGCAAAGTAA ACACTGTAAA AGGTAAAATT 

40251 TACACGCTTG TTTTATCATA AACCTCCAAA GAAAGACGCG TGTTGGCGTT 

40301 CTTCCAATTG GCTGTTAGAT AGTGAAAATA GTATTTACTA TTCAATAAAA 

40351 ATATTTATAT TAAATATAAT AAAACAAAAT TTCTAATAAA CTTTTTAAAA 

40401 GTATTTGGCT AAGTTTAATT TAAGAGTTCT CAAATAAAAG ATTTTTTAGT 
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4 0451 CTCTTTTATT TGAGAATCAT CAACCGATTT CAAGAAATCG CATGAGCCCG 

4 0501 TGATTCTGAT GGGCAGAAGT GCTTTTTAGC AAATAGGTGA AGCTCCTCGG 

4 0551 GCGTGATCTG TTGCTTCGCG TCACAAAGAG CCTTTTCAGG GTGTGCATGC 

40601 ACTTCGATCA TCAGACCGTC GGCACCTACC GAGAGACCAG CAGAGGCGAG 

40651 AGGAAGAACT AGAGAACGCT TCCCCGCTGC GTGGGAAGGA TCTACAATTA 

40701 CAGGGAGAGA AGAGATCTCT TTAAGGAGAG CCACGGTATT GAGATCTAGC 

40751 GTGTAGCGCG TAGAGTGCTC AAAGGTACGA ATTCCTCGTT CACAAAGGAT 

4 0801 TACCCCAGGA CAGGAGGGAG AAGAAGCAAG GATGTACTCC GCTGCGCATA 

4 0851 GCCACTCTTC AAGAGTAGCT GCTGGACTGC GTTTTAGGAT AATCGGACGA 

4 0901 TGTGATTTGC TGACCTCTTG TAAAAGAGGG GTGTTATGCA TGTTTTTGGC 

40951 TCCGATACGG AGGATATCCA CATGTTCGGC AGTAATTTCA ACATCTCGGA 

41001 CATCTAAAAC TTCGGTTTCT GTAGGGAGAC CATGGATGCT CTGTGCTTCC 

41051 TTATGCCAAA GCACACACTC TTTCTCCCAT CCTTGAAACG AAAATGGGCT 

41101 TGTCCGTGGT TTTCTGATTG ATCCTCGGAA TACCTGAGCT CCTGCTTCTT 

41151 TAACTGTAAG AGCTGAAGAG ACTGTATGCT CGTAACTTTC TAAGGTGCAG 

41201 GGGCCTGCGA TCAGTATTGG CGATCCTTCT ■ CCAAACGATA GATTTGGAGA 

412 51 AATAGGAACG GTATGGACCT CGTCAGGATG CTGTTTGAGG GTGCGCGGTA 

41301 GGGGATAGGT AAACGTAAGA ATAAGTACCT CATGCAAAAC TAGGGATTTG 

41351 AGCTGTCCGG TTTTGAGTTC TGATCTTCAG TTCTAGGGTT TCTAGGCAAG 

41401 AGACAACCGT AGTGGTTGGG GAAGCGGATT AAGAAGATAC GAGCGTCTCT 

41451 TTCCGGAGAA ATTGTATTTA AAAATCCATG CATCGCAACA AAGTTTTCAA 

41501 TATTTTCTTC TGAGACAGTA TCTCGATCAG ACATGATCAG ACAAACAGCA 

41551 AACGGCTGCA AAGGTACAGT AGTCATAGCA GCGACGTCGT TAGCCTCAGC 

41601 TTGCTCTTTG ACGTCATCCA AGATATCACC GAGTTCATTA GAACTTAGGG 

41651 AGTGGAGGTA GCCTATCAGT GATTGTCTAA AACCAGAGCG ACAGAAACCA 

41701 TCAGATGCTG TGACCGTGTT TTTTAAGAGA TTACAAATCA ACTGATCTCG 

41751 AGCTCTAGAG TATTGATTTA TAGTCGCTGT AGCTAGAGGG AAACCGTATT 
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41801 


TTAACGAAAG 


TAAAAGCTTA 


TCTTTTACTT 


GTGTAAATCT 


ATGAGAAGCC 


41851 


TCTCTTTGGA 


AAGCGTCTAA 


TCGAGCAGCT 


AGCGGAGTCT 


TTTTTTGTAG 


41901 


AAGTTTAGGG 


TAGTTGTTAA 


GGAAACAAAA 


GATTAAAAAA 


GATTTCGAAA 


41951 


ACTGATCACT 


TAAAGCTGTG 


GGAGAAGAAG 


GCGAACCCAG 


TTTTGATCCG 


42001 


AAGAGATTTA 


ACCGTGTTAA 


AAAAACGTTC 


CACCCATTCC 


TAAAGGTACT 


42051 


TTCGATTTCT 


TTTTCTAAGA 


GTATGGCCTC 


GGTTTCCAGT TGATAATTTG 


42101 


GGAAAGTAGC 


TCGTAGAAAC 


TGCATACAGT 


TCTTTTCAAG 


TATGGGATCT 


42151 


AAAACCATCA 


CTTCAGGATG 


TCGTGACCAG 


AACTTGCAGA 


ACGTCTCTTT 


42201 


ATCTTTATTA 


GAAGGGGTTG 


AAGGATAGCG 


TGCTGGAGGC 


TTGAGTACTA 


42251 


GAGACCCTAG 


AATGGAACGT AGAAGTTTTT 


GGAGTTTCTT 


TTCGTTTTCT 


42301 


ACTTTTTTAA 


AGGCATATTG 


TAGAAAAGGA 


AGTAGAAGCA 


TTTTTAACTC 


42351 


TTTAACGGCA 


TCTTTAGTTT 


CTGGACGATT ATTCACTTGA 


GGATGCTGTA 


42401 


AAATAAAGAA 


TAAGAGCTGT ACTGCCTGTG TATAGAGGAG GTCCTGCTCT 


42451 


TGTGAGAGAC 


CACTGCCTTC 


ATCACGCGTT 


AAACGTAAAC 


TTCTTGCCGA 


42501 


GCAGACAGTA 


TTGAAGTGCT 


CTTTTAATTC 


AAGCTTACTA 


TGAATTGCGA 


42551 


CTAATAGCTT 


GTTTACAAGA TTTGAAAGTA 


AATCCGGCTG 


ATTGCTTAAA 


42601 


GGAGGGGTTT 


CAAAGGATGT 


GGATATTACA 


TTCTTTAATA 


GGTTTGCTAT 


42651 


GCGTTTATGT 


AGTGTTTCGA GTTGAGGCAT AGAGCTCTCT AACGTCAAAC 


42701 


TCTCATGAGT 


CGTAAGGATA 


GTCATGATGT 


TAGAAGAAAT 


CACCTCTCGT 


42751 


TGCCAAGAAG 


AATCTAAATC 


TTCATTAAGG 


TAGCTGAGGG 


AGGCGTTACT 


42801 


TATTTGTAAG 


ATCTCTTGTA 


TAGAGCGTTG 


GGGGGGCGCG 


GACGCTATAA 


42851 


TGATATCCTC 


TAATGATTTG 


TAGAAAGTAG 


CGCAGAGTTT 


GTGAAATCTT 


42901 


ACAGAGGGGC 


AGGTACATAG 


AAAAGCAAAG 


AAAGAAAGAG 


CTGGAGACCA 


42951 


AGGGAGATCC 


CCAGATCGTC 


TGTTTTCTAT 


AGTTGCAAGA AAGCGAGAGT 


43001 


ATTGACATCG 


GATTGCTTCA 


GGGAGGTGAT 


CTTTGACTAT 


AGCATTGAAT 


43051 


CTCTCAGTAT 


CCCACCCTGA 


ATCAGCAATG 


CGTCGACTCA GCTTAGCAAA 


43101 


AGCCTCTCGT 


ATTTCTCTTT 


CATATTGCTT 


TCGAAGTTGT 


TGATCTTCTG 



WO0027994 f file:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\WO0027994.c pcl 



WO 00/27994 PCT/US99/26923 

43151 GAGACATCTT TTGAAGCTCT GCATCAGTAA GAAAGAGAGC GGGCAGATGT 

4 3201 TCCAAGAGAA GAAAGAGTTT GTCTCTAGAT TGCATGTAAG GAAGTGCGGA 

4 3251 ATGATCAATG AGATTTTGGA ACAATTCCGA ATAGGGACGA TTTGCAGCAA 

43301 GAAATTCCAG AACACTGGGG AGGAGATCGT CCTGCATATC AGAAATAAAT 

4 3351 AAGGCTCTCG CTGTTTCTTC ATTACTTGAA GCTGCGATTT GTTGGCGAAT 

4 3401 CGCATATGCA GAGAGTTTTC TTAGAAAGGC TATCAGAGTT GCAGTGTGTT 

43451 TCTGAGAAAG AATCACCCCG TCATAGAGGT CTATGAAGGA GCAATAAGTA 

43501 CTGCATAACT GAAGGAGTTC AGCCATTTCT GCACAAAGAT TCGCATTTGC 

43 551 CGGAGAAGAG GAGGCGAAAG GCAGGTCAAG GAGACGTGTG GCTTCCTGCT 

.43601 CAAAGACAAT ATCATTTCTG CTGCTCTCTT CGTAGAGAGC AGATAGCCAT 

43 651 CCTACAGCAT AGGCACGATA AAAGCAGTTC CCATCTCCCG GTACATTCAC 
43701 AAGGTAGTAA TTGTCATTTA GATAGAGAGC CTGTTCAAGA GAGAGTTGCG 
43751 CAAGTCGCCG GTGTTGTTGA GGAAGATCCG GATTTTGTGC GATTTTCTTG 
4 3801 AACTGTTTTA TTTGGAAATA CATCGGTTCG TTATCAATCC GATTGGGATA 
43851 GGAAGCTACG AAATGAGGGT TAAAGTCGCC AAGTATTGGA TCAACTTGCA 
43901 TGGCAGGAAG AGGCAGAGGC GCTCGTCTTA GTACCATGCT CACCATGCGA 
4 3951 TTTAAGCATT GCCAATAGCC TTGACTAGAT GGGCGGAGAG GCATGGGGGT 

44 001 AGCCACCCGT ACGGGAGGAG GAGGGGCCTC TGGTGGTGGA GTCGGCTTTT 
44 051 TATCAGCAGG TTGTTTAGGG ACTTTTGGGC TCGCTGGTGA AGGAGCTTTG 
44101 GGGGGAGGCG GGGGTGTGTC CTCTGGGGGC GGCGTGCCCG GCTTGGG AAC 
44151 ATCGGGTTTT TTGTCTTCAC CATCCTTAGG CGGTTGTTTG GCAATTTCTA 
44201 TAGTTTTTGG CTCTGGTCCT TTGGGAAGAG TGGGAGGCGT TGGCAAGCCT 
44251 TCTTTTCTGA CAACCCGATG ATGCTTGTAG TAGTGAATCA GAAGAAGCAA 

443 01 ACCAAGAGTA ATGATATGGA GCAGAACGTA TCCTATGGTA CGTAGAATTC 
44351 TAAGTAACAG AGGGTCTTTA GTATCAGTCG TTAAGTGGTA AAAATTATTC 
44401 TTGTTATTGG GCGGGCAATG TGGTGGGGAA TTGGATACAT ACGTTCAAAA 

444 51 ATTGCTCGTT TTTTAATCAA AATTATTCAA AGTTAAAACT TTTTCGAGTT 
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44 501 TGATGTATTT ATTTTTTTAT GTAAACTTTA ACACATGATA GAATTTAGCG 

44 551 TATAGAGCGC AAACTTTCAT GATAAAACAA ATAGGCCGTT TTTTTAGAGC 

44 601 ATTTATTTTT ATAATGCCTT TATCTTTAAC AAGTTGTGAG TCTAAAATCG 

44651 ATCGAAATCG CATCTGGATT GTAGGTACGA ATGCTACATA TCCTCCTTTT 

44701 GAGTATGTGG ATGCTCAGGG GGAAGTTGTA GGTTTCGATA TAGATTTGGC 

44751 AAAGGCAATT AGTGAAAAAC TTGGCAAGCA ATTGGAAGTT AGAGAATTCG 

44801 CTTTCGATGC TTTAATTTTA AATTTAAAAA AACATCGTAT CGATGCAATT 

44851 TTAGCAGGAA TGTCCATTAC TCCTTCGCGT CAGAAGGAAA TCGCCCTGCT 

44901 TCCCTATTAT GGCGATGAGG TTCAAGAGCT GATGGTGGTT TCTAAGCGGT 

44951 CTTTAGAGAC CCCTGTGCTT CCCCTAACAC AGTATTCTTC TGTTGCTGTT 

4 5001 CAGACAGGAA CGTTTCAGGA GCATTATCTT TTATCTCAGC CCGGAATTTG 

45051 TGTCCGTTCT TTTGATAGCA CCTTGGAGGT GATTATGGAA GTTCGTTATG 

45101 GGAAATCTCC GGTTGCCGTT CTAGAACCCT CGGTAGGACG TGTCGTTCTT 

4 5151 AAAGACTTCC CTAATCTTGT TGCAACAAGA TTAGAGCTCC CTCCTGAATG 

4 5201 TTGGGTGTTG GGCTGTGGTC TCGGCGTAGC TAAAGATCGT CCTGAAGAAA 

45251 TACAAACGAT TCAACAAGCG ATTACAGATT TAAAGAGCGA AGGGGTGATT 

4 5301 CAATCTTTAA CCAAGAAATG GCAACTTTCT GAAGTTGCTT ACGAATAGAG 

4 5351 GGTATTCTTA TGGCAACCTC TGTTCCTGTA ACTTCATCTA CTTCTGTAGG 

45401 AGAGGCTAAC TCCTCCAACG AAAGATTTAC TGAACGAACA TCGCGAATGT 

45451 ATTACGCAGC TTTAGTCCTA GGGGCTTTGA GCTGTTTAAT TTTTATTGCT 

45501 ATGATTGTCA TTTTCCCACA GGTCGGATTG TGGGCTGTGG TCCTCGGGTT 

4 5551 TGCTCTTGGA TGTTTACTTT TAAGCTTAGC TATCGTTTTT GCTGTCTCCG 

4 5601 GTCTCGTTTT AGGCAAGACT TTAGAACCTA GTCGAGAAGC GACTCCTCCA 

4 5651 GAAATTGTTG CGCAAAAGGA GTGGACTACA CAACAAGATG TCTTAGGGAA 

4 5701 TGAGTATTGG CGTTCCGAGT TGATTTCCTT GTTCTTACGA GGGGATCTCC 

45751 ACGAATCTCT GATTGTTGAT TCTAAGGATC GATCTTTAGA TATTGATCAG 

4 5801 AGTTTACAAA ATATATTGAA ACTTGAGCCC CTATCTACGA CACTTTCGCT 
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4 5851 GTTAAAGAAA GATTGTGTCC ACATCAATAT CATTTTACAT TTAGTGAGAC 

4 5901 AGTGGAACTT ACTGGGAGTG GATCTTAGTC CTGAAGTCAC TGCGCACGCC 

4 5951 GAGGAACTTC TACTCTTTTT GATAGAAGAG CAGTATTACT CTCCTGATAT 

4 6001 TTTGAAATTG ATTCGCTACG GAGATGCTTT- ACAAGCAACG TCTCCTTTGA 

4 6051 TGGATTGGGC AGATTCAGGT TCCTTTAGTG TAGACGCAGA CGGGGTATTT 

4 6101 AGCTGTCGCA GAGAAGAATG TTCTCCTGAG GATGCTTTGG CGCAATTCGA 

46151 TCTTCTTTTG GCGTTGGAAA ATCCCGACAG ACGCTTCTTA AAGGATTCTT 

4 6201 TTCTTACCTA CATTTGGTCG TCTTCATTTT TTGAGAAGTT TTTACATCGC 

4 6251 CATCTAGAGA GCTTGCAAAG AAAGCTCCCA GAGACAGCGA TCGATGTCGC 

46301 CCGCTATGAA GCACAAATAC AAACATTTCT CTCTCGCTAT TTTCAGAAGC 

46351 TCGATTTGAT AAACGCAATG TCCTTAGATT GGGG ATATAA CTGTGCTGAG 

4 6401 GGAGAAAAAT GTTATGAGAG CGCAAATCAA AGATTAGACA ACCTATTTAT 

46451 TGCTTTTTCT TCTTCTGTTC CTGCTATGAA GCGGCTCTTT GACAAATATG 

46501 GTTCTGTGGT ACGGGTAGAT CGTAGGCAGA TTCGTGAGCA GATTCTTTCG 

46551 AACACTGAAA TCTTAGAAAA TGAGTCAGGG TTCCTCTGCA GTTTGTATGA 

4 6601 ATATCCTTTA TCCTATTTGA TAGATTGGGC TGTTTTGCTA GACTGTGTTC 

4 6651 GCGGTACCGA AATCTCTCTA GAAGATCAGG CCGATTACAC CGTTTGTTTG 

4 6701 CAAGGCTTGG ATTCTATGTT ATCTCAATTT GCGAGTCGTT TACAGTCTGG 

46751 ACAAAAAGTA TTGAATCCTA GAGATGTTTT AAGTGAACAG GCTGCGGTTA 

4 6801 TGCTTGTTCA TGGCTTGGCA GCACAGGGCG TGTCGTTTCA AGGATTGAAA 

4 6851 GCTTTGATGT ATTTGACAGC CGTTCCCCAA AGAATGTGGT TAGGAGCATT 
4 6901 GCCTTTATTT GAATCTTTTC CTGTCTTTAA TCGGATGAAA GAATTTCTTG 
4 6951 GGGAATCTCT GGGAGACTAG GTGAATTTGT ATCAAAGAAG GAACAAGATT 
47001 GCATGTTAGG TTCTTTGCCA TGTTATCCTG GTGCTGGCAA TATTGAAGAA 
47051 TACAAAAATA GGTATTTCTA TTGTCAGTTA TGTGCTGAGG TCGTTAGTCC 
47101 CTATGTTGTT CCTGTTATTG TAGTTGATGT GCAAGGGGCT CCTCCTACAG 
47151 GTATCTTGCA GGTCTTGCGT TGTAAGCAAC ATAAATTTCA AGGCCTACCC 
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47201 GTACATGGCC CCATTACTTC TTTATGGGCT TTGGAGCCCG TGGGTAAGGG 

47251 AGCTCCGCAG CTGGAGTCTG CAATGTACGA GCTCTGTTCT CAAGTAAGGA 

47301 ATTTTGACAT CTGCTCTATT GTGAGTTGGG TCTTTGGTGG GTTGTGTATT 

47351 TTTGCAGGTC TGATTGTCGG GGTAATGGTT GAAGCCCCTT TGATTGCGGG 

47401 ATTAAGTGCT TGGGTGATTC CCTGTATCAT TGGAGGGGTT GGTGCCATTT 

47451 TATGCTTGTT TGCGATCTTG ATGGCGTACT TGGGAAGAGG GAGAGTCCGT 

47 501 GAGTGGCTCA ATCTTTCACA CGAATATATA ACGCAATGTC ATTGTCGTCA 

47551 GATACAGGCA CATTCTCAAA ACTATTCTGT GATCACAGAG TATCCTGCAA 

47601 CCTGTGCATT ATCTCAACCG ATTACAAAGT TACCTAATGG ATCACGCAGA 

47651 GATAACTAAG CGTGTTCGTC AGTTATTTCT CACATTTTCT CATGAATCTT 

47701 TTACTGCGCT GCACGAGATC CCTCTCGAAA ATTTTTAAGG ATAGATACTT 

47751 GGAAACTATG GTTTAAAAAG CTATAGAGGA TTCTAAATTG GGGTTCTAGC 

47801 AACTTCTTGA CTTTAAGATC CAAAGTTAAG AGACTGACTA ATTATTTTTG 

47851 TTTGCTTGTG TTTCCAGATG AGCAATTGGT ATGGTAAGAG ATATTCAGAG 

47901 TGAATCTATA GGGAAATTAG TATTTTTAGG CACAGGAAAT CCCGAAGGAA 

47951 TTCCCGTGCC GTTTTGCTCA TGTAGAGTGT GTCAAAACAC AGGGATTCAT 

48001 CGTTTACGAT CTTCGGTACT CATTCAATAT CAAAACAAGA CTCTAGTGAT 

48051 TGACGCAGGC CCTGATTTTC GTACGCAGAT GTTAGTTGCA GGGGTTTCCG 

48101 AGCTCGATGG GGTATTTCTG ACCCATCCCC AGTACGATCA TATCGGTGGT 

48151 ATTGATGATT TACGTGCGTG GTACATAGTC ACGCAGCGTT CGTTGCCTTT 

48201 GGTCCTTTCT GCAAGCACCT ATAGATTTTT AAAGAAGGCT AAAGAGTATC 

48251 TCTTCGCCAC TCCGAATGTA GAGTCTTCAC TTCCCGCAGT TTTAGAGTTT 

48301 ACAATCTTGA ATGAGGACTG TGGGCAGGAG GAATTTCAGG GCATTCCCTA 

48351 TACTTATGTT TCCTATTATC AAAAGTCGTG CCATGTAACG GGTTTTCGTT 

48401 TTGGAAATCT TGCTTATCTT ACAGATCTCT GTAGCTATGA TGCAAAAATT 

48451 TTCAGTTACT TAGATAATGT AGAGACATTG ATCTTGTCTG CGGGTCCATC 

48501 GGAAACTCCT ATTCCTTTTC AGGGACACAA ATCTTCGCAT CTTACTGTAG 
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48551 AAGAAGCCAA AGCTTTTGCG AATCATGCAG GGATAAAGAA TTTAATTATT 

4 8601 ACACATATCA GCCACTGTTT AGAAGCAGAG CGTGACCAGC ATCCAGAGGT 

48651 CACATTTGCT TATGATGGCA TGGAGGTCCT TTGGACACTA TAGATACGCC 

48701 CGGGGAACAG GGTTCTCAAT CTTTCGGAAA TTCGTTAGGG GCCAGGTTCG 

48751 ACTTGCCTCG TAAGGAACAG GATCCCTCTC AAGCTTTAGC TGTGGCTTCC 

4 8801 TATCAAAATA AGACAGATTC TCAGGTCGTT GAAGAACATT TAGACGAGTT 

4 8851 GATCTCACTT GCGGATTCCT GTGGTATTTC TGTTTTAGAG ACCCGTTCTT 

48901 GGATTTTAAA AACACCCTCA GCTTCCACCT ATATCAATGT GGGGAAGTTG 

4 8951 GAGGAGATCG AAGAAATCTT GAAAGAGTTT CCCTCTATAG GGACTTTGAT 

4 9001 CATAGATGAG GAGATCACTC CATCCCAACA ACGGAATTTA GAGAAACGCC 

49051 TTGGCCTTGT CGTTTTGGAT AGGACGGAGT TAATTTTGGA AATCTTTTCC 

49101 AGCCGTGCCC TTACTGCAGA GGCAAATATC CAAGTCCAAC TTGCACAAGC 

4 9151 ACGTTATCTC CTTCCTCGTC TTAAGAGACT TTGGGGGCAC CTATCTCGGC 

49201 AAAAATCTGG GGGAGGTAGC GGAGGCTTTG TTAAGGGGGA AGGAGAAAAA 

49251 CAGATCGAGC TAGACCGTAG AATGGTCCGT GAGCGTATCC ATAAGCTGTC 

49301 AGCACAGCTG AAAGCTGTGA TCAAACAGCG TGCGGAACGC CGTAAAGTAA 

4 9351 AATCTCGACG AGGAATTCCT ACCTTTGCTT TGATAGGGTA TACAAATTCA 

4 9401 GGGAAGAGCA CCCTATTAAA TTTGCTGACG GCTGCTGATA CGTATGTTGA 

49451 AGACAAGCTA TTTGCAACTT TAGATCCCAA AACGCGCAAA TGCGTACTTC 

49501 CAGGAGGCCG TCATGTCCTT CTTACTGATA CTGTAGGCTT CATTCGAAAA 

49551 CTTCCTCATA CTTTGGTAGC AGCATTTAAA AGTACTTTAG AAGCAGCTTT 

49601 CCATGAAGAT GTTCTTCTGC ATGTTGTCGA TGCTTCGCAT CCTTTAGCTT 

49651 TAGAGCATGT ACAGACGACC TACGATCTCT TTCAAGAGTT GAAGATTGAA 

49701 AAGCCTAGGA TCATTACTGT GTTGAATAAG GTAGATCGGC TTCCTCAAGG 

49751 AAGTATCCCT ATGAAATTAC GTTTGCTCTC TCCTCTTCCT GTATTGATTT 
49801 CAGCAAAAAC TGGGGAGGGG ATCCAGAATC TTCTTAGTCT TATGACGGAA 
49851 ATCATTCAGG AGAAAAGTTT GCATGTGACT TTGAATTTTC CTTATACAGA 
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4 9901 ATATGGAAAA TTTACGGAAC TTTGCGATGC CGGGGTTGTG GCCTCGTCAA 

49951 GGTATCAAGA AGATTTTTTA GTTGTTGAAG CGTATCTTCC TAAGGAGCTG 

50001 CAAAAGAAAT TTCGTCCTTT TATTTCTTAT GTTTTCCCTG AAGATTGTGG 

50051 AGATGACGAG GGT AG AGGGC CCGTCTTGGA GAGTTCTTTC GGGGATTAGG 

50101 TAGTTTTCTT CTAGGACATC GAATCTTTGT TAGTGAGAAA AAGAGTGATA 

50151 TTTTAAAATA GCCACTCATC GCTAAATCTA TTGAAGTCTC TAGAGGTATA 

50201 TGACGGTTGC GGAAGTCAAA GGAACATTTA AGCTGGTCTG TTTAGGCTGT 

50251 CGGGTGAATC AGTATGAGGT CCAAGCATAT CGCGACCAGT TGACTATCTT 

50301 AGGTTACCAA GAGGTCCTGG ATTCTGAAAT CCCTGCAGAT TTATGCATAA 

50351 TCAATACGTG TGCTGTCACA GCTTCTGCTG AGAGTTCGGG TCGTCATGCT 

50401 GTGCGTCAGT TATGTCGTCA GAACCCTACA GCACATATTG TTGTCACAGG 

50451 TTGTTTGGGG GAATCTGACA AAGAGTTTTT TGCTTCTTTG GATCGGCAAT 

50501 GCACACTTGT TTCCAATAAA GAAAAATCCC GACTTATAGA AAAAATTTTT 

50551 TCCTATGATA CGACCTTCCC TGAGTTCAAG ATCCATAGTT TTGAGGGAAA 

50601 GTCTCGAGCT TTTATTAAAG TTCAAGATGG CTGTAATTCT TTTTGCTCGT 

50651 ACTGCATTAT TCCTTATTTG CGGGGGCGTT CGGTTTCTCG TCCTGCTGAG 

50701 AAGATTTTAG CTGAAATCGC AGGGGTTGTA GACCAAGGAT ATCGCGAAGT 

50751 TGTAATTGCA GGAATTAATG TTGGAGATTA TTGCGATGGA GAGCGTTCAT 

50801 TAGCCTCTTT GATTGAACAG GTGGACCGGA TTCCTGGAAT TGAGAGGATT 

50851 CGAATTTCCT CTATAGATCC TGATGATATC ACTGAAGATC TGCACCGTGC 

50901 CATCACCTCA TCGCGTCACA CTTGTCCTTC GTCACACCTT GTTCTTCAAT 

50951 CGGGGTCGAA TTCAATTTTA AAGAGAATGA ACCGGAAGTA TTCTCGCGGA 

51001 GATTTTTTAG ATTGTGTAGA GAAGTTCCGT GCTTCTGATC CTCGCTATGC 

51051 CTTTACTACA GATGTGATTG TCGGATTTCC TGGAGAGAGT GATCAAGATT 

51101 TTGAAGATAC TTTGAGAATT ATTGAAGATG TAGGCTTTAT TAAAGTGCAT 

51151 AGTTTCCCTT TCAGTGCTCG TCGTCGTACT AAGGCATATA CTTTTGATAA 

51201 TCAGATTCCC AATCAGGTGA TCTATGAGAG GAAGAAGTAT CTTGCTGAGG 
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51251 TTGCTAAGAG GGTAGGCCAG AAAGAGATGA TGAAGCGTTT AGGAGAGACT 
51301 ACAGAGGTGC TTGTTGAGAA AGTAACGGGG CAGGTTGCTA CGGGTCACTC 
513 51 TCCTTATTTT GAAAAGGTTT CTTTCCCTGT TGTAGGAACG GTAGCTATCA 
51401 ACACTCTAGT TTCTGTGCGT CTTGATAGGG TAGAGGAAGA AGGGCTGATT 
51451 GGGGAGATTG TATGATAGAT ATAATGCAAC ATTTTAAGCC CTATACTATG 
51501 GTCCCAGGAC AAAAACTCCC TATTCCTGGA TCTTTGTTAT ATGCTCAGGT 
51551 ATTTCCTACC CTGTGGCGTC TATTTTCTTC GAAACACGAA ATCTTAAATG 
51601 AGCAGACCTT ACAGGTGCAA GGGCCTTTAA AACGCTTTGC TGTTTTCCAA 
51651 GATTTACATC GTGGGGGGCT TGCAGTGACT TCTGAGCGCT ACAAGTATTA 
51701 TCTCCTTCCC TCGGGAGAGT GCACACAATC TATCAAAGGG AAACTGCCTT 
51751 CGGCAGCGCA AGCAGGGCCC CTGTTATCTC TTGGGGTGCA TAAGCATGCA 
51801 GATTGGCAAA AGGTCCGTTG TCGTCGTGAT CTTAAAGAAA TTCTTCCCCT 
51851 ATGGTTCCGT TTCGCCGCTA TGGCTCCTAA GGGATCCTAT CGGGATCTAG 
51901 AGACGACGGC TATCGGTAGC TTGGTAAAGA CTGCCCATCA AAGAGTTTTA 
51951 CATAGGGAAA CT AC AG AG AT TGCTCCTGCG TTACTCTCCA TAGCCCTTGC 
52001 GGGATTTTCA GAGTGCTTTC TTCCTAGGAG CTATGATGAA GAGTTCCAAG 
52051 GAATCCTCCC CCAAGATGGA GATCCAGAGG GGGGAGTTCC TTTTGAGCTT 
52101 CTCTCGTATA GCTTTGGTAT GATCCAAGAT ATTTTTCTGA GACACCAGGG 
52151 ACAGCTAGTA GAGATCCTTC CTGCATTACC TCCTGAATTT CCTTGTGGCC 
52201 GCTTGATTCA TGTTGCCCTT CCTAATCTTG GGACTTTGTC TATCGTCTGG 
52251 ACTAAGAAAA CTATCCGTCA GGTCGAGCTC CATGCAGAAT ATAGTGGCGA 
52301 GGTATTTTTA AAGTTTTGTT CTTCACTATG CAGTGCGCGC CTTCGGGAAT 

523 51 GGTCGGAGCG ACGTCTCTCT GGATCTAAGA GACTTTCTTT AGGAGAAACT 
52401 CTGGAGATAA AAGCAGGAAC CACATATTTA TGGGATTGTT TTCATAAATA 

524 51 GATAGCCTTC CATGGTTGAT AAACTGATCC ATCCTTGGGA TCTTGATCTG 
52501 CTCGTCTCAG- GACGACAGAA AGATCCCCAT AAACTCTTAG GGATCCTTGC 
52551 TTCTGAAGAT TCTTCAGATC ATATTGTTAT TTTTCGTCCA GGGGCGCATA 
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52601 CGGTTGCTAT TGAACTTCTA 

52651 TCGGGGCTCT TTTTCTTATC 

52701 CCGTGTGTAT CATCAGAATG 

52751 CTCCTCTGTG GGGAGAAATT 

52801 TACCGCATTT ATGAACGCAT 

52851 CTCAGGGGTG CTCTTTGTTC 

52901 TAGTCGGAGA TTTTAATTTT 

52951 ATTTCCGATC AGGGGATCTG 

53001 AATACGGTAT AAGTGGGAAA 

53051 AAACAGATCC TTATGGGAAG 

53101 CGTGTTGCGG ATTCTGAGAG 

53151 GAGGCGCTCG AAGCAGAGTG 

53201 TAGGCTCTTG GCAATGGCAG 

53251 GCGCATCGCC TTGCTAGCTA 

53 301 GCTTCTTCCC ATTACGGAGC 

53351 TGACGGGATA TTATGCTCCA 

53401 CAGTATTTTG TAGACTATCT 

53451 TTGGGTGCCG GGACATTTTC 

53 501 ATGGGGAGCC TCTCTACGAG 

53551 CACTGGAATA CGTTTACCTT 

53 601 TTTACTAGGG AGTGCTTTAT 

53651 TACGTGTGGA TGCTGTGGCC 

53701 GATGGAGAAT GGACGCCTAA 

53751 TATAGAATTT TTGAAACACT 

53801 GAGTGCTCAC CTTTGCAGAG 

53851 GACGTAGATC AGGGAGGTCT 

53901 GATGCACGAT ACCTTTCATT 



PCT/US99/26923 

GGAGAGCTTC ACCACGCTGT AGCTTATCGT 
CGTTCCCAAA GGAATCGGAC ACGGGGATTA 
GACTTCTCGC TCATGATCCC TATGCGTTTC 
GATTCTTTTT TATTCCATAG AGGAACGCAT 
GGGGGCAATC CCTATGGAAG TTCAAGGAAT 
TTTGGGCTCC CCATGCGCAG AGAGTCTCTG 
TGGCATGGCC TTGTCAATCC TCTACGTAAA 
GGAGCTTTTC GTCCCAGGCT TGGGAGAGGG 
TCGTTACCCA ATCGGGGAAT GTGATTGTAA 
AGCTTTGATC CTCCACCCCA GGGTACAGCT 
CTACTCTTGG AGTGATCATC GTTGGATGGA 
AAGGGCCCGT CACGATCTAT GAAGTGCACT 
GAGGGAAGGC CCTTAAGCTA CAGCGAAATG 
TTGCAAGGAA ATGCACTACA CTCATGTGGA 
ATCCCCTGAA TGAATCTTGG GGCTATCAAG 
ACATCAAGAT ACGGGACTCT CCAGGAGTTT 
ACATAAAGAA AATATTGGTA TTATTTTAGA 
CCGTAGATGC GTTTGCTCTT GCCTCTTTTG 
TACACGGGGC ATAGTCAGGC TCTTCATCCC 
TGACTACAGT CGTCATGAAG TGACCAACTT 
TTTGGCTCGA TAAGATGCAT ATTGATGGCT 
TCTATGCTGT ATCGTGATTA TGGCCGTGAA 
CATCTATGGA GGTAAGGAGA ACTTAGAGTC 
TAAATTCTGT AATTCATAAG GAGTTCTCTG 
GAATCCACAG CGTTTCCAGG AGTCACTAAG 
GGGGTTTGAT TACAAATGGA ACTTAGGTTG 
ACTTTATGAA GGATCCCATG TATCGTAAAT 
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53951 ACCATCAGAA AGATCTGACA TTTAGCCTTT GGTATGCCTT CCAAGAGTCT 

54001 TTTATTCTTC CTCTCTCGCA TGACGAGGTG GTCCACGGTA AGGGCAGCTT 

54051 AGTGAATAAG CTTCCCGGGG ATACCTGGAC CCGATTTGCT CAAATGAGAG 

54101 TGCTCTTGAG CTACCAGATC TGTTTGCCTG GGAAAAAGTT ACTGTTCATG 

54151 GGTGGGGAAT TCGGACAATA CGGCGAGTGG TCTCCTGATC GTCCCTTAGA 

54201 TTGGGAGCTT TTGAATCATC ACTACCACAA AACTTTGCGA AACTGTGTCT 

54251 CTGCATTGAA TGCGTTGTAT ATTCACCAAC CCTATTTATG GATGCAAGAG 

543 01 AGCTCTCAAG AGTGCTTCCA TTGGGTAGAC TTCCATGATA TAGAAAACAA 
54 351 TGTCATTGCC TATTATAGAT TTGCAGGCAG CAATCGTTCT TCGGCGCTTC 
54401 TCTGTGTCCA TCATTTCAGT GCGAGTACTT TTCCTTCCTA TGTTTTAAGG 

544 51 TGTGAAGGTG TAAAGCATTG TGAACTCCTT CTCAACACTG ATGATGAGTC 
54 501 TTTTGGAGGC TCAGGGAAGG GAAATCGGGC TCCTGTGGTC TGTCAAGACC 
54 551 AAGGGGTCGC TTGGGGTTTG GATATAGAGC TCCCTCCTTT AGCTACTGTG 
54601 ATCTATTTAG TTACTTTTTT CTAAAAATTT AAATACTTTA TTTGTAAATT 
54651 GTTGTGGGAT TGTTCTATTT TGTGGTGTAG TTGATATTAA TAATTTATTT 
54701 TATAATTAAA AATAATTATT AGTATTTCTT TTATGTCTAC ATCACCAATT 
54751 AGCAACGATC CCCGATATTT GTCTTTGTCT AATGCAACTG AGAAAACTTC 
54 801 TCTTCTTGCA AATAGCCGGA GTCTCTCGCC AGTACCAAAT TCCCTAGTTC 
54 851 CTAGCAATCC TGAAGATACA GGATTGCGAA AAAGTATTTT CACCCATTCC 
54901 GTGACTTTAT TTGCTGGCCT GGTTGTTTTG CTGGTAGCGG TTTCTGTTGT 
54951 TGTTGTCGCT TTGACCGTCT TAGCTCCCGG AGTTCCTCAG GCTATTCTTC 
55001 TTGGAATCGC CATTTCAGGC GTGGGTATTG GTGGATTTTC TATAATGAAG 
55051 AGCTTGGTTT ATATGGTCCG AGACTATATG TCCCCCAGGA TGCAGGAGTC 
55101 GAGCAGAATC AAAAGTGCTT TAGCTGTAGG GACTGGATTT ACTGTCATGG 
55151 GTTTGGTCAT GAAGGTGGGG GCGAATTTTG TTCCTGGAGG GTATGGGGGT 
55201 CTCGTGGGTA GCTTGGGATC CAGTGCGTAT TCCCGGGGAA GCCAAACCAC 
55251 ATTAGCAAGC TTCAGTCATT ATATTTATAC TAAGTTTTTC CGTTCTGAAA 
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55301 AAGTTGCTAA AGGGGAGAAG CTTACAGAAG CAGAAACTAT AAAAGAGGCG 

55351 AAAAAATTAC ACTATATCAC GTTGTCAATT GCCACTATTG GCGTTGGTCT 

55401 TGCGGTTTTG GGGATTCTCC TTGCCATTGC AGGAACGGTA TTGCTAGGAG 

55451 GCGCTCCCGC AACGATTGCT ATTATTTTAG CTCCCCCTTT AATTTCTATA 

55501 GGGCTTACGA CGGTTTTGCA AACGATACTC CATAGTAGTA TCGGAAAGTG 

55551 GAGAGCCTTT CTGCTTACTC AAGAAAAAAA AGATCTTTTT GTAGACACCT 

55601 CCCTGAAAGA CATTCGCTTA GAAAAATTGC CCCCCAGTGA GGTGGAAGAG 

55651 AGTGAAACTT CCCAATCTGT GATAGAAGTT CCAGATTCAG AGGGGATTGC 

55701 AGAGACGAGG ATCTCTGCGG AAGAAATCGA TACGAGGCTT TCCCTGACGA 

55751 CAAGACAGAA GGTCATCTTT* GCTCTTGCGA CACTCTTGCT CTTAGCAAGT 

55801 ATTGCTGCCT TCATAGTCAC GGGATTTGGT GGATTGACAG TCATGCAAGT 

55851 TCTCCTTGTT GCTTCTGTAG GATCGGCGGT TGCTTCTGTA ACACTCCCTA 

55901 TGGTTTCCTC AGGATTTTCC TACGTCGCCT ACCAACTGAA AGCAAGATTG 

55951 AATATCAGTA AATTACGTTG GAAAGAAGCA AAAAATAAAA AGCGGGTGCG 

56001 CCAGTTCTTA ATTGAGTCTG GAGTGATTGC CTCGGATCGA GAATTTAACC 

56051 AAATGTGGAA GACAGTCTAC AAAAAACAGA TTCAGAAGAC TGACGCTGCA 

56101 ATTCGTGAAG AGGTTCGCAA TTTTGAGAAG GGTGGGGAAG TGAACAGCGC 

56151 CCTTGTTGGT GGAATCTTAC TTGGTGTAGG AACTGGGATC ATGCTTCTTG 

56201 CCCTGGTCCC TGCATTTGCT CCTATCGTTC CTGGTATTCT TGCTCTTGGA 

56251 GGATCGACGT TAGGAATCGC GGGATCGATT TTAATGAGGA AGTTTGTCAA 

56301 CTGGCTCTAT GATGAGCTTG TGAAGCTCTA TGAGCGTCGA CGTAATCGCC 

56351 GTGAGCTTCT CTATGGTCCT GAAAGTAAAA TGCGCTCCAT TGCTACGGAT 

56401 TTAGTTGTTG AGGCTCTTGC TGCTAGCCAC GATCATCTAT TTGATCTTGA 

56451 TGGTCCCGTA GATTTTATTG ATGTGGATGT AGATATAGAT GGAGCTGCTT 

56501 AGGCCAGGTC CTTGAATGTA AGATCCTCGA GCTTTGGGAG CTTGTCTGCC 

56551 TTCTCTAATT TTATTTTCTC TTTTACATCT AGTAGTTTAC TTAATTATAT 

56601 AATCAGCATT CTTTTTGTTT ATTTTAATTT ATATTTTGTT TTTAAAATAT 
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56651 TTTTTATTTT AACATTTGTT 

56701 TAAGGTTCAA TTATGGCAGT 

56751 CCCCATTCCT CCTAATAGAA 

56801 AAGACAACTT AGGGGAACAT 

56851 AGTCAGGGCC CTACAATAGA 

56901 TAAAATTCGT TTGCCATCTG 

56951 GACGTTCTGG GGTACTTCAG 

57001 AAAAAAACCC CTCAAGCGCG 

57051 CCATGTGCAA CATGGCCAAC 

57101 GTATCCAGAA AAGATCTGAA 

57151 CGTTCCTATT CTGATGGTGA 

57201 AGATTCTACA GAGGATAGCC 

57251 GTTCTTCCTT CTTATCAGGA 

57301 GCCCTAGGTG ATATTAAAGG 

57351 TTTAACAACT CAGGGCGAAG 

57401 GTTCCGAAGA AGCAGAGGCT 

57451 GTTCGAGGAG CGACGTCTAC 

57501 GAAGGTTTCG GCGTTCGGAG 

57551 CAGGGAATAT CAGAACTAGA 

57601 TCTAATGTGA ATAAAGCAGC 

57 651 GGAAAAAGTA GCTCCAGAAC 

57701 AATCTCTTCT TGCACGCATG 

57751 GTGGAGGATC TTATTACTTT 

57801 GGAGTATGCA TCCATCGTAC 

57851 CTGCGGAAGC TCCCGAAACA 

57901 GCATGGAAAG CGTTACGGGA 

57951 GAGCTTCTTT AGGGCAATTG 



PCT/US99/26923 

TAATAGTTTT TATTAAATAA TTTATCATTT 
TGGTGGCGTA GGCGGCTCAA GATCTCCTTC 
GGAATAGTGA GGATGGAAAA GTAAGTCCTA 
ACAGTTAGCA GTAGTGACAG TAGTCTTGCA 
AGAGAGAAAA GCCCAGTTAG GCGGGACTGA 
TCAAAGAACC CGGAGATTCT CAAACTTCAG 
AGAATTTGGA AAGGCGTTAA AGGGGTCTTT 
TCCTGAAGTT TCTAGTCCAC GTCTTCCATC 
GTCTTCCTGG ACTCGAGGGC TTTAGAGATC 
AATCCAGAGG CAGATTTAGG GAAGATGAAA 
CCTTGATCGA GTAGGACACG ATTCTAATGA 
GTTCTGAAGG AGGAGAGCCT TCTTCAAAGA 
GTTCGAGGAG CGGTGTCTAA AGTTCATGGT 
AAAGTTCCAG CGTTCTGCTT CCGAAGATGA 
ATTCTGCCGG CGATACTGTA AAAGAAAGGC 
TCTTCGAAGA GTTCTTCTTT TTTATCAGGA 
AGTTCAGGGA GCCTTAGGTG ACGCTAAAGA 
AGCAGGCTGC AGGTGCAATC AGATCAGCAC 
TTCCAACGTT CTTCATCGGA AGGTGATCTT 
AAAACATCTG CGTAAGGCTT TAGAAAATTT 
AAGTGTCACC AGAGGTGGCT TCTAGGGTGC 
GAGCAATTGA CTCATCAGGA ACCTCCTACT 
CGTAGAATCC AATGTAGGTA GTGATTCTGT 
CTCAAGATGG ATCGCAAGCC CCAGCAGAGA 
GGTGGGGTAG AGGGATCTGC AGCGCAGGGA 
TTTTGTAGTT AGCATATTCC AAGCGGTAGC 
CTTCAAGATT AAGTTCAGCA CGACGTGAAT 
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58001 


CAGCTGTAGA 


TGATCTTGCA TCAGAAAGTA 


ATACACAATG 


GTTTGTGGAG 


58051 


CAAGAGGGCG 


TTTCAAATCC ATCGGCTGCA 


CCTAGCTTAT 


CTTTTGCGGA 


58101 


AGAGATCGCT 


CGTAGAGCTG CAGAAATGAG 


TAACAGAAAT 


GCCCAGAGTC 


58151 


TTGAAAAATT 


GGAATCAGGC AATGTGACTG 


ATCCTGTCAT 


TCAACAAGGC 


58201 


TTAGGATTAG 


CTAGATCATT TGCTCCAGAG 


GGACAGTAGT 


CGTTATCTCA 


58251 


CTGTTTCTCT 


ATGCGCAAGG GAAACTTGAA 


GAGTTTTAAT 


TAAAACTCTT 


58301 


CAATATGTTG 


ATTATTTTAA TATATTTAAA 


AGCATTTTTG 


TTGTTTTTTA 


58351 


ATAAAATTAA 


ATTGTTTCAG AAAAAAGATT 


ATTCTTTTTA 


GGAAGTGTTT 


58401 


ATGGCATCAG 


GAATCGGAGG ATCTAGTGGA 


TTAGGAAAGA 


TTCCACCTAA 


58451 


AGATAATGGG 


GATAGAAGTC GATCGCCCTC 


TCCTAAGGGA 


GAACTTGGCA 


58501 


GCCACGAGAT 


TTCCCTGCCT CCTCAAGAAC 


ATGGAGAGGA 


AGGAGCTTCA 


58551 


GGATCTTCGC 


ATATACATAG CAGTTCCTCT 


TTTCTACCAG 


AAGATCAGGA 


58601 


GTCTCAGAGC 


TCTTCTTCGG CAGCTTCTAG 


CCCGGGATTT 


TTTTCTCGCG 


58651 


TACGTTCTGG 


GGTAGACAGG GCCTTAAAAT 


CATTTGGCAA 


CTTTTTTTCC 


58701 


GCAGAGTCTA 


CGAGTCAAGC GCGTGAAACG 


CGACAAGCTT 


TTGTTAGATT 


58751 


ATCAAAAACC 


ATCACCGCGG ATGAGAGACG 


GGATGTCGAT 


TCATCAAGTG 


58801 


CTGCTGCTAC 


AGAAGCCCGA GTGGCAGAGG 


ACGCGAGTGT 


TTCAGGCGAA 


58851 


AATCCTTCTC 


AGGGGGTTCC AGAAACCTCT 


TCTGGACCAG 


AACCTCAGCG 


58901 


TTTATTTTCT 


CTTCCTTCAG TAAAAAAACA 


GAGCGGTTTG 


GGTCGGTTGG 


58951 


TACAGACAGT 


TCGCGATCGC ATAGTACTTC 


CTAGTGGGGC 


TCCACCTACA 


59001 


GACAGCGAGC 


CTTTAAGTCT CTACGAGCTA 


AACCTCCGTT 


TGAGTAGTTT 


59051 


ACGTCAGGAG 


CTCTCTGACA TACAAAGTAA 


TGATCAGTTG 


ACTCCAGAGG 


59101 


AAAAAGCAGA 


AGCCACAGTT ACCATACAAC 


AGCTGATCCA 


AATTACAGAA 


59151 


TTCCAATGCG 


GCTATATGGA GGCAACACAA 


TCTTCGGTAT 


CTCTAGCAGA 


59201 


AGCTCGTTTT 


AAGGGGGTAG AAACTAGTGA 


TGAGATCAAT 


TCCCTCTGTT 


59251 


CAGAACTGAC 


AGATCCTGAG CTTCAAGAAC 


TCATGAGTGA 


TGGAGACTCT 


59301 


CTTCAAAACC 


TATTAGATGA GACTGCCGAC 


GATTTAGAAG 


CTGCTTTGTC 
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59351 CCATACTCGA TTGAGTTTTT CTTTAGACGA TAATCCAACT CCGATAGACA 

59401 ATAATCCAAC TCTGATTTCT CAAGAAGAGC CTATTTATGA GGAAATCGGA 

594 51 GGAGCTGCAG ATCCTCAAAG AACTCGGGAA AACTGGTCTA CAAGATTATG 

59501 GAATCAGATT CGCGAGGCTC TGGTTTCTCT TTTAGGAATG ATTTTAAGCA 

59551 TTCTAGGGTC CATCTTGCAC AGGTTGCGTA TTGCTCGTCA TGCAGCTGCT 

59601 GAAGCAGTGG GTCGTTGTTG CACGTGCCGA GGAGAAGAGT GTACTTCTTC 

59651 TGAAGAGGAC TCGATGTCGG TGGGGTCTCC TTCAGAAATT GATGAAACTG 

59701 AAAGAACGGG CTCTCCGCAT GACGTTCCAC GCAGAAATGG AAGTCCACGT 

59751 GAAGATTCTC CATTGATGAA TGCCTTAGTA GGATGGGCAC ATAAGCACGG 

59801 TGCTAAAACC AAGGAGAGTT CAGAATCAAG TACCCCGGAA ATTTCGATTT 

59851 CTGCTCCCAT AGTGAGAGGT TGGAGTCAAG ACAGTTCCGT CAGTTTTATT 

59901 GTTATGGAAG ATGATCATAT TTTCTATGAT GTTCCTCGTA GAAAAGATGG 

59951 AATCTATGAC GTTCCTAGTT CCCCTAGATG GAGTCCTGCG CGAGAGTTGG 

60001 AAGAGGATGT TTTTGGAGAT TATGAAGTTC CTATAACCTC TGCTGAACCA 

60051 TCTAAAGACA AGAACATCTA CATGACACCT AGATTAGCAA CTCCTGCTAT 

60101 CTATGATCTT CCTTCACGTC CAGGATCGTC TGGAAGCTCA CGTTCTCCGT 

60151 CTTCAGATCG CGTACGAAGC AGCTCACCAA ATAGACGGGG TGTGCCTCTT 

60201 CCTCCAGTTC CTTCACCTGC TATGAGTGAG GAGGGGAGCA TTTATGAGGA 

60251 TATGAGCGGT GCTTCAGGTG CAGGTGAAAG TGATTATGAA GATATGAGCC 

60301 GTTCCCCCTC TCCTAGAGGC GACTTGGATG AACCCATATA TGCTAATACT 

60351 CCTGAAGATA ATCCATTTAC TCAGAGAAAT ATAGATAGAA TTTTACAGGA 

60401 GAGGTCAGGC GGTGCTTCCG CTTCTCCTGT AGAGCCTATT TATGATGAGA 

60451 TCCCATGGAT TCATGGCAGG CCCCCTGCTA CACTTCCAAG ACCCGAGAAT 

60501 ACATTGACTA ATGTTTCGCT TAGAGTGAGC CCAGGGTTTG GACCAGAAGT 

60551 AAGAGCCGCT TTGCTTAGCG AGAGCGTGAG TGCTGTTATG GTCGAAGCAG 

60601 AGAGTATTGT TCCTCCAACA GAGCCGGGGG ACGGAGAATC AGAATATCTA 

60651 GAGCCCTTAG GGGGACTTGT AGCTACAACG AAAATCTTAC TACAAAAAGG 
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60701 ATGGCCTCGT GGAGAGTCGA ATGCTTAGGA TTTAAGTAGT TCTTTCGAAT 

60751 CTCTAGTGAG GATGTATCGG GTTCTTAATT TTTATGGGGG AAACGTATCT 

60801 GTGGATTTCC CTTAGCTTCT CCCATAAGAT TCATGATGGT AGAGAGTGTT 

60851 CACTACTGTC TATTTGGTCT TTACAGGTTG CATTGTCTAT ATAACATGCT 

60901 TTTAAAACTT AAAGGTCGTT CCTGCGTGAA GGTAATGTGT CGTCGTTGAT 

60951 GAGGATACCG AGCCTTGATA GTCTAAGAAC ACCGAAAGTT TAGGGAAGAT 

61001 AAAAATTTGG TTTCTTCCTT TAAAAGCAAT GGCATTGCGA GCAAGGGTGG 

61051 TTCCTGATAG GAGCCATGAC GATCCACTAG ATTCTAGACT CACGTTGATC 

61101 TCAGGATTTT GTTGGTAGAG GACAGGCTGA TAAGCAAGCT CTATGTTCCA 

61151 ATAGGTAGGA AGACGGAACT TGGATTCCCA AGCGCTCTGA ATTCCCAGAG 

61201 GGACTGTCAG GTTATATAAG GGTTTATGAA CAGAAAATTT TCTAGCTTTA 

61251 TCTCCACTTT CTTGAAACGC AGTTTGATTA GAACGAACGG CAATTGCTTG 

61301 GATAAAAGGA GTGAAGTGGA GAGGTCGTGA TCGCCATTGT AGAGATAGAG 

61351 AGCAAGAGAG AGCCGCCCCT AATGTCGTAC TATAACATTT GCCTTCCGTT 

614 01 TGTATTTTTC CAGAATATCC AGATGCTTTG ATATGGTGGT TGCTGTAGCT 

614 51 GTAGGCTAGA GATGCAGATG TAGAGAATCT CTCTTGCAGC CAAGGATTAT 

61501 TGATCTGGAG CGCTACAGTT GTCGTATGCG AAGCCACGGA ATTGTCGGAG 

61551 TGGCTCTCGT AGAGATTACT GAAAAGTTGG GAGAAGTTTA CACCAAAGCT 

61601 ATGATTAGAA GCAGTGTTTG AGGTTGTTCC CAAAGAATAA CCCGTAGCTT 

61651 CCATATGGAA TCCTTTCGCA TCATTGTTGC TATTTTGATG CACGAAGAGT 

61701 CGAGTAGCTT CTCCAGAAGC TGTAGGTGCT ATTTGGCCTT GCTGTGTTTG 

61751 ATAACGTAGT GTCGCAAATA AGTTATGGAA AGATTGCCAG AAGGCAGATA 

61801 GGGCAATGTC TCCTTTGTTT TCTGGGTTTA CCTTATATCC TGTAGGTGTC 

61851 CAATCACCAT AAAGCTGGCG ATGTAAAGTA TTCACAGTAT CTTCAGAAGA 

61901 GGTATCAGAA GTTGTGATTG TTTCGATCCA GTAAGGGGAC CAAACGCCTT 

61951 GGTAGCCGTA GTGTTGAGTT GTATTTAGAC CCTCAGGGTA GAAATTATCC 

62001 GTATTAATAT GTTTAGCTGT GACGTCTAAG AGATACAGAA GAGGAACTTC 
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62051 TGCGATAGGT TGGGCAAGGT CTGCAGTATC ATAGGGATCT AGGTTCTCGT 

62101 CATCCAGTAG GCTCAAAGGT CCTGAGAGAT TGATTATAGG GTTATTATCT 

62151 TCGCTATAGG GTGCTGATGA ACCTGTGGGG CGAATCCATA GCTTGGGAGC 

62201 AACTCTGTTG CCTAAGATAG AGGGAAGGTT AATTGCAAGA TTATTGATGT 

62251 TAATTACAGA ACCCACACTA CTGCTACTTT GTTCTTCGTC TGTTGTAGAA 

62301 AACACAGCTC TACTGCCTAA CCGTAGAGTC CCACCAAATT GATCAAATTT 

62351 ATAGACTTTC CACTCTGCTC GATCTTCAAG AGCGAGTGTG CCGTTGTACA 

624 01 GTCCAATGTG GTTTCTGAAA TGTGAAATGA AGTCATCACG AGAAGTCGAT 

624 51 GTATCCGGAA TATATGTTGA GGAGAACAAG ATAGTTCCGA GGTGTTCTGG 

62501 ATTAGGATTA AATTTTTGGA TAGAGTTTTG TATAGTATAT CTTTGTAGTA 

62551 TGGGATCATA GAAGGTAGCA GAATGACCTT GACTTGCTCC AACTGTTAAT 

62601 GAGACATTAC GCGTGCAGTT TACAGAAACA TGATTGCTGA AAGTATCTTT 

62651 GAAGTGTCTA TTATTATAAA AAATAATATC TCCCTGATCA GCAAATAAAG 

62701 TGCATGCACC ATCTTGACGG AGCATGATAG CGCCGCCCCA AGTTCCCTGA 

62751 TTGTTTGTGA AATAGACGGG ACCACTGTCT TGTATAGTTA GAGATTGTGT 

62801 ACAGATAGCA CCTCCATCTC GTGCTGCAGT ATTATTATCG AAGGCTGCAA 

62851 TTCCTGGGTT GTCTTTTATA GAACAACTAA TGCAATAGAT AGCCCCTCCA 

62901 GAGGAATGGT TAGCAGAGAT GTCCGCTTCC ATGGCAAAAT TATTGTTGAA 

62951 GATCACAGAA CCGGTATTCT TTGTAAGAAT GCACTCTTGA TGTACTCTTA 

63001 TTGCACCACC CAGACCTGAT TGGTTATTCA AAAAATAGAT AGGCTGAGAA 

63051 TTATTCTCAA TTCTACAAGC ATTAGCGAAC AACGCGCCCC CCGCTGTTCC 

63101 GCCTGCAGCA TTATTAAAAA ACAGGCAAGG GCCAGTGTTG TCCTTAATGT 

63151 TTATGATTGC 'AGCTTGGATT GCTCCTCCTG AAGATTTTGC CTTGTTGTTA 

63201 ATGAAGTATG CGGTTCCTTG ATTTTTTGAG ATTGTAACAT TTTTCGAACA 

63251 TAAAACAGCT CCCCCTGTAC AAGTATCAGC GAAATTACTT GCATTAGGAA 

63301 AGCTTAAATT CCCAGAGAAA ATGATGGAAC CATGATTCTC AGAAAGATCG 

63351 AAATTACCAT TC A CAT AC AT CGCACCAGCT CTTTTAATAG CAAAGCTATT 
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634 01 TAGGAAAAGA ATTTGGTTTT TTGTATTCGT TATGGCAAGT GATTTGCAAG 

634 51 AGAGAGCACC GCCGTCTTGA GAGAAGTTTT CGAACCAGCT TTCTATGGAA 

63501 TTCTGGTGAT CGAGGACAAT GTCTTGGTTA GTGTCATCCC TAACTCCAAA 

63 551 AAGTGTTGCT CTATGAGAGT AGGGAGTCAT GTTAGTAAGA GTATCAATTA 

63 601 GAGGGAAGAG TGTTGTGAGT TGATTTGCTT GATTATCAAA ATAGTCAGAC 

63651 AACGGAGTCG CATTAAGGAG TATTGTAGTT TTACCTAAAA TTAAGGCTCC 

63701 AACAAAGAAG GAAGATTTAC TAAGGGATCT GTTATTTTGC ACTGTTTATT 

63751 CCTTGATGTT TTCTTTGTGG TTAAAATGTG CAATGACTCT CAGCTTTAAG 

63801 GTATTGACCT ACAGTTGACG AGGAGACTTG AGCTGAATAA TCTAAGGATA 

63851 AGGTTACTCT TGAGAAAAGT TGGGAAGTAT TTTTTATTTT TGCAGCTACG 

63901 GAATTATAGG AGACGGGAGT TGCTTGTGTT GTCCATGTTC CATTGCTGAT 

63951 GAGTAGTGTC GTGAACATTT CTGGATTTTT TCTGTATAGG GTAGGTACGT 

64001 AGGATATTTC CGTAGTCCAT AGCATGGGGA TATGATGTGA AGTTTTCCAT 

64051 TCAGAACGGA AGCCTATGGG AGAGGAAAGA TCTGTAAGGG GATGTTTTGG 

64101 ATGGAATTTT CTTATATGGT CTCCAGTTTC TTGGAACGAG GCCTGGGAAC 

64151 AGCGCAGAGC AATGGCACTG ATAAAGGGCT GGAGTTCGAG AGTGCGGGTG 

64201 ATTCTAGCTG GTAAGAATGT GCAGTCTAGA GAGGCTACCA AAGTGTGGTT 

64251 ATTAAAGAAG GCTTTGGACG ACCCTTTTAA GATTTCTGTA TAGTGGCAAA 

643 01 GCATATGGTG ATCTCCGTAG CTATAACCTA GGGATAGCCC TGTAGAGATG 

64351 AAGTCCCTGA AGAGGAGACT GTCGAAGCGG AGTCCTGCAA AGTAGTTGTG 

64401 GGAGGAAGTC GTACTTGGAG ATTGACGTTC TCTAGTTTTG GAGAACATTT 

64451 GTGCGAATCC TAAAGAGAAA CTATGTCGTG CTGCAGTTTT TGCTGAGGTT 

64501 GTTGCTGCAT AGCCCGTAGT ATGGTTTCGG AAGCCTTTGC GTCCCTCGCG 

64551 ATTATGTTGG TTAATTAGAA GCCCGAGTCC TTGCAGAGAG GCTTCAAGGT 

64601 CATGCTCTTT GAGGTTTTGT GGAGGTAAGA TGCGGATTCC TAACAGAGCG 

64651 TTATAGGCAG ACTGCCATAA GGTATTAGCA ATAAATTCTC CGTGACGTTC 

64701 CGGGTTAGGG CGGTATCCTA CAGGAGTCCA GTCTACGTAG AGCTGCCTGT 
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64751 GGTTTGTATT GGTCTGTTCC GGTACTGTAG AGCTTGTTGT AGTCGTAGTT 

64801 TCCATCCAAT AGGGAGACCA GATTCCCTGA TATCCATAGT GCTCATCTAA 

64 851 GTTCATGGCT TCTACAATGA GATTCGAAGT ATCGATTTTT TTTGCAGTCA 

64 901 CATCGAGGAG GTAGAGGAGG GGGGATATCC TTTCGAGGTT CAGAGAGATC 

64 951 TAAGCTATCA TAGGGGTTTT CATTTTCATC GTTTAGAAAA GTCAAGGGTC 

65001 CTGAGAGAGT GATAGTAGAA GAAGTGTCTT CAGAATAGGT GGATCCTGTT 

65051 AATGTAGGAT AAATCCAGAA CTTTGGAGCT GAGGCTTCTG ATTGTAAAAT 

65101 AGAAGGAAGA TTGATCGCGA TTGCATTAAA ATTTATGGAG CTTCCCGGGC 

65151 CTTTCGTCCT GATTAATGCT GCGTTTCCTA AACGTAGAAT GCCCCCAGTT 

65201 TGCGATAGGG TTTTGCAAGA AATAGCAGCC CGATCTTCAA TAGCGAGCAC 

65251 ACCCCTTTCA AGTCGTGAAG AGTTAGAAAA TTTTGATAGG AAGTTCAATG 

65301 GATTTGTTGC GTTAGAATCT ACATTGATTC CGGAAAACAA CACGGTGCCA 

653 51 AGGTGATGGG GTTCATAATT AAATACTATA GGATCTGTTG TCGTCTGATC 

65401 GTGATCTATA GGATCATAAA AGAGAATTTT ATAACCCTGT CTTGCTCCTA 

65451 GTTTTAAGTT AATCCCCGGA GCAGCATAGA GTGCATTTCT ATATCCGGGT 

65501 TGAGGAGAAG AAGATGTGAT TGTATTATTG TTAAATAGAA TATCGCCGTA 

65551 GTCTGCAGAG AGGAAGAAAT TTTGAGGAGT ACTTCCTATA CCAGAAAGAT 

65601 TGATGAGAGC CCCTCCTGAA GTCGCAGAGT TATTAATAAA TGCTGTCGGA 

65651 CCGTTATTTT GGAAGATGAA AGATCTCGTG TGTATAGCTC CGCCGCTAAG 

65701 TGCTGCCGTT TTATTGTTAA AGATAAGACC TTTGGGATTG TTCTCAATGA 

65751 CTAAGGAGGT ACACATGATA CCGCCACCAC CGGGATATAG TTTTCCTGAT 

65801 GCTGTGTTAA TTGAGGATGC GGAGTGATTG CTGATCTCTA TAATTTCTTT 

65851 ATTTGAAGAG ATAATGACTC CTAGGGCAGA AAATATACCA CCCCTTGTCC 

65901 TGAAGCATTA TCATTGATTT GGATGTTTTG ATAATTCCTT TCTATATTTA 

65951 CGGCTCTGCA GAAGATGCCT CCTCCAAAGC TGGAATCTTC TAATGTTTGA 

66001 TTTTTCTTGA TAACGATAGG ACCTAAGTTG TCTGTGATTG ATAAACTCAC 

66051 CCCAACATAG ATAGCTCCTC CTCTATTTTT AGAGACATTA TCTAAGAATT 
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66101 GTCCTTCTCC TAAGTTATTT GTAATATAGC TATCTAGTGC ATGTATAGCC 

66151 CCACCAAATC CTGAAGTTGT AGTAGTAGCT AAGGAAGCCG CATTTGAAAT 

66201 AAACGAGTAG TTCTGATTCT TAGAAATCCA GCATTCCCGA ACACTGTAGA 

66251 GAGCCCCTCC AGAACTGTGG GAGCTATTCC TTTCAAAACT TAAGTTTCCT 

66301 TTATTTTCAG AAATGAACAA ATTTTTACAT GCAAGAATGC CTCCATCCTC 

66351 GTAGTGGTAG TTATAATCTA TAACAGAATT GATAGAGTTT CCTGTAATTG 

66401 TGATGTCCTG GGTTATATCG TGACCGAATC CATTAAGAAG ATTAGAGGGA 

66451 CGAAAGGAGG AAACCAAGGG CTCTGGAGTT ACATTCAGAC GG AAGCTTGG 

66501 AGACGTGTGG AAAGCAGAGA TGTCTTTTCT AGACATCTGA CAAGAGGCGA 

66551 GGTTAGGGAC TTCATTTCCT GATAAGGAAC AACATAGCGC AGTGGATAAA 

66601 ATGCTGAGAC AAATGGGTCG CATAAACATC CAAACTTTGA TATGAAAATA 

66651 GAGTTTGTCA TAAAAAATCA TCAATTTGTT GTCTAGAAGT TTATAAATTT 

66701 ATATTTAAAT ATGCTTATTA ACATGTTGAA TATTACAGAG TTAGTAAACT 

66751 CATATATTTG AATGATTTTC TATATTTAAA TTTTGAGGGA GTCCTCTACC 

66801 GATTGGGCTA ACGTAGCTCA TCGGATAGTT GTTAATTCTA AAGATTTCAA 

66851 TTTTATCGTT CTTTTATTTT AGAATTTTAA GGTACTTCCT GCTTGGAGAT 

66901 GGTGCGTAGA TGTCGAGGAG GAGACCGATC CTTGGTAATC CAAGAATAGA 

66951 TCGAGAGAAC GGAAGAGCGC AGTTTGATTG TGGACTTTGT ACCCTAAAGC 

67001 ATTGCGAACA TAGTTATGGC CTAGGATATC CCAGGAACCT CCGCTCGCAA 

67051 GTAGCGTGAC ACCGATTTGG GGATTTTGTT GATAGAGTAC CGGTTGGTAA 

67101 GAAAGTTCTA GAGTCCATTC TGTAGGTACG TGGAATTTTG ACTGCCATTT 

67151 TCCTTGGATT CCTAGAGGTA AGGTCAGATT ATAGAAAGGC TTTTGAGAGA 

67201 CAAACTTTCG GGGATTGTCA CCAATCTCTT CGAACGCTGT TTGGTGAGAA 

67251 CGTATTGCAA TTGCCTGAAC GAACGGGCTG AGGTGAAGAT AGGATTTCTG 

67301 TTGCCAAGGG AAAGAACAGC CGATAGCTGC TGCTAATGTA TGGCTATAAC 

673 51 ACGTCCCTTC TGCCTGTTCT TGATGTGAGG GATGTAGGCT GTGGAGGTGA 

67401 TGGTCCCCAT AGCCATACGC TAACACTGTG GATGTTGCAA AGGCCTCTTG 
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67451 GAACCACGGA AGCTCAACAT 

67501 CGTTGTTGCT TGATCCGATT 

67551 CCTAAGGAGA TTTTCTGATG 

67601 ATACCCTGTA GATTGGATAC 

67651 GATGAACAAA GAGGCCGTCG 

67701 TCGATATCAG AATCACCAGT 

67751 AGTATGAAAG GATTGCCATA 

67801 CAGGATTGAC CTTATATCCT 

67851 TAGAGGGTGT TTGCCGTCTC 

67901 CTCTACCCAA TAAGGAGACC 

67951 CATTTAAGCT TTCAGGATGA 

68001 ACATCCGATA AAGAAAGAAG 

68051 TATACTGTCG TAGGGATCGC 

68101 CTGATAAAGT AATTGTAGGG 

68151 TGTAGAGGAC GGATCCACAA 

68201 GGGGAGGTTA ATCGCAAGGT 

68251 TTGATGGAGT CTCAGAGTTG 

68301 AATTTAAGGA TACCTCCTTT 

68351 ACGATCCTCG ATAGAGAGGA 

68401 AGCTGCTAAT GAAATTATTT 

68451 GAAGAAAATA AGATCGTTCC 

68501 AGGATTTGTA GTTGGATGTT 

68551 TATACCCCTT ATTAGCTCCA 

68601 ATGGCGTTGT ATCTACCAAA 

68651 TGCGATATTT CCTTGTTCCG 

68701 GCATAAGAGC ACCTCCCCAG 

68751 CCACTATTTT TGATTGTCAA 



PCI7US99/26923 

AAAGTGAAGA GACTGTATTG TGAGCCGAGA 
TCTTTAGTGC GGGTGAAGAA CTGTGCAAAA 
TAAAGAAGTT TCGGAGGATG CTTGTAAGGA 
GGAATCCTGG AGCCCCGGGG ATGCTATTTT 
GCAATCCCTT GAATTTCTAA GAAAGGCCTC 
TCGATTATAA CTTCTTAATA GAGAGAACAT 
GGGGAGTCGT AGCAAGATCT CCTTGGTATT 
AAGGGAGTCC AATTGGCATA CAGAGCTCTG 
TATAGAAGCG TTATTTGTTG TTGTTATCGT 
AGATGCCTTG ATAACCGTAA TGCTCAGTCG 
AAGTTATCGG TATTGATATG ACGTGCTGTT 
ATGAATGTTT TGTAAAGGCT CAGAGAGATC 
GGTTTTCCTC ATTTAAGAGT GTCAGAGGAC 
TTATTGTCCT CTGTGAAAGG AG C ACT AG AT 
GGTAGGAGCT TTTCCTTTTG CTAAGATCGA 
TATTAATGAT GACCTGGGAG CCTACACTAG 
GCAGTTGTTG CAATACTCGC CGCATGCCCT 
TTGAGTGAAC TTATAGAATT GCCATCCCGC 
CACCATTGCG AAGTTCAGAG GTATTTTTCG 
TCGTAGTCAG AAGCTTCTGG GATATAGGCT 
CTGATGGTTC GCATTGGGAT TAAAGATTAG 
GGTGTTCTAT AGGATCAAAA AAAGCAGTCG 
AGTTGTAAGT TGCTATTTGG TGTACAATGT 
TGTGGTGAGG AAAACCTCAT TATTTTGAAA 
CGAAGAGTAG GCAGGTGCTG TCCTGTAGGA 
TTTCCTTGAT TGTTGGTGAA ATATACGTGG 
AAATTGTGTA CAGATAGCTC CGCCATCGCG 
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68801 AATGCAGTAG TTATTATTGA AAAGAATAGT TCCAGGGTTA TCGTCTATGG 

68851 ATAGGTTTGT TGTATAAATC GCCCCTCCTG AACCATTTCC TGAATTTATC 

68901 GAACCAGATA ACGCTGTGTT GTTATTGAAA ATCACCGACC CGGAGTTATT 

68951 TTTTATCGCA ACAGTAACGC TTGTTTGAAT GGCCCCGCCA TTGTTCCCAC 

69001 AGTTGTTCTT AAAATAAATA GGACGCGTGT TATCAGAGAT CGTTGTATTT 

69051 TCACTACGAA GCGCACCCCC TCCACTAGGG GCTGTATTGT TAAAAAAGAG 

69101 TAGAGGTGCC CTGTTGCTTT GGATGCGGCA GTGTCCATTG GTGGAGAGGG 

69151 CTCCTCCCCA GTTGTTGACG GAATTGTTGA CAAAGTAGAA AGTCCCTTGA 

69201 TTTTGAGAAA TCGTGAAGTC TCCATTACAG GCAATCGCAC CCCCACGAGT 

69251 TTCTCCTCCT GTACTCGCAT TGTTAAGACC TCGATTGCTG AAAAAAATAA 

69301 GGGGTCCTCT ATTCTTCGTG ATTGTGCAGG CTCCCTGGCA AGCAATCGCG 

69351 CCTCCAGTCC CAATCGCGAG ATTTTTACTG AAGAAGGCAT GGTCTTCAAC 

69401 ATTTGATAAT AAGAAATTAT TACAGGACAC AGCTCCCCCA GCCGATGTCC 

69451 AAAGAAGAAG GATGTTATCA ATAGACTTGT AGTTAGAAAG TACAATGTCT 

69501 TGAGAGGAAT TATGTCTATT TCCAACAAAC GTAGTTATTG GAGAAAATCC 

69551 TGTAAGAGTG GAGAGAGAGT CTAAGAGAGG AAAGCTCGTA CGAAACTCTT 

69601 CATCCCTCTC TAAAGCAAAC TTTTCAAGGG AGTCCGTTTG TAAACTATAC 

69651 ACTGCAGGAG TCATCCCGAA CATGCAGGCT GTGAAATTCC CGAGATAGAA 

69701 TAAAAACTTA GGAGGAGTCT TTGACACTAG AGGTTCCTTA ATCTTTCTGT 

69751 TTTGGTCACT TATTGTTAAA ATCTCATTCT ACTCGCCACG TTTAAGTAGT 

69801 GACTCAGCGT GGAGGAAGAA ATATCCGCAG AGTAATCTAA GGAGAGAGTG 

69851 ACTTTAGGAA ACACCTGCAT GGTATTTTTC ACTTTGATCC CTAAAGCATT 

69901 GTAGGTCACA GGAGTGGCCT GCGTCGTCCA CGTACCTTGG CTAATCAGTA 

69951 ATTTCGAGTG GAGTTCAGGA TCTTGCCTAT AGAGAGTAGA GCGATAGGAA 

70001 ATTTCTGTGA GQCAGACTAG GGGAACTCGG TGGTGGTTCT TCCAAGAAGC 

70051 GCGGATTCCT ACAGGGAGGG AGACGTCCGT TAGGGGGCGG TGTAGGGAAA 

70101 ATTCCCGAGC ATGGTCTCCA GATTCTTGAA ACGCAGCAAG ATTTCCTCGG 
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70151 ATGGCTAAGG CAGTAATAAA GGGATAGATC TGCAGGGACT CGCCGTGAGG 
70201 TTGAGGTAAG AAAACACAGG AGAGAGCCCC TGCTAAGGTA TGGTTGTGGA 
70251 AAGATCCCTG AGAGTTCCCT TCCAGGAGAC CCTGATACAT TGTATGGGTA 
70301 TGTTCCGAGG TAAACATATA AGCAAGAGAC ACAGATAGAC GTATCCACTC 
70351 TTTGAAGAGA GTATTTTCTA TGCACATTCC AGAGAAATAG TGGTGAGAGG 
70401 ACGTGCTATT TTGAGATTCA TGTTCTTTAG CTTTGGAGAA GAACTGAGCA 
70451 AATCCTAAAG AGAAATTCGG ACTTTGAGAA GAGGTTGCTT CGGTGGTAGC 
70501 ACTATAACCT GTCATATGAC TACGAAATCC CTTAAAACCG TTTTTGTCTT 
70551 TTTGATGAAC CAGAAGACCA ATGCCTTGTA GGGAAGCTGC ATGACCCTTC 
7 0601 TCTTCATCCC AGGAGGAGAG GGAGTGGAGT CCTGCAAGAG CCGTATATGC 
7 0651 CGATTGCCAC AAGGCATTCG TAATGAATTC TCCTCGACGT TCGGGATGAG 
70701 GACGGTAGCC TAGAGGAGAC CAGTTTGCAT AGAGCAGCTT GTGTTTTGTA 
70751 TTCGCGCCTA GTAGAGATGT AGGGTTCGTG ATTGTTGTAG TTTCTACCCA 
70801 ATAGGTCGAC CAGATGCCTT GATACCCATA GTGTTCGCCA GAATTTAATG 
70851 TGGATAGATC CAGTTGCGAA GAGTTAATTT TTTGTGCAGC GACATCGACA 
70901 ATATAAAGAA GGGGAACTTT CTCAAGAGAG TGCGAGAGAT CCAGACTATC 
70951 GTAGGGATCT TCGTTGTTGC TGTTGCGTAA GGTGAGAGTT CCTGAGATTG 
71001 TGATTGTCGG GTTGGAATCT TCAGTATAGG TAGATCCTGT TTTTGTGGGG 
71051 TAAATCCAAA TTTTTGGAGC CTGAGCTTGA AAAGAAAGAA TAGAAGGAAG 
71101 GTCAATGGCA ATGTGATTTA AAGTTATAGT ACTTCCTACT GTCGTTGGTG 
71151 TTGAGGATGG TGTGGGAATG GTTCCTGCTG TCGTGATCAC CGCACCTTGA 
71201 CCTAGAAGTA GAGTGCCTCC TCGTTGGAAG AACTTATAGC AGGCCAGCCC 
71251 CGCACCATCT TCAACAGCAA GGACTCCTTG ACGTAGTTCC GAAGTGTTCC 

713 01 TTAAATAGGA AAAGAAATTC ATTTCATCGG TAAAGTTCTG GTGTACATGT 
71351 TCCCCTGAAA ATAAAACTGT ACCTGTATGA CCGGTTTCGA AATTAAAGAG 

714 01 TATGGGGAAG GAGGAAGGGA GCTCATGTTC TATGGGATCA TAGAACAGCA 
714 51 CTCGATAGCC GGGACGGGCT CCTATTTGCA GATTCATATT AGGAGTCGAG 
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71501 TGAATGGCGT TTCTGTATGG AGGATTGAGG GCATGCTTGG AGGCCGTATT 

71551 ATTGTTAAAG ATAATATCTC CATTATCTGC AGATAAGATG AAGCTTCCGT 

71601 TTCCAGAACC TGCTGATAAG TTGAGGAGAG CCCCTCCCCG AGTTGCAGTG 

71651 TTATTTAAAA AGTATACAGG ACCATTTTCT TTGATAATGA TAGATTTCGC 

71701 ATGAATGGCT CCACCGTTGC TCTGGCTTTG GTTATTGTTA AAGAGTACCC 

71751 CTTCTCGGTT GTTCAATATC GTGCAAAAGG TGGTAGTAAG ACCTCCTCCT 

71801 CCTGGATTGA AGTTCGATCC ATAGTTATTT GCGAACGCGG AATTTTCACT 

71851 GATTTCTATG AGTTTTTTAT TCGAGGAGAT CGTGAGTGTT TGGGTAGAAA 

71901 ATATGCCTCC CCCAGATCCT GAAGAGTTGC TTGTGATCTG TATAGCTCCA 

71951 GAATTTCCCT CTATATTTAG AGAGTTCCCA CTATAAATCC CTCCTCCTAA 

72001 ACTGTCCGAA TTTAGTGCCC GATTCTGCTT GATTATGATC GGGCCTTTAT 

72051 TGTCTTTAAT AGATAAGTTC GTCTCAGTAT AGAGGGCACC CCCCTTATTT 

72101 AAAGCGAGAT TGTCAACGAA AGTTCCCTGT CCTAGGTTAT TAGTAATAGA 

72151 GCAATTTATG GCAAAGAGAG CTCCACCCAA TAGTGATCCC GCAGTGGCTG 

72201 TAGGATTGTC AGAGACCAAG TTTGTAGTAA ATGCATAGTT CTGATTCTTG 

72251 GAGATCGTGC AATTTTGAGC AGCATAAATT GCCCCGCCAG AATTGGGACA 

72301 GACATTCTTC TCAAAGAAGA CATTCCCTAT ATTTTCAGAG ATCAGAAGAT 

72351 TCTTACAGGT AAGAGCACCT CCATTCGACC GATAGTACTT ATAGTCCAAG 

72401 ATGAAATCAT TGTGATTCCC GACAATTGCG AGATCTTGAT TTTGGTTATG 

72451 AGTAAACCCT ACTTGAGGGG CTGCTTGATA TTCAGGACTT AATGTAATAT 

72501 AGGTCTCCAA AGGAAGTTGG AGACCTTCAT TAGCCAATAC AAAAGTAAAA 

72551 GGAAGCAACA TTCCGAAGCA AAAAAAGCGC ATACTCGTTC ACAAATAGAA 

72601 AGAAATATTC TGAATAAATC AATGCTAGAT TTTTTGTCTA TAATTTGTTT 

72651 TATAGTGTCT AATTATGAGT AGTTTAAAGT TATCTTATTT AAGACAAGAC 

72701 GATTTTAATT TCCTTTTTTC ACAATAAAAT GCGAGTCCAA GAGTTTTAGT 

72751 AAACGTACAT CATACTTTGG AAAATGGCGG CAAGTAGTCA GAGGATCAAG 

72801 CTCTTTGCAT ACTTTGCTTA GAGCAGTTTT TGGTTCTCAG GGTCGTTTGC 
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72851 CTTTAGGGAG CTTGTGATGT TTGGGATCGG AGAGCCTCCT TATATCCCTA 

72901 GGGGATGCCC TAGGAACTCT TCCGAAACAC CGAGGGTCTG TTAGAGATAA 

72951 AAACAAAGGA CCATCGGGGA GACTTGTACT CATAAGAGCC ACTTATCTCT 

73001 ATTCCTATAG ATTTTTGCTG ATTTTGATTA TTAGAAAATA ATAGATTTTT 

73051 TCTGTTTTTT AAAGTAAAGT ATTTTTTAAA AGACTCATTT TTAATGAGTT 

73101 ATTACTTTTC TCTTTGGTAT CTGAAGGTGC AACAGCACTT TCAAGCAGCA 

73151 TTTGATTTTA CTCGCTCCCT GTGTTCACGA ATTTCTAATT TTGCTTTGGG 

73201 AGTGATTGCA TTGCTTCCTA TTATTGGGCA GTTGTATGTA GGGCTGGACT 

73251 GGCTCCTCTC TAGGATAAAA AAGCCAGAAT TTCCTTCCGA TGTGGATCAG 

73301 ATCGTGCGAG TAGAACACGT CGTGGGTCAC GACCATAGAA GTCGAGTTGA 

73351 AGATATTCTA AAGAGACAAA GGCTCTCATT AGAGCCTAGA GACGAGGGGA 

7 3401 AGGTTCACGG AGATCTGCCT TCAGCTCCTT TTTTTTGATA TCCAAAGTCT 

73451 CAAGTTCCTA CAGTTGTTCT CTGAGGGGAC AGCTCTAAAT TTATTTCGTA 

73501 TATTTGCTCC ACTACGCAAC CGTGTGACTA CAGAATACAG TCGTGCTAGG 

73551 CAACCCGACC TACATAGAAT TGCCATCGTC TATATAGGAG TTCTCGATTC 

73 601 AGAAAGTTCC AAGATCCTAG AGCGGCTAAT CTCTTATATG AGTTGTATCT 

73 651 ATTCTGAATC GCAAATGTAT TTAAGATTCT TTATGGGCAA GAATGTAAAT 

73701 CAAAGTGCTG TACTCTCAAA ATTACATGTA GAAAATCTGC ACATCCGTTG 

73751 TGGGTTTTTC AGCGAGGATG CTGTTCCAGA GAGTGAGCCC TTCGATCTCT 

73801 CCATCTACGT GCACACAGAT CGTAGCTGTC CTCTCCCTAC GAAAAAACGG 

73851 AGCAGCTCCT GGGAACTCCA AACTGTAGAA CTCCCAGAGT CAATATATCC 

73 901 ACAGTCGGAA TTCCTATTGA TGAGACCTCG AATGCTTTCG TAGACTCTAT 

73951 GATGAAACAA GGAGTCGGGC AGGATGCTAA AGAGCTATAC ACATTTCTAT 

74001 CTCGTGGGAA TGAGCATTAC CAACCGTGTC TATGGTTCAG TCTCGAAGAG 

74051 GAACTCGGAT TCCTTTTCGA TGAAAAAATG CTCTGCGCCC CTCTATCTGA 

74101 GGATCACTAT TGCCACTCGT ATCTTGTAGA TCTAGTGGAT CAACATTTAA 

74151 AGGATTTAAT ATTATCGATG TTTTTAGATC CTCAGAATAT CTCAGCAGGA 
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74201 GAACTCCTCA AGGTCTCTAT AAACGTTGGA GATTCTTTTT CTCCTCTACA 

74251 ACAGAAAGAT TTCCTCTCGA TGGTCTTACG TGATGAAACG GGAAAAAACG 

74301 TCGTCGTGGT TTTTAAAGGA GTTCTCTCCT TACCCGCAAC CCAAGTCTGC 

743 51 AAATTAGTAG AGGAATTGAA CTCTAAGGAC TACTCCTACC TCAATATATT 

74401 TTCTTGTCAC GGAGATAGTA GTCCTCAGCT TTTATTCCGT AAGGAATTAG 

74451 AGGGAACTTC AGGGCGTTAT TTTACAGTGA TTTGCGCTTT ATATCTAGGG 

74 501 GATACAGACA TGCGTAGTTT ACAACTTGCT TCTGAAAGGA TCATGGTCTC 

74551 TAGAGAGTTT GATCTTGTAG ATGCCTATGC TGCAAGATGC AAGCTCTTGA 

74 601 AAATCGATCA TACAAATTGG AGACCTGGAA CTTTCAGTCG CCACGCCGAT 

74651 TTCGCAGATG CTGTAGACGT ATCAGCAGGA TTTAACTCAA GAGAATTTAA 

74701 ACTGATTACG CAGGCGAATC AAGGGATCCT AGAGTCTGGA GAACTCCCGC 

74751 TCCCTTCAAA AACCTTCTGG GAAGGATTCT TAGCATTCTG TGATCGAGTG 

74801 ACTGTCACGA GACACTTCAT TCCAATGTTA GACGCCGCTA TAAAGCAAGC 

74851 GGTATGGACT CATAAACATC CCAGCTTGAT AGATAAAGAG TGTGAAGCCC 

74 901 TAGACTTGAA AACACAGTGC TTGCCATCTA TCGTATCGTA CCTTGAATAT 

74951 GTCACAAACT CTCACGAAAA AACATCGAAA GGCCCGTTCA TACAAAAAGA 

75001 GATTATCGCA GACTGTTCTC CTCTTAAAGA GGCGCTCTTC CCAGGTTCTG 

75051 ATGAAGATGT TCCCTCTACC TCTGAGGATC CTTCAGATGA TCATCCTTCG 

75101 GATCTTGAAG ACTCTTAATT AGTTGCGATA GAATTCAATT TTTTATATAA 

7 5151 AAACTATCGT GTTGTTCTTA TTAAAAGATA GTTAATTTTC TATCTTTTTT 

75201 TAAATCTTTA TATAGCCTGC GTACGCTTTC ATTTTCAATG TTGGTTTGAT 

75251 CCTATGGCAT GCTATATTTC TATTTGGATA TCTACAGTTA AGCAGCATTT 

75301 TATTAGGGCT TTTGATTTTA CACGTCCTCT TGGT.TCTCGG ATTACAAATT 

75351 TTGCTTTGGG GGTCATCAAG GCTATTCCCA TTTTAGGATG CGTTGTTATA 

75401 GGGGTAAGTT GGCTAGTTTC CACATGTTCT GCACGAAGGT TTGGGAAACC 

75451 GGCATTTACT TCTGACGTTG CTAGTATCGT GAAAATAGAA AAAACTCGAG 

75501 GTTATAATCC CCTTGCTTGG GTGGAACAGT ACTTGAGACA GCTTAGGGTT 
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75551 


CGACTTCCTG 


AAGGAGATTT 


AGGAAAAATC 


CATGGGAAGG 


TCTCCAGAGA 


75601 


TTATGTTTGC 


GACAGGACTC 


CCCAAGAAAA 


TCTGAATATG 


GTTCCTCATC 


75651 


AATATCTGGG 


AGAGCTAGGT 


CGCGCGTTTT 


ATGGAATCCG 


CAACCGAGTA 


75701 


ACCAAGGCGT 


ATCAACGAGT 


CACTCCTCTG 


GAAGTCCCTT 


GTCTTACGCT 


75751 


CGTCGGTTTT 


GACATTTTAG 


ATCCCGAAGA 


TCAGGTGAAT 


TTCGTTCGTC 


75801 


TGGCTAACGG 


CATACAAACT 


CAGTACCCCC 


AAACTCAAAT 


AAAACTTTAT 


' JO 


TTAATCTCTA 


TCCAAAAGAT 
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ATGGAATCAG 


TGTGACGGTA 


CGATTTCTCA 
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AGAAAAAGAA 


CAGCAACTCC 


GCTCTCTAGG 


TTTGGATGCT 


AAAATCAAAT 


75951 


GTGTGTCGGC 


CCCCGCTCTC 


CTGCTCCAGA 


AATATCTTCA 


ATCCGAGAAC 


76001 


TTGCCTTCCT 


GTGATCTTCT 


CATTAATTAT 


TACGGGAAAC 


AACAGTCCGT 


76051 


CAGAGACGTG 


GACTCTATAA 


AGAGTCTACT 


CAATCTTTCT 


TCCGAACATA 


76101 


TCCCTGCGAT 


TTCTGTAACC 


TATAGACCTG 


ACGATCCTTT 


TTATAGCTAC 


76151 


TATTTPTTTC 

lnl X X w X a l v 


CTGGTTCTCA 

w 1 WW i. A w i- w *• 


AGGAGGAACG 


GCACCCGATC 


AGAGGATCCC 


76201 


TTGGAGTGAG 


CAGGAGCATC 


TTCAAACGTA 


TACCACCCTG 


TCTAACCCTA 


76251 


GATGTGATAG 


ATATGCTGTT 


CACTTGGGAA 


TGGAAGATTT 


TGCCTCTGGA 


'DJvl 


GTATTTTTAG 


ATCCTCTTAG 

I* 1 W W A w A A /AW 


GGTTTCGGCT 


CCTTTATCTG 


GAGAGTATTC 


; U J J 1 


CTGCCCCTCA 

w X \J w ww w 1 w<» 


TACCTCTTAG 


ATTTAAAAAG 


TGAAGAGCTT 


CGTTGTTTCT 


76401 


TGTTATCCGC 


TTTTATAGAT 


CCCAACAATT 


CTGGTCAGGG 


AAATCCGCGT 


764 51 


PPTATGTCCA 

w w X t\ X \3 X w w*» 


TAAACTTTGG 


AAACTCTCCT 


TTGGGTCAGA 


GGTGGTCTGA 


' O Jul 


OX X X V. X f \ X V» X 


CGTGTTCTAC 


ATGATGAAAC 


AGAAAAGCAT 


GTGGCTGTAG 


76551 


TPTGPAATAA 


TCCACAACTT 


ATAAAAAAGA 


GTTTTCCCTC 


ACATTCTTTA 


76601 


TCTCTATTAG 


AGAACGAACT 


GGAAGAGTCA 


GGTTATTCTT 


ATTTGAATAT 


76651 


CGTTTCAGTG 


AGTCAGGAAC 


GCACGTGTGT 


TAAGGAACGT 


AGAATTTTAA 


76701 


GTTCTGATCC 


TTCGGGGAGG 


TCATTCACTG 


TAATCCTCAC 


TGATCTTCCT 


76751 


GAAGGGAGTT 


CGGATATCCG 


CAACTTGCAG 


CTAGCGTCAG 


ATAGGATCTT 


76801 


AGTTTCTAGT 


GCTCTCGATG 


CTGCTGATGC 


CTGTGCTTCT 


GAATGTAAGA 


76851 


TCTTAGAATA 


TGAGGATCCC 


GAGCAAGAGT 


GGGCGCAACA 


GTATGCGTCG 
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76901 TTCTATAGAA ACATCGACAG GGCAGGCGAT CTTCAACGTC AGGGGATTCC 

76951 AGGAGAGCCT TTAGGGGTCT CAGCATCTAC GAGAGTAGTT TTAGAAAAGG 

77001 ACATCGTATT CAATCTCAAT GCGGTAATCC AACAGGCCAT GTGGAAGTTT 

77051 AAAAAACGGG ATCTTTTTGC TGTAGAAAGT CAGGCTTTAG GAGATGACAT 

77101 GCGACGTGCT TTAGAAGGTT ATATCGGCAG CAGTCTCTTA GTTGAGGGGA 

77151 CTATACAGCC TCAAGTCGCA TGTAATGTCA ATGTGAGTTT TGCTACGTTA 

77201 GACGAGGCTG TGTGTGCAGC TTGTGACTCA GCTCAAGATG CACCTTCTGA 

77251 GGAGAACAAT ACAGATGACT AAAGATCGCA ATCTTGTGAA CGAAATCGCA 

77301 GATTGATGGG AACTAATTAG ACACACCTTT CTAAGGTGTT TGTTTTGATG 

77351 AACCTTTTTA TTAGTCCAGC AGAGCTCTTT TTTGAAGATT CTTCTTTTTT 

77401 CTTAGGTCAT TCTGGGTTTT TTGAAGGTAT CGAGGGTTCT TATTGTCTAG 

77451 TTGTCTATAG AGGGTATCGA GGTTTTTTCT CTTAGGTATC CCACGATTCT 

77501 TTTGTATAGA AAAATTTTAT GAAAGCTTGA ACTCTTTACA CTGACTTTTT 

77551 ATTTTTCAAA TAAAAACGTT TTTAAAAATA TTATTATCAT AATTAGATAC 

77 601 TTATTTGTTT TAATGTCTTA TTTGATTAAA ATAACTTTGT TAAAATTTTT 

77 651 ATACATAAAT TTCTATTGTG GCTTGTCCAA GTATTTCTTC TTGGTTTACT 

77701 GTCGTTCGAC AGCATTTTGT AAACGCCTTT GATTTCACCC ATCCCGTTTG 

77751 TTCTCGGATT ACAAATTTTG CTTTGGGGAT CATTAAGGCA ATTCCCGTAT 

77801 TAGGACACAT TGTCATGGGA ATCGAGTGGT TGATTTCCTG GATTCCCAGA 

77 851 CACACCGTTC GTCATGGAAT GTTTACTTCT GATGTCTCTA GTGCTATTAA 

77901 AGTAGAACAA ACACGGGGTC ATAATTGTTT AGCTCCCCTA GAAGCCTATT 

77951 TAAGTAGCTT GAGAGTCCCC ATTTCCCAAG AAGATCTAGG CAAAGTACAC 

78001 GGGAGAACCC CAGAAGATCC CTTCGTAGAT ATCACACCCA CAGAAATTGT 

78051 CCAACTTCTC CCTGATGAAG AACTCTCTAC TGTAGATGAG GCACTGCAAG 

78101 GCGTTCGTAG TAGGTTAACC TATGCCTATA GGTCCGTAGA GAAACCTATG 
78151 ATTCAAGATC TTGCTCTTGT GGGTTTTGGT CTCCGAGATT CTGCGGACCT 
78201 CATAAATTTC GTGCGTCTTG CTAATGGCGT GCAGAATCAC TATCCCCATA 
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78251 CTAAAGTGAA GCTCTATTTA GCGAAGAACT TGGCAGATGT CTGGGACTGT 

78301 GAAATTTCTG AAGAGGAAAA AGGGCAACTC CGAGCTCTAG GTTTAGACCC 

78351 TAAAATAGAG AGTATATCCC TTACGAGTGC AGGTCTTCCT TCAGTGCCAG 

78401 AAGTCGCTAC TGTCGATTTT ATGATTACCT GTTACGGGAA AGATCAGGAA 

78451 GTCCAAGATC CCTAGGTGAT ACAACATCTT CTAAACTTTG CTCTAGAAGA 

78501 GACCCCTTCC ATTTCCGTGC AATACCAAGA ACAAGAGAAG CTCTCTCCGT 

78551 GCGATCATTC CCCAGAAATA GGTAAAAAGA AAAGATGGAA TAAGCTGGAA 

78601 TCCTTCTCCA CGTATTGTTC TCTGTTTATG TCTGTTAAGG ATCATTATAA 

78651 GCTGAATCTA GGAATTCAGA ATTCCCTGTC AGGGTGGCTT CTGGATCCCT 

78701 ATAGGGTTTG CGCGCCTTTA TCTTCACCGT ACTCGTGTCC TTCCTATCTT 

78751 TTAGATTTGC AAAACAAAGA GCTACGTCGT TCCCTTCTGT CAACGTTTCT 

78801 AGACCCTAAA AATCTCACTA GCGAAACATT CCGTTCTGTC TCTATAAACT 

78851 TTGGCAACTC TTCGTTTGGA CAGAGATGGT CAGAGTTTCT ATCTCGTGTT 

78901 CTGCACGACG AGAAAGAAAA GCACGTAGCT GTTGTTTGTA ATGATGCAAA 

78951 ACTTCTGGAA GAAGGATTGT CCCCAGAGGC ATTGTCTCTA TTAGAAGAAG 

79001 ACTTAAGAGA ATCAGGGTAT TCGTATCTAA ACATTCTCTC GGTGAGCCCC 

79051 GAAGGAGTCT CCAAGGTTCA GGAACGTCAG ATTCTAAGGC GAGATCTCCA 

79101 AGGACGGTCC TTTACTGTCA TGATTACAGA TCTTCCTTTA GGTAGCGAAG 

79151 ATATCCGTAG TTTACAATTA GCCTCGGATA GGATTTTAGT CTCCAGTTCT 

79201 CTTGATGCCG CGGATGCATG TGCTTCGGGA TGTAAAGTCT TAGTCTACGA 

79251 AAATCCAAAT GCATCCTGGG CTCAGGAATT GGAGAACTTC TACAAACAAG 

79301 TTGAGAGAAG AAGGTAGTGT TTCTTTCAGA GAATATTTCA GAGCCTATAT 

793 51 GTGTGATAAA 'ATCGTGGCAC AGAAGAACTT CTTATTTACT TTAGACGCTG 

79401 TAATTAAACA GGCCGGTTGG AGATCACAAG AGAAACTCAA TTTATTTTAT 

79451 GTTGAAAGTC AGGCTTTAGG AAGAGAAATC AAAGTCAGCT TAGAGGAATA 

79501 TATTCAGAGT ATGGTCGGGA TTTTGGGATC TCAGAGAACC AAGAAAAGCT 

79551 TTAAGTTTTC TGTCGACTTT ACCCCTTTAG AGCAGGCTCT ACAAGAAAGA 
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79601 TGCTCTTCTG ATGATGACGA AGATGCAACA GCAACTTCGA CCGCTACAGG 

79651 GGCAACAGCA TCTCCGACTG ACATGCACGA AGATGAGTAA CGTTTGTCTG 

79701 ATACCTTAAA AGTTCCTTGC AAAGGGCTCC CTGAAAACTA AATTCCCTCA 

797 51 GAATCTCGAA TTCTCCTGAC TCTGAAACAA TCTTAGGTTT TCCTGAATAG 

79801 AATCTGACTG AAATTTCTGC TCGAATCTAA GGGCTGTTTC TTATTTTACC 

79851 CCTAGATGAG GATATTAAAT CCAAGCTAGG ACTTCAAAAG TAGTTGGTTA 

79901 TTAGTTTATT AAAGAAAATA ATACTAAAAA TATTTAAAAG CTGTTTATTC 

79951 AATTTAATTG ATATTTTCTA TGTTGTTATT TAAAATTGTT TGTTTCTAAT 

80001 TTTATTTTTT TTGTTGTTAT GCCAATTCCC TATATTTCTT CTTGGATTTC 

80051 TACCGTTCGA CAGCATTTTG TTAAGGCGTT TGATTTCTCT CGTCCCTTTT 

80101 GTTCTAGGGT TACGAATTTT GCTTTAGGGG TCATCAAGGC CATCCCTATT 

80151 GTAGGACATA TTGTCATGGG GATGGAGTGG TTAGTTTCTT CCTGTGTTGC 

80201 CGGGATTATT ACTAGGTCCT CCTTTACCTC AGATGTCGTT CAGATTGTAA 

80251 AGACTGAGAA GGCGTTAGGT CGAGATCATA TATCTCGAGT GGCGGAGATA 

80301 TTGCAAAGAG AAAGGGGGAC CATAACTCCT GAGAATCAAG ATAAGGTGCA 

80351 TGGGAAGTTT CCTGTCTGTC CTTTTGGTCG TTTAAAATCC GAGGAAACTT 

80401 TAAAACTTAA GCCGGGAGAA AGAGAGGGAA CTTTAGATAC TGTATTTTCT 

80451 CCGATTCGCA CGCGCGTGAC TCGTGCGTAC TTACAGGCCC CCCGACCCGA 

80501 AATACGTACG ATTTCTATTG TGGGTTCGAA ACTTAAAACT CCTCAAGATT 

80551 TCTCGCAATT TGTGAGTCTC GCGAATGAAA CGCAGAGACT GCATCCTGAA 

80601 GCGTTAGTTT GTCTGTATTT GACAGGCTTG AATCGCGAAT CTCAGATGTG 

80651 CGATACAACT ACTGCAGAGA AGAAGCAGTA CCTACATAAC TCAGGTCTCG 

80701 ACTCTAGAAT CCAGTGCAAA GACAGTAAAG AAGACGACGC TGGCTCTCCT 

80751 GAAAATCCCG AACTTTGGAT TGGCTATTAT TCACGAGAGC AACAGCATAA 

80801 TATAGACGGG CAGTATATTC AGCAGTGTCT AGGGAAGAGT GCAGATCCAA 

80851 TTCCTTGGAT TCATGTTACT GAAGACACAA AGGATTTTTA TTACCCACCA 

80901 AACTTTACTT CATACTCACA TACAAGACAA TCTACAGACC CAACATCGCC 
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80951 ACCAAGACTC CCTGAAAGTG AGGGGGATAA GGATTCCTTG TACGGACAAC 

81001 TGAGTCGATC GTATCACCAT GAGTATATGC TTGGTTTGGG ATTAAAACCA 

81051 GAGGATGCAG GACTCCTGAT GGACCCGGAT AGAATCTATG CTCCTCTATC 

81101 CCAAGGGCAT TATTGTCATT CCTACCTTGC GGATATAGAA AATGAGGATC 

81151 TACGAACTTT AGTCCTTTCG CCTTTCCTAG ATCCTGGCAA TCTTAGTAGC 

81201 GAGGATCTTC GTCCTGTAGC ATTCAATATC GCTAGATTGC CATTAGAATT 

812 51 GG ACTCGTTA TTTTTCCGCC TTGTTGCGGG TCAGCAAGAA GGGAGAAACA 
81301 TAGTTACCCT TGCCCACGGA ACTCCTCGTC CAGAAGATCT TGATCCTGAC 

813 51 TCAATGAACA TTCTGACCAG AAGATTACAA ATGTCTGGAT ATAGCTATTT 
81401 GAACATTTTC TCCTATAAAT CACGGAAAAT GATTGTAAAA GAACGTCAGT 
81451 TCTTTGGAGA TCGTTCTGAA GGGAAGTCTT TCACATTGAT CTTATTTGAG 
81501 GATCCCATTA GTGCAGCAGA TTTCCGTTGT TTGCAGCTAG CTGCAGAAGG 
81551 TATGGTTGCT AAGGATCTCC CCAGCGTAGC AGATATTTGT GCCTCTGGAT 
81601 GTTCCTGCAT TCAGTTTTCT GAGATGCAGA GTCCTCAGGC TATTGAATAT 
81651 AGACAATGGG AGGCACGTGT CGAAGATGAA GCAGGAGAAG AAGCCAGAGA 
81701 ACCAGTAATT TATTCTCAGG ATCAATTGAG CAGCATGCTC ACTACACAAC 
81751 AGAATTTTGT ATTTTCTCTA GATGCTGTGG TAAAACAGGC GATCTGGAGA 
81801 TTCCGTTCGA AAGGTCTTCT. TACTATGGAA AGAAAGGCAC TAGGCGAGGA 
81851 GTTCTTAACT GCGATATTTT CCTATTTAGG GAGTCAGGAG CGTAATGAGA 
81901 ATATGGGGAA AAGAACTACC GAAGAACATG AGGTCGTTAT CAGCTTCGAA 
81951 GAGCTAGATC GCATGGTGCA AGTCCTCCCA GCCGAAGTCC CTGCAGATTC 
82001 AGGCAATGAT CCTACGCGTC CCGTTCCTAA TCCAGATAGT AACCCTGATT 
82051 CCTCGCAAAA TGAAGGCAGT TAGAAAGTAA AAA TACT AGA GAAATTTCTT 
82101 ATCTCCAGAT GGAATCCGTG GTCCATGTAC CTAGGATTCC AGGAAGGGTT 
82151 TGCCTAGGAA TATCTTAATT TCACAAACCC CAQGAGTGCA CAAACTCCTT 
82201 AGAGGACTCT CTCTATATTT GTTCTTTCTA CACTAGTATT CCGTAGATTT 
82251 CTGATTTCCA GGATAGGATT CTAAATAGTT GTATCTAAGC GCTTTTTACA 
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823 01 ACCTTTCTCA GGTCTCCTCT TTCTAATTTT AAAAATAGCG AATTTCTTGT 

823 51 GGCTATAGGA GTCGCTAAGT ATCTTAGAGA TTGCTTTTTT CAAGTTTTTT 

824 01 TTAAATACTT CTTTCCTAAG TATTTCTTCC TAGTCGTGTT ATGGCTTCTT 
82451 GTTTATCTGC CTGGTTTTCT ATAGTTCGTG AGCACTTTTA TCGAGCCTTT 
82501 GATTTTTCTT TGCCGTTTTG TGCTCGTATT ACGGAATTTG TATTAGGGGT 
82551 CATCAAGGGG ATCCCTGTTG TGGGTCACAT TATTGTTGGG ATAGAGTGGC 
82601 TCGTTTCTAG GTATTTAGAG AGTTTCGTGA CCAAGCCGAC ATTTGTCTCT 
82651 GATGTGGTGA GTCTTCTGAA AACAGAGAAA GTTGCTGGTC GCGATCACAT 
82701 TGCTCGTGTA GTGGAGACTT TGAAGAGGCA GAGAGTCGCT GTGGCTCCTG 
82751 AAGATGAGGA TAAGGTCCAT GGGAAGATTC CTGTGCATCC TTTCGGGGGA 
82801 ATCCAACCTG TAGAAGTTCT CACTCTCTAT CCCGAAGTTC AAGATGCAAC 
82851 GTTAGGGCTT GCCTTCTCTA AAATTCGTAA TCGTGTAAGA CAGGCGTATT 
82901 TGCAAGCTCC ACGGCCAAAA CTGCAGAAGA TTTACATCAT AGGAAACGAT 
82951 ATGAATCCTT TTGAAGTTGA CGACTTCTTG CATCTAGCCC GTCTCTGTAA 
83001 TGAAACTCAA AGACTCTATC CTGACGCTAC GATTTCTCTA TATCTAACAG 
83051 CTTCTGGTGG TCGCAATGCT ATGGACAAAA AGAATCGGAA GTTACTTAGT 
83101 GATTGCGAAC TAAACCCCAA GATTGCTTGT TTGGACTTTA ATCAGGGTGA 
83151 TGTAGTCAAA CAAGCAACTT GTGACTGTTG GATGGTGTAT CATGGGGAGA 
83201 ATGATCAAGG TACGTTGAAT CAGATTCAGG AAGAGTTAGA AAAGTCAGGG 
83251 GAGGAAACCC CTTGGATTCA TGTGGGGCAA AAGCCTCTTT CACAATCCTT 
83301 GTGGGATTTC TCTCCATTTT CATCTTTGGA GATGAAGGGA GATAAAGAGA 
833 51 AAGCTCTAGA GTACTCTGAA TTAGAAAAAG AACAGCTATA TTCTCGATTG 
83401 GTATACGTAG GAGAGCGCTC TTCGGTTCTT AGTTTGGGGT TTGGAGATAG 
83451 TCGGTCAGGG ATCTTGATGG ACCCAAAACG GGTGCATGCT CCCTTATCTG 
83501 AAGGGCATTA TTGTCATTCC TACCTTGCAG ACTTAGAAAA TCCCGGGTTA 
83551 CAAAAAACAA TTTTAGCGGC ATTTCTGAAT CCTAAGGAGT TGAGCAGTAC 
83601 CATACTGCAA CCTATATCTC TAAATCTTAT CTTAAATAGC AAAACTTACT 
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83651 TAAGGCAGCA CTTTGGCTTT TTTGAGAGGA TGAGCAGAAG TGATCGCAAT 

83701 GTGGTTGTCG TTGTATGTGA TTCTTGGTGG GGTACCGACT GGAAGGAGGA 

83751 GCCAAGCTTC CAACACTTTA TTATGGAGCT AGAGTGTCGA GGGTATTCGC 

83 801 ACTTCAATAT TTTTGCCTTT AGATCTAATA GCATGTGTGT AGAAGAACGT 

83851 AGGATCTTAA ATGAAAGTTC TCAAGAGAAA GCCTTTACCA TGATTTTCTG 

83901 TGAGGATTCA GTATCTCAAG GAGATATCCG CTGTTTGCAT TTGGCGTCTG 

83 951 AAGGAATGCT TTGTGGTAAA GAGTGCTATG CTGTCGATGT CTATACGTCA 

84 001 GGATGCGCGA ACTTTATGAT GGAAGAAGTC TTAACTTTGG AGCGAGAATC 
84051 TAATCTGTGG AATAGAAAGC ATGGTCTTTG GAAAAGAGAA GTTAGAAAAC 
84101 AGAAACAAGA AGCTGCTTTG GATCAAGACG AGAGCGAGAT TTACGTTTGT 
84151 AATCAGCTGA CGGCGCAACA GAACTTCGCT TGTTCTTGAG ATGCTGCAAT 
84201 CCGCCAGTCT ATATGGAGAT CCCGTATGCC AGAACTTCTC TCTATTGAGA 
84251 GACGGGCGTT AGGGGAACAA CTCTTTACTA CTGTACATCA CTACCTAACA 
843 01 ACGCAAAAAA AGATCCTCAG GGGAATCTAG AAACGCAGCA ATCCGCGCAA 

843 51 TTGTCTATAG ATTTCACAGC ATTAGATGAA GCTGTTGAAT CTCTAGGATC 

844 01 GACTCTTAGC AGAGCTCCTT CAGAAATATC TCCAATTCCA GAGGAGGAAG 
844 51 CTCACTTAGG AGCCAACAAA TAGAGACAAA GAAAATTCGA CGGTTTGAGG 
84501 ATAACGATAC GCAATGTCTA AGCTTTGAAT CAGGATACTC TGCTTTACAG 
84551 GCGTGTCTTT GTGCCTATGG TCTCTCTCTC ATAACAGAGT CTCTCAAATC 
84601 TAATTGTCAG AACCTATTTC CCCTAAGAAT CGATAACGTA TTTGTTAGGA 
84 651 AAACGTGCTC AACAAGAGCG TTTTATTTTC AGTGTACTTG ATGTCTAATA 
84701 CAAATTGTTT ATTTTGTATT TTTGGCACAT CATTTAAATT CCTTGTACTT 
84751 TTGAATCTAA ACGTAAATTT CTTATGACTC ATTGCTTACA TGGTTGGTTT 
84801 TCTGTAGTTC GTCATCACTT TGTGCAGGCG TTTAATTTCT CACGTCCTTT 
84851 ATATTCTCGA ATTACCCACT TCGCTTTAGG GGTGATTAAG GCCATCCCCA 
84901 TTGTAGGGCA TCTTGTTATG GGAGTCGATT GGTTGATCTC TCATTGCTTC 
84951 GAGAGGGGAG TCTCACACCC TGGGTTCCCT TCAGATATTG CTCCTATACT 
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85001 


GAAAGTAGAA AAGATCGCGG GCCGAGATCA TATTTCTAGA 


ATCGAAAATC 


85051 


AGCTAAAGAG 


CCTTAGGAAA ACTATCGAGG TTGAAGATCT AGATAAAGTC 


85101 


CACGGGCAAT 


ATCAAGAGAA TCCTTATGCA 


GATATGGCCT 


CTAGTGAGGT 


85151 


TCTTAAACTC 


GATAAGGGAG TTCATGTTAG 


CGAGCTTGGC 


AAAGCCTTTT 


85201 


CTAGAGTTCG 


CAATCGCATC ACCAGATCCT 


ATAGTTATGC 


CCCTACTCCT 


85251 


CAGTTGGACT 


CTATAGCTAT TGTTGGTATA 


GATCTCGTCA 


GTCCTGAAGA 


85301 


ACAAGAGAAT 


TTAGTACGCT TGGCGAATGA 


GGTCATTCAA 


CTCTATCCCA 


85351 


AATCAAAGAC 


AACTCTATAT CTTCTTATCG 


ATTTTAATAA 


GGAGTGGGTA 


85401 


GGGGATATCT 


CCTCTGATAA GGAAAAACAG 


CTCCGTTCTC 


TAGGTCTACA 


85451 


TTCTGAAGTT 


CAGTGTCTTT CCGTCTTGGA 


ACCTCAGGGT 


GCCGAGGGCG 


85501 


AAGATACGAA 


ACACTTTGAC CTTATGGTCG 


GCTGTTATGG 


GAAGGATTCT 


85551 


TACTTAAGGG 


AGGGTAAAAT TTTACAGCAG 


GCCCTAGGGA 


CTTCGTTAGG 


85601 


TACTGTTCCC 


TGGGTGAATG TTATGCACAC ATTGCCATCT AGGTATAGAT 


85651 


CTCGGCTTTC 


CTTACCTATA AATACCGAAA 


AGGATAAGAC 


AGAGCTTTAT 


85701 


AAAGAGATTT 


CTCGTACACA CCATCAGTTG 


CATACTTTGG 


GAATGGGACT 


85751 


TGGAGCCCAG 


GATTCAGGAT TGCTCTTAGA 


CCGGCAACGA 


CTCCATGCTC 


85801 


CTTTATCTCA 


AGGGTCTCAC TGCCATTCCT 


ATCTTGCAGA 


TCTCACCCAT 


85851 


GAAGAGCTGA 


AAATTTTGTT ATTTTCAGCA 


TTTGTGGATG 


CTAAGAACAT 


85901 


AAGTAAGAAA 


GAGCTTCGTG AGGTATCTCT 


AAATTTTGCT 


AACGATACTT 


85951 


CCGTAGAGTG 


TGGCTGCGCT TTTTACTTTT 


AGTGTCCTAT 


GATGAGAAGG 


86001 


AGAAAGACGT 


AGTTGTCGTT TGTAATCATT 


CTGAACCTAA 


TATCCTCGGC 


86051 


CTGCCTCCTG 


AAGCAGTCTC TCAGCTTATT 


GAAGAGCTTA 


GCGATGAAGG 


86101 


CTATAGCTAT 


CTGAATGTAG TGCGTTGTGA 


TCTCTCCGGG 


GAGACTACGG 


86151 


TTCAACAACG 


TCTGCTATTG AATGCCGATG 


AAGGGAGATC 


TATGACGGTG 


86201 


GTGATCTCAG AQCTTCCTGA AGGGCACCCC GATATTCGGA 


ATTTGCAGTT 


86251 


GGCATCCGAA 


AGAATTTTTG TTTCTCGTGA 


AAAAGAAGCT 


GCTGATGCCT 


86301 


ATGCTTCAGG 


ATGTAAAGTG GTCGCTTTCG 


ATGATGAGCA 


TCTCCCTTGG 
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863 51 GTCTCCAGTC ATATTGCCTA CGCGGAGGAG ATCAGAGAGA AACAAGAACA 

86401 AACAATGCAA GGGTCTTTAA CTGAAGAGCA GTTAGGAGCA CTCCTCTGCA 

86451 ACACAGTCTC CACAGAGAAA AATCTAGCCT TTGCTCTAGA CGCCGTGATA 

86501 AAACAGTCTG TGTGGAGATT CCGCAATCCG GATCTTTTTG CTTATGAGAG 

86551 AGAAGCTCTA GAGGCTTCAG TAACAGATGC TTTAGTATCT TACGTTTCAA 

86601 ATTTAGACAT GATACCGTAC ACAAGTTCTC AGGGCATAGT CATAGAAGAT 

86651 AGTAGTATCG TCCGTACCTC TCAAGAGCAT ACACTCATTG TGAACTGTGC 

86701 AGCATTCGAT AAGTTAGCGA GCCAAATAGA GTTCTTATGC CCCAGTGACG 

86751 TGTTGCCCAT TTCTGGTAAA GACCCTTTGA TTTCTGATGA TGAGGATGAG 

86801 GAACTGAATC CTAAAGTTTC ATCTGCTGCA GACTCTAAAG ATAAAACCTA 

86851 GGGAGTGAAT TCTACACGAG AATCGAGAGG AGAGCGAGTC TTTCAAGGAT 

86901 TCATAATCCT TGTTAACGTA TGCATAAACA AGTGAAGCCA TTGCAACGTG 

86951 AAGTAATCGC ATTGATGAAA GATGCTTTCC CTAGGGATGC AAAGCAGATC 

87001 GTATTCCCTT CTTTCCAAAA CAGACTATAG ATTCAAAAGA TATTCTTTCT 

87051 TTTCAAATAG ACTTGAGAGA GGGGGGGGGG TTCTCATAGA GTGAGAATCT 

87101 TGGCCTTCAT TGCTAAGTTC TTCGATGATG GATAGAGGAT TCTAAGACGA 

87151 CCGGGGCTAC AGAAAACTCT AAGCAGAGCT TAGAGTTTTA AAATGTGGAT 

87201 TTTAGTCCTG TAGACACTCG GTGGTTTGTA AATCCATTTT TCCCGTCAAA 

87251 GGTATAGTTT AGAAAGGCCT GAGTGTCCTC GGTGAGATCT ACTACATCAT 

87301 GGATTTGTAC AAACAAACCG TGGCGGCGTA GATTTGCTCC TGAGATCGAA 

87351 GTGCTCTCTT GGTTTGAGAC GACAGTCACA ATATTGTGAG GGTTGACACG 

87401 ATAGATATCA GGCTTGTAGG CAAGTTTGAT GGTCAGTGTA GAAGGAGCCT 

87451 TCTTAAATGG 'TGTGAACCAT TGAGAAGAAC ATCCTATCGG TAGGGAAACA 

87501 TTGTACCCTT TACCTCTACT AAAGCTACGT TGCAGATCTC CAGTTTCTGT 

87551 GAACTTACTT TGCCAACCAC CTAGGAATTC TGCGGAAATA AAACCTGAAA 

87601 GATCCCAAGC TTGAGCCAGA GGTCTTGTAA GAAGACACCA GTTTAGGAAA 

87651 GGATGTTCTG CAGAAATAAG AACATAGTAA CTATTGTTAT GCCATTGCCC 
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87701 TTGAGATTTT GGAGCTTTGT CAGGTCTGAG GTAGGTGGTA TTTAGGTGAT 
87751 TTTTGGAATA ACCATAAGCT GCTTTCCAGG AAATTAAGGC CTCGCTCTTT 
87801 TGAGTCACGA TAGGGAATTG ACCAAAGAAC GAGAGTAAAT ACATTTGTTC 
87851 TGAGCAACGT GAATCGTAGG GGTTGGCGTT AGTTTTTCCA TAAAGCTGCC 
87901 CGAAAGAAAG TCCTAACGTA GTGTGGTCCG TGTAGTTCAT AGATAGCGCA 
87951 GCTTGGTAGC CTCCATAGCG ACCTGAAAAG CCCTCATGTC CTTGTCTTGG 
88001 TGTGTGTTCG ACATAGGCTC CTAAAGCTTT CGCGGTTATG GACAACCCGG 
88051 GATGATCTAT CAAAAGAACA TCTTGGAGAA TATCAGAGAA TGCCTGATTT 
88101 CCTAAGAAGG AAATCCATAA GCTGTTGCTG ACAATTTCTC CGTAACGCTC 
88151 GGGATCTAAG ATATAGGTAG AACGCACGAG AGTGTCTGAA TTCCATACAG 
88201 CATAGAGAGT ATTTGCGCTA GGAGAGGGAC CTCCAGGAAA TCCTCCATCA 
88251 GGAGCTGGAA TTAACAGGGG ACGGGACCAT GTGTAGGACC ACTTTCCTTG 
88301 GTAGCCGTAG TGGCTTGGAG TCGCAATCTC CCCATCAGGA AATCCTGTCT 
88351 TAGTAACGGT TGCTCCTTTG AAAACAGCGA TAGGAATTGC TACTGGAGTT 
88401 TGTAATGACA CCATATCATA AAGATCTGTA ACGTCATGTT CATCAAGAAC 
88451 CAGAGCTCCT GTTAAAGTGA CGTTTTTTGT GCCTGCATTT ACTGATGCTG 
88501 AAACAAAATC TCTTTTTAGG AAGGAAAAAG GATCGAATGC TAACTTTCCA 
88551 ATCGTAAAGT CTACAGCGGC AGGTGCTCCC GTGGGTGTTG CCAGCCCTAA 
88601 GGTTCCTCCA GAGCCCAGGG TAAGCTGACC TGAGCCCTGA GTAGCGAAGC 
88651' CAAGAACATT GACAACCGCA TTGTCAGTAA TCTTCAGTTC TCCACTAGCG 
88701 ATCTTGACTG TTCCTAGAAG TATAGTTGTC GTGTTGGCAG GCAACAGGAG 
88751 TTCTGTAGAG GAGAGTCCCT TACTTGTAAA GACTACAGAT CCTGAAGCGC 
88801 CATTAGCGTT GATTGTAATG TCTTTATTAG ACGGACTTGT GGTTGGGAGG 
88851 CTATGTGTAA TGGGATCATA AAATACAAGA CGTGAGCCTC CTTGTGCAGA 
88901 TAGAGACACA ATCTCTCCCC CTGCTTCTAC AGTGATGGCA TTGCGGATTC 
88951 CAGGTTTTGT ATTGAGCATG TTTCCTTGGA ACGCAATATC ACCTTCGGAA 
89001 GCCAAAATCG TAAGTGTCGA TGTTTGCTTC GCAGGGTCTC CAACAGAGGG 
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89051 GCCTATGAAA ATAGCTCCAC CCTTCTCCGC GCGGTTTCTA GAGAATTCAA 

89101 TAGGACCGCG TCCTTGAATA TTGAGCACTT TAGCACAGAT GGCTCCGCCA 

89151 TTTCTCGCTG CAGTGTTGCT ATCTAGGAGC AAGGCACCCT GGTTCCCTAC 

89201 GATGTTGCAG GTTTCGGCGT AGATCGCACC CGCATCATTT GGTGTACCAT 

89251 TATAAGAGAA GGTGCACTTC CCCTGATTGT TTTTTAATTC GAAAGTTCCC 

89301 GTAGGGATGC AGATGGCTCC GCCACTTCCT AAAGCATAGG TTCCTGATGA 

89351 AGGGGTGACG CCTTTTAAGC TGTTCACACA GGAGTTGGCG GTGAAGATGA 

89401 TGCATCCAGA ATTGTTTTCA AAAAGGAGAG AGCTTCCTCC ATAAATCACG 

89451 CCGCCTCCTG ACGAAGTAAA GTTATGGGAG AAGCTCATGG TAGCGGTGTT 

89501 ATTGATGAAT TTAACGACTG CCGTGGGAGA ACTGCTAATT GCAGAGCCGA 

89551 AGTTGGCAAG GTTCCCAAAA AACTTGATCG ATTGGGATAT GTTTTCGACA 

89601 GTGAGTGAGT CTGTAGGTTG GAGAGCAGAG GCTCCTGTAG TTACTGTAGT 

89651 GATTGCGGGA GTTGCCGACG TAGTTACAGA TGCTGGAGAA TAGGCAACAC 

89701 GGTTCGTAGT GAAGATCAAA TCTTTGATAG ATTGGAAAAC AATATCTTTT 

89751 CCGTAGATGA GGCCGCCAAG TCCTGTCGAT TGGTTCCCTG TAAAGTTTAT 

89801 AGAAGAAAAA TTCTTGAAGG TCAGAGTCTC TGCGGCAGAA AGTAGCGCAT 

89851 AGTTACTATT TGTAGGAGCT TGACAAGAGG TAAACGTTAA GG AAAGGCCC 

89901 TTCCCTAATA AGGAGAGATT GCTGGAACTA TTGATAAAGG AACTTGAAAA 

89951 ACTCGCCCCC AAGAAGTTTG AACAAACAAA ATCTGAAGTG AGTAAGATCT 

90001 CTTCACCCTG ATTCGTAATT GGAGGAACAA AGTTCCCTCC GAGTCTAGTC 

90051 TCAGCAAACG CGCAACTTGC ACTACATAAA CAGGCAAGTA GACAAAAAGA 

90101 TGAAGATTTG AAAGAAAGAG GCATGCCTCA ACCCTGTCGT TAAATAAGGT 

90151 TTAAAATCGT AATTTGCTTC CTATATCTAG AGTATATTGA CGGGAGGTTC 

90201 TGCGAATATC ACATCCACAG TGCCCGAAGA TCTCTAGAAC ATCATTTACT 

90251 GAAGTATGGC TGGATGCTTG TACTAGCAAA GTACTTCTGG TTAGATTATT 

90301 CCCTATAGAG GTCCACGTAG CTCCATTAAT AGGTAATGTC GTATCGCAAT 

903 51 CAGGATTGTG ACGATAGACA TCAGGAGCAT AGGCTACGAT TATAGTGTAA 
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90401 AAATCTGGTC GATTATGAGA ATTTTTACCA AAGCGGACGC CTACGGGAAC 

904 51 TGCAACGTTG AGTAGATGAC CGTGTCCAAA AATCCTCCCC TCGGGGGTAT 

90501 TTTCTTGGAT GCCCCCATGA GTCGCGTAAG CAACTTCAGC TTTTACAAAG 

90551 GGAATGATCT GCTTGAGGTT TAAGATGCGA GAAGAGAGAG TGATGGGAAG 

90601 GTTCCCTTCG AGTTCTCCTA ACCAGCAATT GTTACTCCAA GAGCAGCGCC 

90651 CTTTAGGCAA TTTTGTATAT GAAGTCTTTA CTTTCTCATT GCTACGGCTA 

90701 TAGGTAACTC GAGAAGTGCC TCCTGAGAAG AATCTCGATG ATCCAAACAG 

90751 AGACTTGGTG ATGTTAGAGT ATACTGTAGC GAAATAAACG TTAGAATGAC 

90801 CGTGACCTAC GAGGTAATCC TTAGATTTTG TAAACAGCTG TCCAAAACCT 

90851 AGACTTAACG CAGCATCTGG AGTGATGCGT GTGTAGGTAT TGATGAGGTA 

90901 GCCTCCACCC ATATGGCGGT AGCTGCGTGC ATCACCGGTA TGATTCGCAT 

90951 GGAAGAAATT TGTAATTCCT GTGATGCTCA GTTGCTTCCC AGGGACATCT 

91001 TCGCCATCAG CTGCTGACGC TTGACTTACA GCTCGTAAAT CTATGACGTT 

91051 TGCCCATAGG CTATTAGGAA TGAGGGGAGC AAGACGCTCC GG ATGAGGAA 

91101 GATAACCGGT TTTTTTCCAA TTTCCTGTGA CCGTATGGGT TGTCGTGTCT 

91151 ATGGTAAACT CCCAAGTTCC TTGATACCCA TAGGGAGATT GCTGATAGCC 

91201 GTTTGTGCCG AGACTGAAGT CCGTAGTGGT TACAGTATTT GAAGTCGCTT 

91251 TGAGTTCTAA AATCGGAACT TGCTGTAAAT CTTTATTAAA CATCCCGTGG 

913 01 TTGTCACAGC AATCTTGAGA GTTTTTCACA AGTCCTAAAG TTCCGGATAT 

913 51 AGTGAGAGCT CCATTGGTAC TCTGCACATT AACGACAGCC GCTTTAGTGC 

914 01 CATCCAAAGA ATCCAGATTG ATTACAAGCT TGTTTAAGGT GATAGCACCG 
914 51 TCAGTATTAT TAGCTCCATT TGTAGTTGCT AATGTGGTCC CTGCATCCAT 
91501 GATGACGACG GACTTTTCAT CTTGCGTGAA GTTATGAACA TTTAAGGTAG 
91551 CACCGTTTCT TAAAGCGAGA GTACCGCCTT CAAGTTCTAG CTTTTGGTTT 
91601 AATGTAGATG TAGCATTTGC AGGGGTTGCT GCTTCGGTAG CAGTGAGGGT 
91651 TTCTCCTGAA AAGACAATAG TCCCTGAATA CGCACCATCT GCACTGGCTT 
91701 TGGGATTGAC GACCACAGTA GCGGCTGCGG ATGCAGCAGA TAAATCATCA 
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91751 GATGTAATCG GATCATAGAA 

91801 AGTGGCAAAC TTGGCATCTT 

91851 TCCCACTAGT GATGCTGAGG 

91901 GCGGATAGAG AGAGCTCCCC 

91951 TCCAGCTGTA TTTCCAAGAA 

92001 TCTTTTTGCC ATAGAGGGCT 

92051 AATGTAACGC TGTTTTCTCC 

92101 GGGGGTAGCT GGAGGAGTAC 

92151 GAGTGACTGC TGGTGGTGTA 

92201 TGTGGAGATG CTGTATTGTT 

92251 TGTGACATTT CCGTTAGCAT 

923 01 TAACGAACAA GACTCCTGAG 

92351 GTAAGGTAAA TAGCCCCACC 

92401 CACAGGAGCG CTATTGTTGA 

92451 CCCCTCCATT AGAGATGGAT 

92501 GTATTTTCTA TTGTGATGCT 

92551 AGAACGGTTG TATTGGAATA 

92601 GCGTTGTAGC GTAAATCGCC 

92651 GTATCTGAAG TCAAAGATTC 

92701 TGAAAAACTC AAGGGACATA 

92751 TCGCGGCTCC GTCAGCTGAA 

92801 CCTTTTCCTA AGATTTGTAG 

92851 AGAGGTTCTG GATATGGCAT 

92901 AAAAAATACA GAGTGTCCCT 

92951 ATCTTCCCAC TCGATCCATC 

93001 AGTAAACGCT GTTGAAGCAA 

93051 ACTTACGAAT AGACGTTTTC 



PCT/US99/26923 

GTATAGGGTA TAGCCTTGCG TAGCTCCTAG 
TTCCGAAGTG AATACTATTG CGAGTAGGTG 
TTCTTGTTAA AGAGGATATC ACCTTGATTT 
AGATTCGGGA ATAGCAATAG CGCCTCCTTT 
ATATTGTAGA TTTATTAGAA TCTATAGAGA 
CCTCCTTGTT CGGAGGCAAT GTTTTCTAGG 
AGATATAGTC AGGCTAACAC CTGTTGGTGG 
AGAAGATGGC GCCTCCATAT CCTAACAAAG 
GGTGGAGGTG TAGGTGCAGG TAAGGAGTTT 
TTGGAAAGTC AGGTCGCTGT TATTAGAAAA 
AGATAGCGCC TCCTGAGCGC GAGCTATTAT 
AGGTTCCCAG AGGTGAGCAT AGATCCTCCG 
ATAGATCCCT GTAGCATTCG TTGAGAAAAT 
TGAGGTTGAT CGCTGCAGAT CCCGTGAGGG 
CCATTACCAT TAAAGAGAAG GCTCJTTTTC 
TGTGCCTCGA ATGGCAGCTC CAAATCCTGC 
GTATGGAGTC ATTGTTTGTA AAGAGCATGG 
GATGCGTGAG GTATGACATT ACTCGCTGAG 
ACAGTTATCG AAGATCATCT GACTAAATCC 
GTTCAGGATT TTGGGTGATT ACACTACTAA 
GAACGGATAT TTAAGAAGGA GAAAACCCCA 
TGCTCCCGCC CTATTGCTAA AGCAACTGGA 
TATCAAGATT CGCAATGTAG AGATCCCCTG 
CTAGGATCAG AAAGTGTTGT GTAAGGAAAA 
AAAGTTCTCG GAAGGCATGA TAACTTCTAC 
AACATGGCGC CAGTGTGGTA GAAATTAAGA 
ATTTGCACGT AGAGATGAAA CCAGATTATC 

69 



WO0027994 f file:/A\dcwas03 \ firmdata\l p\Fole yPa 



Page 197 of 330 



WO 00/27994 PCT/US99/26923 

93101 CTACAAATAA GGGAAAGGCT GTAAAAAAAC AAGTACAATA AGACACAGTT 

93151 TTAATCTCTT AATTTTGACA GCTTTAAGAT TACAGGATAT TTTAAAGGGC 

93201 ATTTTCCCAT TTCTTACATT GCTTTCTTAG AAGAATACTT GATAGAAAAT 

93251 GGCGATTCTA TTTTTGAAAA ATCTCAAGAA ATTCTCCCAA ACGAAGATGT 

93301 TTTAGAGAAC CTGTAAAGTA GAAAATGGCG CTACGCCCGA GACTCGTGAA 

93351 CATGTGTGCA TAGAGGGACC TATCTCTGAA TACAGATAGG TCCCAAAATC 

934 01 TTCTTAAAGG GGATTCCCTT AATTATAGTG TAGACTTAGA GATTATGGCG 

93451 TAGAAGTGAT CTCAGGATTC CAAGTGAGGA AAACAGTTTT CTTAGTTGGT 

93 501 GTGATCCTTC TGTCTTTATC CAGAGAAGGG GGGTACTCTT CCCAGCTTAG 

93 551 GG ACCAGAAA CCTCGAACGG CGTCATTCTC TGTTGTGACT TGGGTTCTCT 

93 601 CCAAAGTGGT CTCCCCTAGG AGAAGACTGT CAAAAGAAGG CCCTAGAAGT 

93 651 TCAAGAAGAG GAATCGTAAA GGCTTCCTTA AATTGAGGAA AGTCATAGAC 

93701 GGACCAATAA GCCTCAAAGG CAACTTTAAG TTTTTCTAAT GAGACAAGAG 

93751 CATCCTTGTC CTCGGCGCGA ATCCTTACAG GAACAAAGTT GTCGGTATCT 

93801 TCAATCAGGA TGTGCAGATT CTGAACCCGA GCATCTCCTG AGCAGAGCAG 

93851 AGTGGTTCCT GGAGACATAG TAAGCGTAGA GCTTGCTTCC TGCTTAAAAG 

93 901 AATGCAGTTG CAAGGTAACC CCATCCGATA GAGAGAGAGT TCCTCCTGCT 

93 951 AATGTGACAT CTTGTAGGAT TGTGGAAGTA AGATTTTCCG CACAAACTTC 
94001 ATGATCATCC AGGCATAGTC CTGAGAAGCT AATTGTTCCT TCATAAGTTT 

94 051 CCTTTCCTTC AGGAGCATTG ATTACAAGAT CTGTAATTTT ATGCGACTCG 
94101 CTATGGCTTA TAGGATCATA GAAATAAACT CCGGATTCTG AAACAGCACG 
94151 TAGGTTCTTA AACTGTGCTC CAGATTGCAG ATGGATGGAG TTGTGTATTG 

942 01 TATTTCCGTC TTGTGATGCT GTATTTCCTT TGAAGATGAG ATCTCCGCTT 
94251 TTCACGGATA TAGAGATCGA TCCTCCAGGA GCAATGGCAA TGGCTCCTCC 

943 01 ATTACTATTC ACGTCATGAT AAGCATGATT ATTTTCAAAA CACGAAGGTC 
943 51 CTCGAGTCGT GAGTGTGAGA TTGTGGGTAG ATATGGCGCC GCCATAACCT 
94401 TGGCTCACAT TGTCTCTAAA CACCAGGTAG CGGTTCCCGC TGAGATTTAC 
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944 51 TGAAGGACGA CTCGCCTTAG AACCTAAAAG GTAGGGAGTA TAAATCGCAG 
94501 CTCCACTCCA CGAAGAGTAG TTCCCACAGA AAGTCACTTC CTCACTATTT 
94551 TCGATCATCA CGGAACCAAG ACTATAAATC GCTCCTTGTC CTTGAGGTAG 
94601 TAGAGGTGCT GAGGTGAACG CTAAGTAAGA AAAATTAGAG AGAGTGAGAG 
94651 TGGTGTCTCC AACGCGGTTC GAAATGGCAG CGCCAAAACC CTCGGTCATA 
94701 AGGTTGTGAA AAGTGAAGTT GCAACGGTTG CCCATGAAAA AAAGATTCCC 
947 51 AGATCGATTT ATAAAAACCC CAGCATCTTC TTGATCATGC TTAACGTTGG 
94801 AAATCCTCAC GTCATCTAGA AAGATGTAAG AAGTTCCTTC TGGATAACAG 
94851 GTAATTTTAG GTTCTAAGCT TTTATTATTG ATAGCACCGT TATAACCATC 
94901 ACTTTCATGA AGATATACAA CTTGTGCTGC TGCAGGGAGA GCGAGGAATA 
94951 AAGCCGAGCA GGTAAGAAAA TTTCGAAGTA TGGTCATGGT TTCCTCGTTA 
95001 AATCAATAAG GTTGAAGCAA CTTTAATAAA CAAGAAAAAA AGAAGTCAAT 
95051 AAGAATAGAT TATTGTCTAT TAATTATTTA ACTGTTTTTA AAATAAAATT 
95101 ATAACTAGAA ATTATTAAAA GAAATCTTTT TTGAAGAGGG ACAAATGTTA 
95151 TTTTTTACAG TTTGCAAGGA AAGCATTCCC TATAGCAAAT ATTTCCCTAA 
95201 AAGTATGAGA AAACTCCCTA GAAGAACTAG GGAGTTTTAG CAATCTAGAA 
95251 TCGGAGTTTG GTACCAACAT CTACATTGTA GTTCCTTGAA GATCCACGGA 
95301 GTTCCATAGC GTAATGTCCG AAGAGCTCAC AATTGGAGTT GTAGACGTAG 
95351 TTGTTGCTAC CCCTCAGTAA AAATGCCTGT CTTGAAAGAT TGCCACCGCG 
95401 AATTTTCCAA GAGTCTGGGC TCATCACAAG AGTCGCTGTA GATTGGGGAT 
95451 TGTTACGATA GACATCGGAA ACAAAGAATC CTGAGAGATC ATAGGTGTAG 
95501 GAATCTCCGA TATCCCCCTG CACGAATTTC GCACGCACAG GAATCGAGAG 
95551 GTTAAGCAGC CTTpCAATAC TAAAACCACG GCCATCACTA GAGCTTTCGA 
95601 AGAAGCTATT TTGTGATACA TAAACCATTT CGACTTTCAT CTGTGGAATG 
95651 AAGGTCTTGA AAAGAGGATG TGGGTTGGAA AGAACAAAAG GAAGGTCTAG 
95701 GCCGATACCA CCAGCTATAC ACTCGTXGCT CCAAGAACCT TCGGATTCTG 
95751 GCAATGAGGT ATAGTGCGTT TCCATACGGT TGTCTGAATG GCTGAACGAA 
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95801 ACTTGGACAT CCAAGGCTAG GGGAATTTCC CTAGGGAATT TTTCTATAGC 
95851 TGATTCAGAA AACTTTGCTC TTCCTAATCT CAAATAGTTT TGGGGTTGTA 
95901 GGGTATGAGA GTGCTTGAAG AATAAAGTTC CACCGTAGGT TCTAGAGTTG 
95951 TTGTGAGCGA TAAAACAATC TTTGTCTCTA GCAAAGAGAT GGCAGAACGC 
96001 AAAGGTAAAT AGGTCGTCTT TAGGAGTGTG AGCACTTCCA CCGATGACGT 
96051 AGCCTCCAGA GGTATGACGG AAGCCTTTGC GATTTTCATC TCCAGTCTTA 
96101 TGCAGGAAGT TCGTCATGGA GGAAACCCAG AAACCTTGTT TGTGTTCCAT 
96151 ACCAGTTGCG CCGATCTCTA CAAGCTGTTG CAGAGAGCGA ATGTCAGTAA 
96201 AGACTCCCCA TAGGGTATTG CATACTAACG CAGATTTTCT TTCGGGGCTG 
96251 GGAACAAATC CTGTTTTGGT' CCAAGTTGCC GTGGCCTCTT TTGTATTTGT 
963 01 AGCTGTATCC GTAGTCCAAT TAACATTCCA TTGTCCTTGG AATCCGTATT 
96351 CTGAATTAGG ATCCTCAGCA GGAACAGGGA TAAGGCTGCT GATGTCAACG 
96401 TTAGTATCAA CATCAGCATC AACCGTGATT TTTAATAGAG AGAAGAGCTG 
96451 GTCATGGCTG AACATATGAC TTTCATAAAT GTTCCCTTCA ATATCAATCA 
96501 GGTTGAGCTT CCCAGATACG ATCACTTTAT TTGAAGCACC TTTTGCTGTT 
96551 AGGCTGACGG GCTGCTTAAG ACCTAAGGAG TCAACATTGA TTCCTAGGTT 
96601 CGTGATTGTA ATACTCCCAG CTGTAGTTGA TAATGTCGTT CCTGAATCCA 
96651 TGCCGAGGAG AGAACCGGCC TCTTGAGAGA AGCTCGTGCT CTCTAAAGTG 
96701 ACTCCCTTTT GTAGCAATAA CTTTCCTCCG GATAGGGAGA CTGGCTGCGT 
96751 GAATGAAGAT TTTAAATTGT CAGCAACTTT AAGTTCATCT GCTGTTAGGG 
96801 TTTCTCCAGA AAATAGAATC GTTCCTTGAT ATGGATTGAG AGCTCCCGCA 
96851 GAGCCGTTAT TTATCTTCAA TACGTCTGAT GAGGTTCCTT CTGAAGTGAT 
96901 GGGATCATAG AAGAAAATTG TATGATTTTT AGCAGCCCGT AATTCCGTGA 
96951 ATTTCCCGTT ACTTCCTATG TTGATCGCAT TACGTTTAGG AGTATCGGTA 
97001 CTTCCGGTTG TTGTAAGGGT ATTTCTTACA AAGGTAATGT TTCCTGTCTC 
97051 TGCAGAAAGA CTGAGCTCTC CTGAGGCATC GATGCTGATA GCACCCCCCT 
97101 TAGGAGTTGC TGATGAGACA TTATTTCGTA GAAACTCTGT AAAGCCTCCA 
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97151 GAGGAAAGGG CTAGCTTTTT AGCATGGATG GCGCCACCGC TTGTTTCTGC 

97201 TACGTTTGAA GCAAAGATCA GAGTCTTATT GTTAGAGATT ATCAGTTCAG 

97251 GAGATCCACT CGCCTTGGTG TTGCAGATCG CACCGCCAGT AGTTTTCGCT 

97301 GCATTCCCTT CAAAATATAG AAATTTGTTG TTCGATAGTA TCGACGTGCC 

97 351 TTCATCATCG ATAGCGCCTC CTGACGTAGA CGCTATGTTA GATAGGAATC 

974 01 TAACATAACC TGTGTTATTT GCTATGCGAG CGCCTGCTGT AGTAGCAATT 

97451 GCTCCTCCCT TTGTTGATGA AGAGTTGTTA CTAAAAAGAG CATCTCCAGA 

97501 AGTGCCAGTT AAAAGGAAAG ACGCTCCTTT GATAGCTCCA CCATCTGCAG 

97551 TAGAAAAATT CCCAGCAACT ACAAGTTTAC GAATATTTTC TAAATTTACG 

97 601 CCTCCTGCTG AGGAAAGCGT TCCCTGACCT GTAGTAACCG TTGTGCTAGG 

97651 AGAGGAATCA AAACTCAGTA AGGAAAACCC TGAGAAGGTA AGATTCTTAT 

977 01 TTGCTGTTGT AGATGCAGCA GCACCTGCAT GAGTGCCAGC ATCTATAAAG 

97751 CCAAACGTTA AGCTATGACC GTTCCCCAAG AAGGTAAGAT TGTCCGTGGT 

97801 TTGCTTAAAA CAACTGTCAG ATAAGGGAGT GCCTTTTCCA GGCTCGTAAA 

97 851 AGAAGACATC TCCTGTTAGA GAATATGTTG TGGCTGAAGT TTTTGGAGTA 

97901 AACGTTCCTG AATCGATATT TCCATTAAAG CTATCATCAG GTGATAAAAG 

97951 TTCCTCGTTA GCTAGTGACT GTAGGTGACA TGAGAAAGCT AACACGGAGG 

98001 AAACTAAAAC CCAAGGAATC GAAGTCTTCA TGGTAATGCT TTTGTTTTTT 

98051 AGAGAACTAT TCGCATCAAT ATAGAAACAA AATAAGTAAA TCAAGTTAAA 

98101 GATGACAAAA CAGCTGTCAA GAATTTTTAT CTTGACTCTC TGAGTTTTCT 

98151 ATTTTATATG ACGCAAGTAA GAATTTAATA ATAAAGTGGG TTTATGAAAT 

98201 CGCAATTTTC CTGGTTAGTG CTCTCTTCGA CATTGGCATG TTTTACTAGT 

98251 TGTTCCACTG TTTTTGCTGC AACTGCTGAA AATATAGGCC CCTCTGATAG 

98301 CTTTGACGGA AGTACTAACA CAGGCACCTA TACTCCTAAA AATACGACTA 

98351 CTGGAATAGA CTATACTCTG ACAGGAGATA TAACTCTGCA AAACCTTGGG 

98401 GATTCGGCAG CTTTAACGAA GGGTTGTTTT TCTGACACTA CGGAATCTTT 

98451 AAGCTTTGCC GGTAAGGGGT ACTCACTTTC TTTTTTAAAT ATTAAGTCTA 
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98501 GTGCTGAAGG CGCAGCACTT TCTGTTACAA CTCATAAAAA TCIVKUUTA 

98551 ACAGGATTTT CGAGTCTTAC TTTCTTAGCG GCCCCATCAT CGGTAATCAC 

98601 AACCCCCTCA GGAAAAGGTG CAGTTAAATG TGGAGGGGAT CTTACATTTG 

98651 ATAACAATGG AACTATTTTA TTTAAACAAG ATTACTGTGA GGAAAATGGC 

98701 GGAGCCATTT CTACCAAGAA TCTTTCTTTG AAAAACAGCA CGGGATCGAT 

98751 TTCTTTTGAA GGGAATAAAT CGAGCGCAAC AGGGAAAAAA GGTGGGGCTA 

98801 TTTGTGCTAC TGGTACTGTA GATATTACAA ATAATACGGC TCCTACCCTC 

98851 TTCTCGAACA ATATTGCTGA AGCTGCAGGT GGAGCTATAA ATAGCACAGG 

98901 AAACTGTACA ATTACAGGGA ATACGTCTCT TGTATTTTCT GAAAATAGTG 

98951 TGACAGCGAC CGCAGGAAAT GGAGGAGCTC TTTCTGGAGA TGCCGATGTT 

99001 ACCATATCTG GGAATCAGAG TGTAACTTTC TCAGGAAACC AAGCTGTAGC 

99051 TAATGGCGGA GCCATTTATG CTAAGAAGCT TACACTGGCT TCCGGGGGGG 

99101 GGGGGGTATC TCCTTTTCTA ACAATATAGT CCAAGGTACC ACTGCAGGTA 

99151 ATGGTGGAGC CATTTCTATA CTGGCAGCTG GAGAGTGTAG TCTTTCAGCA 

99201 GAAGCAGGGG ACATTACCTT CAATGGGAAT GCCATTGTTG CAACTACACC 

99251 ACAAACTACA AAAAGAAATT CTATTGACAT AGGATCTACT GCAAAGATCA 

99301 CGAATTTACG TGCAATATCT GGGCATAGCA TCTTTTTCTA CGATCCGATT 

99351 ACTGCTAATA CGGCTGCGGA TTCTACAGAT ACTTTAAATC TCAATAAGGC 

99401 TGATGCAGGT AATAGTACAG ATTATAGTGG GTCGATTGTT TTTTCTGGTG 

99451 AAAAGCTCTC TGAAGATGAA GCAAAAGTTG CAGACAACCT CACTTCTACG 

99501 CTGAAGCAGC CTGTAACTCT AACTGCAGGA AATTTAGTAC TTAAACGTGG 

99551 TGTCACTCTC GATACGAAAG GCTTTACTCA GACCGCGGGT TCCTCTGTTA 

99601 TTATGGATGC GGGCACAACG TTAAAAGCAA GTACAGAGGA GGTCACTTTA 

93651 ACAGGTCTTT CCATTCCTGT AGACTCTTTA GGCGAGGGTA AGAAAGTTGT 

99701 AATTGCTGCT TCTGCAGCAA GTAAAAATGT AGCCCTTAGT GGTCCGATTC 

99751 TTCTTTTGQA TAACCAAGGG AATGCTTATG AAAATCACGA CTTAGGAAAA 

99801 ACTCAAGACT TTTCATTTGT GCAGCTCTCT GCTCTGGGTA CTGCAACAAC 
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99851 TACAGATGTA' L'CAUCUUTIV UTACAUTAUU AAL'aLTALO ulimiooui 

99901 ATCAAGGTAC TTGGGGAATG ACTTGGGTTG ATGATACCGC AAGCACTCCA 

99951 AAGACTAAGA CAGCGACATT AGCTTGGACC AATACAGGCT ACCTTCCGAA 

100001 TCCTGAGCGT CAAGGACCTT TAGTTCCTAA TAGCCTTTGG GGATCTTTTT 

100051 CAGACATCCA AGCGATTCAA GGTGTCATAG AGAGAAGTGC TTTGACTCTT 

100101 TGTTCAGATC GAGGCTTCTG GGCTGCGGGA GTCGCCAATT TCTTAGATAA 

100151 AGATAAGAAA GGGGAAAAAC GCAAATACCG TCATAAATCT GGTGGATATG 

100201 CTATCGGAGG TGCAGCGCAA ACTTGTTCTG AAAACTTAAT TAGCTTTGCC 

100251 TTTTGCCAAC TCTTTGGTAG CGATAAAGAT TTCTTAGTCG CTAAAAATCA 

100301 TACTGATACC TATGCAGGAG CCTTCTATAT CCAACACATT ACAGAATGTA 

1003 51 GTGGGTTCAT AGGTTGTCTC TTAGATAAAC TTCCTGGCTC TTGGAGTCAT 

100401 AAACCCCTCG TTTTAGAAGG GCAGCTCGCT TATAGCCACG TCAGTAATGA 

100451 TCTGAAGACA AAGTATACTG CGTATCCTGA GGTGAAAGGT TCTTGGGGGA 

100501 ATAATGCTTT TAACATGATG TTGGGAGCTT CTTCTCATTC TTATCCTGAA 

100551 TACCTGCATT GTTTTGATAC CTATGCTCCA TACATCAAAC TGAATCTGAC 

100601 CTATATACGT CAGGACAGCT TCTCGGAGAA AGGTACAGAA GGAAGATCTT 

100651 TTGATGACAG CAACCTCTTC AATTTATCTT TGCCTATAGG GGTGAAGTTT 

100701 GAGAAGTTCT CTGATTGTAA TGACTTTTCT TATGATCTGA CTTTATCCTA 

100751 TGTTCCTGAT CTTATCCGCA ATGATCCCAA ATGCACTACA GCACTTGTAA 

100801 TCAGCGGAGC CTCTTGGGAA ACTTATGCCA ATAACTTAGC ACGACAGGCC 

100851 TTGCAAGTGC GTGCAGGCAG TCACTACGCC TTCTCTCCTA TGTTTGAAGT 

100901 GCTCGGCCAG TTTGTCTTTG AAGTTCGTGG ATCCTCACGG ATTTATAATG 

100951 TAGATCTTGG GGGTAAGTTC CAATTCTAGG AGCGTCTCTC ATGTCTCAGA 

101001 AATTCTGAGA GAGATCGCAT TTAGGATTTT CTTAAACACG ACTCACCTTG 

101051 TTTTTGAACC AGGAGAGATC GGGGATTAAA AAGGCAAGAG GGCAGAGTTC 

101101 GTGAGGTCAC GTACTCTGCC TTTCTTGTTA CAAACACGTT TTAAAATTAA 

101151 GGAAATTTTT TAATAGAAAC CCGTTCTTTA AAATACGTTT CTTTAATTCT 
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101201 TATTGAATAA GATAATTCAC TATTTTTAGA TCCTAAATTT TAAGTGGTTT 

101251 TTGTTATGCT TCTTATAGAG AATAGCTGCA AAGATTAGAG TTGCAGAGAC 

101301 GGTACGTCTC TTTCTTTTTT AAGGGAAGGG GTGTTGTTAC ACCCATCCTA 

101351 AGATTTGTGA GATTCCCCTC AGGCAGTAAC TTTTACAATC GTACTTTATG 

1014 01 TTTTGATCTA GCTGTTTTCT TGTCTTTAAT TTATTCAACC ATCGAGAAGA 

101451 GAGATCCATG AGTGGAAATG TATTTTATTA GGATCATCTC TAAGGATGGA 

101501 AATGATGAGC CCATTCCAAC AACCTGAGCA ATGTCATTTT GATGTTGTGG 

101551 GAAGTTTCTT ACGTCCTGAA AGTCTTACAC GAGCACGCTC TGATTTTGAA 

101601 GAAGGAAGAA TTGTCTATGA GCAGATGCGA GTTGTCGAAG ATGCTGCTAT 

101651 TCGTAATCTC ATAAAAAAGC AAACAGAAGC AGGTCTTATC TTTTTTACTG 

1017 01 ATGGGGAATT CCGTAGGTAT AGTTGGGATT TCGACTTTAT GTGGGGATTC 

101751 CATGGCGTGG ATCGTCGCAG GGACTCTAAT GACCCTGAAA TTGGAGTGTA 

101801 TCTTAAAGAT AAAATCTCCG TATCAAAACA TCCGTTTATA GAACATTTCG 

101851 AGTTTGTCAA AACTTTTGAG AAGGGAAATG CAAAAGCAAA ACAAACGATT 

101901 CCTTCTCCAT CACAATTTTT CCATGAGATG ATTTTTGCTC CTAATCTGAA 

101951 AAATACTCGG AAGTTTTATC CTACGAATCA AGAGCTAATT GATGATATTG 

102001 TCTTTTATTA TCGCCAAGTC ATCCAAGATC TTTATGCTGC AGGTTGTCGT 

102051 AATTTGCAGT TGGACGATTG TGCTTGGTGT CGCCTCTTGG ATATACGAGC 

102101 GCCTTCTTGG TATGGTGTTG ATTCTCATGA CAGGTTGCAG GAAATTTTAG 

102151 AACAGTTTTT ATGGATCCAT AATTTAGTGA TGAAGGATAG ACCCGAGGAT 

102201 CTTTTTGTAA GTCTGCATGT CTGTCGTGGT GATTATCAGG CCGAGTTTTT 
102251 CTCTAGACGA GCTTATGATT CTATAGAGGA GCCTTTATTT GCTAAGACCG 
102301 ATGTGGATAG TTATCACTAT TATTGGGCTC TTGATGATAA GTATTCAGGA 
102351 GGTGCTGAGC CTTTAGCTTA CGTCTCTGGA GAGAAACACG TCTGCTTGGG 
102401 ATTGATCTCC AG.CAACCATT CTTGTATTGA AGATCGAGAT GCTGTGGTTT 
102451 CTCGTATTTA TGAAGCTGCG AGCTACATTC CCTTAGAGAG ACTTTCTTTG 
102501 AGCCCGCAAT GTGGGTTTGC TTCTTGTGAG GGAGACCATA GAATGACTGA 
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102551 AGAAGAACAG TGGAAGAAGA TCGCCTTTGT GAAAGAGATT GCTAAAGAGA 

102601 TCTGGGGATA AAGAATCCGG AGTTTTTATC GACTCTAAGA GTTTTCGGAT 

102651 CATAGAAAAC ATTTAAATAT TCAAGAGTCT TTGGCTATTG GATCATAGAC 

102701 AGTCTTAGTA TACTAAAAAG TCTTTGGATT CTAAGACGGG CAGAGTTCGT 

102751 GAGATCACGT ACTCTGCCCA TTCTTTCTTG TGATCTAGCG ACTTCTTTGA 

102801 ATCTTCGACC TCTTGTAATC TGGGATTTTT TCTAGTTCTT AGATTCCTCT 

102851 GATCTTTCGA CTTCTCCTCG TCTAAACAAG GCGCATTGTC TTTGAGAAGT 

102901 CCCTAGATAC ACTCAGGATC TCTTAGAATT TCTAAGGGAT CAGGAACGCT 

102951 TTTAGAACTG GAACTTACCT CCAAGATCTG CATTGTAGCT GCGTGAAGAT 

103001 CCACGAATTT CCATAGATAG GTTACTTGTG ACCTCAAGAT TTGGAGAGAA 

103051 GGCATAAAAG ATCCCTGCTC TTCCGATACC AGCTTGTCTT GAGAGATTCG 

103101 TTCCTGTAGT TTTCCACGAG GTATTGTTGA TTAGGAGAGC TGTCGTGCAG 

103151 TCAGGATTCT TACGATAGAC ATCGGCAACG TAGATGACAG TAGCTTCGTA 

103201 AGACGCACGC TCGTTTCTCG AGAATCTCTC GAAGGTAATT CCAATAGGCA 

103251 CAGAGACGTT AATTAAATCA CCGCTATCGA AAGATCGTAC CAAGGTAGTA 

103301 TTACGTTCTT TGAAGCTATC TTGGTGTATG TACGAAGCTT CTACTTTGAT 

103351 GAAAGGAAAA TACGCGTGGA AGAGACCCTC ATGGCTTAAA GCAGTGTGTG 

1034 01 GTAGGGAGCT CGCAAGTTCC AGAGCGCAAC CGTCATTATA CCACGAGCTC 

103451 TCTCCCTTTG GTGCTTGGGT GTAATAGGTT TTCATAGTAT TTTTACTATA 

103 501 GATATAGCTG ATCTGAGCAT CAAAGAGGAC AGGCTGCTCA CTTTCAGATC 

103551 CAGGAAGGTA GCGTAACAAG CTTGGAGAAG ACAAGGTCGC TAGATGCTGG 

103 601 AGATGGAGAG AAGCTGCATA GGCAGAAGCT CTATTTTTAT TTATAAAGTG 

103651 ATCTCTATCT TTCCCGAATA ATTGGCAGAA GGCTGCAGTG ATAAGATTAT 

1037 01 CAGAAGCTAA TGTTGTAGTC GCTCCTACAA CATAACCTGC ACTTATGTGG 

103751 CGAAAACCTT TATTTATCTT CGTGCTATCT TTATGGAAGA AGTTCGAGAT 

103 801 CCCTTCACAC CAGATGCCGC GAGTTTCTTG AGATTGGCGT ACTTTAGTGG 

103 851 CTACAAGCTG TTGTATGGAG CGCACATCAA CAAAGGATCC CCATAGCGTG 
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103901 TTAGCAACTA AGGTTCCACG ACGCTCAGGA TTCGGATTGT ATCCTGTTTT 
103951 TGTCCAGGTA AGAGTCGCTG CTTTGGATTT AGTCGCAGTA TCCTCTTGCC 
104001 AAGATAATGC CCAATTCCCT TGGTATCCCC AATGGATAGG ATTTTTTTCT 
104051 AGGGGATCAG CAGCTAAGTC TGTGATGTGA ATATTCGCGG GGTCGTCAGC 
104101 AGTAAGAGTG AGACAAGAAA AGACTTGAGG GTTATTCCAA GAGACATCTT 
104151 CGTAGACATT TCCAGAAGGA TCTACAAGAG AGAGCGATCC AGATAAAGTG 
104201 ACTGTCTGAC TTGCTTGTGT TGCTTTTAGC GTAGCCTTCT TGGTCTCTTT 
104251 TAAGGAATCT ACATTGAGAA CAAGATTATT GATAGTGATC CCATCAGCGG 
104301 TTTCTAATGT GGTCCCTGCA TCCATGAGGA GGGTAGAGCC CGGAGATTGC 
104351 GAAAAGGACT TAGCAACTAG AGTGACTCCT GATTTAAGAG AGAGTTGCCC 
1044 01 TCCCGCAAGA GTTAGAGGTT GCTGAATTGT AGATTTGAGA TTATCAGCTT 
104451 CTGCAGCTTC TGCTTCCGAG AGCTTCTCTC CAGAAAATAC GATGGTTCCT 
104 501 TGATATGCAG GATTCCCTGC AAGGTCAGGA CCATTTAAGT TTAGAGCATC 
104 551 TGAGAGAGCT GCAGTGATGC TAGTTGTTAT AGGATCATAG AAGTAGATAG 
104 601 TATTGCCTTG AGAGGCTCGC AGCTGTACAA TCTTAGCATT GGTGTTTCCG 
104651 ATGTTAATAG AATTTCTGGT AGTGGTCTGA CTCGAAGAAG CTCCTTTGAC 
104701 TACTGTGTTT CCTTCAAAAG TGATGTCTCC ACCAAGAGCC GAAAGACTCA 
104751 AAGATCCAGA GTCAGCAATC GCAATTGCTC CTCCTAAGGG AGCTGCAGTA 
104 801 TCTATAGCAG AGTTGTTTTT AAAAAGCGTA GGTCCTCCAG AAGAAAGAAC 
104851 TAGATTGTCA GTATAAATCG CCCCACCACT AGTAATTGCT GTATTTCCTA 
104901 TAAAGTTCAG TTCCCCGTTG TCTGATAGAG TTAAGACTGG TTTGGGGGCT 
104951 GATGTACTAC TACAGTAAAT GGCTCCCCCT GTAGCTGAGG TTGCGGTCAC 
105001 ACTATTGTTT ATAAAGCTAA TTGCTTTGTT GCTGCTAATA AAACTGCTAG 
105051 CTTCCGTGTA AATGGCTCCG CCATTGTTCG CCGCGGTATT TTCAGAAAAT 
105101 GATGCTGAGT TTAACGTATT GTTAATTGTA ATCCCTCCCG TGGAATAGAG 
105151 GGCACCCCCT TTTTGCGTTG CTTTGTTTTT GGCAAACGTT AGGTTGGGGT 
105201 TTAGCGATAG ACTGATAGAG CTGCCTTGGA GGGCGCCTCC ATTGTCATTA 
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105251 GAAAAGTTTT GGCCAAAGTA GCAACTATAG TTCGACTGAA TAGAACAAGC 
1053 01 TCCTGTGGAC TTGATGGCTC CTGTTCCTGT GGTAGCATTC GTGGTTTGTA 

1053 51 TTAGTGACAA ATAGGAGAAT CCTGAAAAGG AGAGAAGCTT ATTTGCAGCT 

1054 01 GTATTGGTAA AGGTACAGTT CGCTCCCGCA TCGATATTTT GTAGGAGAAA 
105451 TTGGTAGCCG TGGCCTTGGA AAGAAAGATT CCCAGTAGTT TCTTTAAAGC 
105501 AGGAAGCGGT TAGAGCTGTC GGAGATCCTG CATTGGTGAT TGAGACATCC 
105551 CCTGTTAGAT TATAGATAGT TCCATCTGCA TTTGTTGTTT GGGCTGGAGG 
105601 AGTGTAGGTT CCTGGTCCAG AGAAGCTATT GGTAGGTCCT AGATTGATTT 
105651 CAACAACAGC AGCAAACGCA GAGAAATTTA GTGACAAGGG AAGTGCTAAA 
105701 GATGACGAGA TTAAAAACCA ATGAAGAGAG GATTTCATGT AGAGGGCTAT 
105751 AGGTGGTTTA ACAAATTATT TCACCACATA CTGCAATAAA TTAAAGAAAG 
105801 CAAGAGGAAA GGAGAGACTA GTAAGTTAAG AATCTACAGG GTTTTTATAA 
105851 GAATTCCTCC CTAAAAGTTT AGGGAGGAAA GTAGGAACTA GAATGAGTAT 
105901 CTTAGCCCAC AATCTACATT GTAGATGTGT GCTGAGCCAC GAAGCTCATA 
105951 AGCAGCTTCC CCAGAGAGTT CTACATGAGG GGAGAGAGTC AGATGGCTTC 
106001 CAGCACTTGC TAAGAAGGCT TGTCGTGCGA GGTTTTTACA TAGCGAAGTC 
106051 CAAGAGGCTC CACTGACCAT TAGAGAAGTA CGCGAACGGG GATTTTTACG 
106101 ATACACATCA CCAATGTAGG CTAGAGAAAT CTCGAAATTA TTTTTTTCAT 
106151 CTTCGGAGAT TTTTTCTAAC CGAATGCCGA CAGGGATAGA GCAGTTCACT 
106201 AGGTCTCCAT CATCAAAAGC ACGGGCTTCA GCGCCACTCT CTTTAAAGTT 
106251 TTGTTGGCGG CTGTAGACTG CCTGGAACTT TAAGAAGGGG AAATATCCCT 
106301 GGAAGAACGG TGCTTCTTTA GGGAGATATA GAGCCAGAGA TCCTCCGAGC 
106351 TCTAGAGCCC CAGAGTTATT GGTCCAAGAG CCTTGAGCTT CAGGATAGGA 
106401 AGTATAGCGA GTATCCATAT CATTTTTAGT GTAGCTGTAG CTTAGCTGGG 
106451 CATTCAAAAT GAGAGGAATA TCTTTCAGCA TGTCGGTGAT ACTTCCAAAT 
106501 GAGGGCATGG GAAGTCCTCC TAGGAATGCT CGATGTTGCA GGTATAGCGA 
106551 CGCTAAATAG TTATGAGAGG TATTTTCAAC TATAAACAGG TCTTTATCTT 
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106601 TACCGAAGAG CTGGCAGAAA GCTACACTGA AGATATTTTC AGAAAAATCT 
106651 TCAGCACTTC CTCCAACAAT ATAGCCGTAG CTTTTATGTC GGAATGCTTG 
106701 GTTAGTTCCT GATTTATCCT TATGGAAGAA ATTCGCAGTT CCTGATGCCC 
106751 AGAGTCCTCG TTGCTGATAG ATACTATTCG CTTGAGATGT CATGATCTGC 
106801 TGTAGAGTGC GAATGTCAGT AAAGGATGCC CATAATGAAT CGGGAACTAC 
106851 GGAAGCTCTA CGCTCAGGAT TAGGGTTGTA GCCCGTAGTT ACCCAAGTCA 
106901 TAGTTCCTGA TTTTGCAGTT GATGTGTCTG CCCAAGTGGC TTCCCAATGT 
106951 CCCTGATACC CGTAATGAGG TTCTGGAGTT TGTACTGGAG AAGTGAGAAG 
107 001 CGCATCGATA TAAATATCGC TAGCAGCAGT AGCAGCAGTG AATACCACCA 
107051 AAGGCTGCGT GAAGGCTTGG TTTATCGTAT GGCTTTCATA AAAATTGCCG 
107101 CTACTATCTT GGAAAACAAG AGGAGAGGTT AGAGTTATAG TTTTGTTGGC 
107151 TCCTGCTGTT TCAATGGACA CACTCTTATT TCCCTCTAAG GCAGAAAGAT 
107201 CAACGACAAG TTTGGTAAGA CTGATAGCTT CAGTATCTGC TTTGAGCTTT 
107251 GTTCCTGGTT GCATGAGGAG TGTAGAGCCT TCAGTCTGTG TGAAACCATT 
1073 01 GACATCTAAC TCGACATTTC CTTTGAGTGC TAAGGTTCCA GAGGCTAGAG 
107351 CCAATGGTTG CTTTAATATA GATGTGAAGT TATCAGCAGC TTTCGCTTCA 
107401 TCTGCAGAGA GCTTTTCCCC AGAAAATACA ATCGTTCCTG AATAATCTAA 
107451 AGGCGAGTTG CTATCCGGTT GGTTGATGGT CAGAACGTCT GAAGCTCCTG 
107501 TGGTGTTAGA TGCAATCGGA TCATAGAAAT AGATAGATTG GCCTTGGGCT 
107551 GCCCTTAAGT TCGTAATTTT TGCTGACGAT CCCAGGTAGA TAGCATTCCG 
107601 TGTCGATGTT GGCGCGGAGG TTGAGGTTAG AGTGTTGCCA AGGAACGTGA 
107 651 TGTCTCCTTG ATTTGCAGAG AGACTTAAAG ATCCAGAGTC GGCAATTGCA 
107701 ATAGCGCCGC CCTTGCCTGC AGCTGTGTTC CCGCATCTAT TATTTGAAAA 
1077 51 TAGGGTAGGG CCAGCAGCGG AAAGATCTAG ACCATGGGCA CAGATTGCTC 
107801 CGCCTTGAGT TACTGAAGAG TTCTCGGCGA AGGTCAGACT TTTATTTCCA 
107 851 GAGATAGTAA GAGTAGGAGT CTCTCCTGTT TTTTCACAAT AAATGGCCCC 
107901 GCCCTTGCCT GCAGCATCTG TTGCAGTGTT TCCAGAGAAG AAAAGGGAGC 
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107951 TATTTTGAGT AATCGAGGAG CTGGCTTCAA AGCCCAGAGC CCCACCCCCA 

108001 GTTTCTCCTT TATTATTCAT AAAGACTAAC TGGCCGGTGT TTCCTGAAAT 

108051 ACTTGCAGCC GCAGAGCTAT AGATCGCTCC ACCTAATTTT TTTGCGCTAT 

108101 TACTAGTGAA GGTTATAGAA GAGGTATTCC CAGAAATAGA AAGAGTTTTT 

108151 GTGGTGATCG CTCCGCCATT GTTATTAGCT TCATTGGAGA CGTTTTGGCT 

108201 AAAGAGAATC GTTCCATTAT CGGTAAGATT TAAGGCTCCT GCAGAACTTA 

108251 AAGTACTTTT TCCTGAAGCA ACTGTAGTTC CAGGAGCTGC AATGAAGGAA 

108301 AGGTTAGAAA ATCCTGTGAA TGTTAGGGCT TTATCAGCAG TTGTGCTTGC 

108351 CGCAGCTCCT GCATTCGAAC CCGCATCTAC CGTGTTGAAT GAAAATGAGT 

108401 ATCCCTTTCC AGTAAATGTC AGATCACCCG TAGTTTCTGT AAAGCAGCAG 

108451 CCTGTTAATG CTGTGCCTTT CCCAGCATCG TTTATATAGA CATTTCCTGA 

108501 TAAGACATAG TTCGTTCCAT TGGCATCTGC TGTAGATTTT GGAGTAAATG 

108551 TAGAGCCGCC CGCTCCATCA AAGCTATCTG TAGGGGATAA AGAAGCATCT 

108601 GCTCCGTAAG TTGCAATGCT CAATAGAATG GGAGTGACAA GAGTCGAAGA 

108651 GATCAGGAGT TTGTGCAAGG GTATTTTCAT AGAAAGATGC TTGGGTTCAA 

108701 TTAATTAACA CGTTTTCGAT AATCTAGAAA CAAAACTTAG AGCCTAGGTT 

108751 TGTATTATAA TTTCGTGAAG AACTTCGTAC TTCAAAAGCG AATTGACCGA 

108801 AGATTTCCAT GTGGGGGTTC ACTTGGAAAT GGTTCGCAGC ACGAACAGAA 

108 851 AAACCTTGTC GTGCGAGGTT GGTACCATAG GCCATCCAGT TAGCATCGCT 

108S01 AGCTATTAGG GAAGTTTGAC ATTTAGGATT GCGTCGGTAA GCATCGAGTA 

108951 TATACATAAG AGTAAGATCG TAAGTTCCCT TTTCTGATTT TGAGTCTCTT 

109001 TCGAAGGTGA CGCCTATAGG AATCTCTACG TTGATAAGCT CGCTTTTATT 

109051 GAAAGCGCGT CCTTCAGCAT GACGCTCGTA GAAGTCTTGC TGATGCGCAT 

109101 AGATATACTG TACTTTGACA AAAGGTTCGA CTTCTTTCAG AAGATACGGA 

109151 ACGGAAATAA CAAAAGGCAG GCTAGCTCCA AGATCTGCAC AGAAGGCATC 

109201 GTTTCTCCAA GAACCCTTGA TGATAGAGTT ATCGGTATAA TATGTCTTCA 

109251 TGTGGTTGTC TGTATGGAGA TAACTGAATT TAGCATCGAA CGATAAAGGA 
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1093 01 ATGATCTGGG AGATCTCAGA GAGCACCCAG GGAGCTCGGG TTGCTTTTCC 

1093 51 CCAGAGGAAA TTGGCGATGT CGAAGAGCCC TTCTGTATGG TGGAAATACA 

109401 AAGAGGCACC GTAAGTATCT CCGTGGTTCT TACCTGTAAT ATGATTGCGA 

109451 TCTCTAGCAA AGAGCTGGCA GAAGGCAAAA GTAAGCTGAT CCTCGGCAGG 

109501 AGTTGTTGCT GTGATCCCTA GTGCATAACC CCCGCTGATA TGGCGGAAAC 

109551 CATGGCGGGT GGGCATAGAA TCTCTATAGA AGAAATTCGC AATTCCTGAA 

109601 AGCCATAGCT CACGCTCAAA AGGCTCCCCA CTGGACTTGG TTTCTATAAG 

109651 CTGATTGATC GAGCGTATAT CTATAAAGTT TCCCCATAAG CTATTTAGAG 

109701 GGAGATTACT TTTTCTCTCA GGACTAGGAA TGTATCCTGT ACGGGTCCAG 

109751 TTGATGCTTC CTATTTTTGA GGATGTTGCA TTTGCCCAAG ACAACTGCCA 

109801 GTTTCCTTGA TACCCGTAGT GGGTTTCAGG TTCTTGAAGA GTCAGGGTAG 

109851 AAAGAGCTCC CAGAGTAATC GTTCCGTTGG CTCCTGCGGT GGTAAGTTCA 

109901 AGAAGAGGAT AGGTACTAGC ACTTTTTAAG TTATGATTCT CATAGAATGA 

109951 CCCTTCCGTG TCAATAAGCG CAATCGTTCC CGATAGGCTG ATATTTTTAT 

110001 CTGCAGCTTC TGTTTTTAAA GCTGCCTTGT TGGTTCCATC TAAAGAGGAG 

110051 AGATTTACTG CTAAGCCATT AAGCGAAAGA TTTGCCTCTT TAGCACTAAG 

110101 TGTAGTCCCC CCATCCATTA AGATGCGGGA TCCTGGACTT TGAGTCAGAT 

110151 CCTTGAAAGT TACGGTGACT CCATCACGAA GTACAAGATC TCCCCGCGCT 

110201 AATACTGCAG GTTGTCGGAT AGTAGAGGTG ACGTTTGCAG CGATTGCTTT 

110251 TTCTGTAGGG GAAAGCTTTT CTCCAGAAAA GACAATCGCA CCCCCATACT 

110301 CG ATCTCACT GTTCGCATCT GCTAAGTTTA AGTTCAATGT GTCGGTAGAA 

110351 GCTGCGGTTC CTGGATTTGT GATGGGATCA TAGAAATAGA TAGATTGCCC 

110401 CGTAGCAGCT CGTATCGATG TGACTTTAGC GGTATCAATG ATATTTATTG 

110451 CGTTTCTTGT ACTTGTGCTT CCGTTGGTGA CTTGGTTGTT ATTGAAGGTA 

110501 ATATCTCCAG AAGTAGCAGA GAGAGCGAGT TCCCCAGCAG ATGCTATATT 

110551 GATCGCTCCT CCTCCTCCCT GACCGGCGCT ACTTCCTGAG ATATTACTTT 

110601 GAAATAGAGT AGGACCTCCA GCGGAAATAC TGACCTTGAG TCCAGAGATG 
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110651 GCTCCGCCAT ATGTCAATGC TGTATTATTT GTGAAAGAGA GGTTTTTGTT 

110701 CCCAGTAAGA GTCACTGTTT TATCTGTCGT AGTGCAACAA ATAGCCCCGC 

110751 CCTGAGCTTG AGCGGCTTCC CAAGCACTAT TGCCGTCAAA GATCACTTGA 

110801 AAGTTATCTG TAATCGAACA GTTGTCAGTG CTGTACAGAG CACCGCCAGA 

110851 TCCTTTCGCT AGGTTTTGAG AGAAGGAAAC TATCCCAGGG CTGTTCTCGA 

110901 TAGTTATAGT TCCTGTAGCG TAAACTACAC CGCCTTGCTT CCCTGTGAAG 

110951 GCTTGGTTTC TCGAAAAGCT CGCAAACTGA GATGTCCCTG ATAATAAGAA 

111001 GTTTTTCGTA TTGATAACAC CGCCGTTATC TGACGAGAAG TTCTGAGTAA 

111051 ATATAATTTG GGAATTGCCA GTTAGAGATA GATTCCCCAC AGATTTTAAA 

111101 GCACATTGTC CAGTAGGAGA GAGAAGAAGA GAGGGACAAG AGATAATAGA 

111151 GAGTCTAGAA AAATCATTAA AGAGAAGATT CTTATCTGCT GCTGAGGTAC 

111201 TGGCTACAGT TCCAGCGCTA GAGCCCGCAT TGATAAATGC AAACTTCAGT 

111251 GCATGTTGAT TTCCTTGGAA AGTAAGATCG CCGCCCGCTT CTAGGAAGCA 

111301 TCCTGAGGCT AAGGGAATTC CTAAAGCCCC TGCATTTTGA AAGGATACGT 

111351 CGGAAAGTAA GGAATAGGTA GTTCCTGCAG CAGCGTCCGT AGTGGAAAAG 

1114 01 ACCGTGAAGG TAGTTCCGTT AGATCCATCA TAGCTATTAT TGCTGCTATC 

1114 51 TAAGGTCACC TCTGCCGCGA CTATAGAGAG CGATGAAAAG AGCGGGATTG 

111501 AAGAAAAGAA CAACCAAGAG ACAGAGGACT TCATTTGTAA GCACTTTTTT 

111551 GAAACAAGGA AATTAAATTA GCAAATACTG TAAAGAAAAA AAGAAATCAA 

111601 GGGAAACGCA AGGAATTGAT TGATGCGGAG AATCAGAACC CCAAGGATGG 

111651 CGG ATCTTTT ACTTCTCTTC ATACGGATCC TAAGAATCTC TTTGATGAAG 

111701 AGGGGATGCC CTCCCCCTCT GATACCCTAC AGTGCGATCT CAATAACGTA 

111751 TTCATCTTTA -TAAAAAGTAT GTTTTTCTAA GATTCTCGGA GAATCTTAGA 

111801 AAGAATAACG AGTTCCACAG TTTGCATTAT AGCTTCTTGA GGAGCTGCGC 

111851 AGTTCACAAC TTCCAGAAGC GAAGCAGTCA AGACCATGAA GTAACTTCAG 

111901 ATGTCCAGAA GCCTCAGCAA AGAAAGCTTG TCGTGATAAG TTTGTAGCAA 

111951 ACGTAGACCA CGAGGTGCCA TTTGTTAAGG AGGTCAGGCA GTGAGGGTGA 
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113351 ACTATAGTTC CCGTGTTTGG GTTTGCAGGT GCTACAGGGA CTACAGGCAC 
113401 TGAAGCTATA GGACCATTTT TTGGTTGGGG AGGAGGAACA ATAGCTTTGA 
113451 CAACAGGATT GATGACTAAC TCCTCTATTG TTCCTCCAGA TGCAGGAGCT 
113501 TCCATCGTAA TAGGATCATA AAAATAAATC GTATGACCAG GAGCTGCTGC 
113551 AAGCTTAGTG ATCTTAGCCC CTGCACCTAA ATGGATCGAG TTGGGAGTTG 
113601 AAGTTCCCTC AGTCGCTCGG TTCCCTGAGA AAGTAATATC CCCATCAATA 
113651 GCCTCTAAGG AAAGTTCTCC GCTATCGGCT ATATAAATGG CGCCTCCCTT 
1137 01 GCCTCCAGAA TTATTGGTAA AGGAGACAGG ACCGTTAGCT GTAATCGAAA 
113751 GGTTTTTCGA ATAAATCGCT CCTCCCGAAG TTTCAGCAGT ATTGCCATCA 
113801 AAGTTTATGG ATTCACTGCC TGAGATTACA CACTTAGGAG CATAAATACC 
113851 ACCACCACTT CTTTTTGCCG TATTGTTAAT GAAACTTAAA CTCTCATTTT 
113901 CAGTAAGAGT TAAGCTTTTT GTAGCTATGT CAGACTCTGA GATATTACAG 
113951 AGGATCGCTC CACCACAACC TTCTTGATCT GTAGTTGTTG TTGCTGTTGC 
114001 TGTTGCTGAA TTTCCAGAAA ATACAAGAGC CTTATTTTTG GTAAAGGAAG 
114051 TATTTCCTTT AGTATGTAGA GCCCCTGCTG TCTTTGCTGT ATTTGTGCTG 
114101 AAGGTCACGG TTCCCGTACT TCCCGTAAGA GTAAAATCTT CGGTTTCTGT 
114151 ATAGATCGCT CCTCCTGCAG TTTCGGCAGT ATTGCCATCA AAGGTAAGAG 
114201 TCGTGTTTCC ATGCAGAGCA CACTTGGTCG CATAGATCGC ACCGCCACTT 
114251 ACTGTTGCAG TATTACCAGA GAGACTCACG TTTTCGTTAT CTTCAATCCA 
114301 GAGTCCTTTT TTAGTACTTA CAGATGCTGA CTCAAGAAAC GATAGGATTG 
114351 CCCCACCGCA ACCCTCTTGA TTTGCTGAAG AATTACTCGG GCCCGTAGCT 
114401 TTGTTCCCTG AAAAGAGCAG GTTGGTATTA CCAGACAGAG AGTTGTTGCC 
114451 TTTAGAATAT AAGGCGCCGC CTGTCTTTGC TGTATTTGTG CTGAAGGTCA 
114501 CGGTTCCTGT ACTTCCTGTA AGAGTAAAAT CTTCAGTTTC TGTATAGATC 
114 551 GCCCCTCCTG AAGTTCCAGC AGTATTGCCG TCAAAGGTCA GGGAGCCGTT 
114601 TCCAGTTAGA GTACATTTGG TAGCATAGAT CGCACCACCA CTTACTGTTG 
114651 CAGCATTACT AGTGAGGCTG ACTTCTTGGT TGTTTGCAAT CGATAGTCCT 
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114701 GTTTTATCGC TTACGGATCC TGAATCAATA AAGGCTAGGA TTGCCCCACC 

1147 51 GCAACCCTCT TGATTTGCTG AAGAATTACT CGGGCCCGTA GCTTTGTTCC 

114801 CTGAAAAGAG CAGGTTGGTA TTTCCAGTCA GCGAGCTGTT TCCTTTAGAA 

114 851 TATAAGGCGC CGCCTGTCTT TGCTGTATTT GTGCTGAAGG TCACGGTTCC 

114901 CGTACTTCCC TTAAGAGAAA AATCTTCAGT TTCTGTATAG ATAGCTCCGC 

114 951 CACATCCTGC TGTCGCAGTA TTCTGATCGA AGGTAAGAGT TGTGTTTCCA 

115001 TCCAGAGTAC ATTTAGTAGC GTAGATCGCT CCACCATTCG CAGTTGTTGT 

115051 ATTACTAGTG AAGCTCATTT CTTGATTCTG AGAAATGGCT AATCCAGTTT 

115101 TGTCTGTTGC TGTAGCAAGA TAACAACAGA TTGCCCCACC ACAACCTTCC 

115151 GGGTTATTTG CCTGTGCTGC TGAGCCGGTT GTTTTATTTT CCTGAAAAAG 

115201 TACTTGAGTG TTGCCGGTAA GAGCAAGATT GTCATCAGAG CTCCAAGCAC 

115251 CCCCCGTCTT TGCAGTATTA GATTTGAAGG TAACGACTCC TGTATTGGCA 

115301 TCTAGCGTGC TATCCTTTTC TTTTGAGTAG ATCCCCCCAC CTTTATCTGT 

115351 AGCAGTATTT GAGGAGAAGG TCACCGTTCC TGAGTTTCCT TGGACTGTAG 

115401 TGTTTGCTGT ACTACAGAGG GCCCCGCCAT TTTTTGTGCT AGTATTTTGA 

115451 TCTAAGAGAG CTGCTGTCGT AGTCTTAGCA AGATCGATGC TGTAGGCAGA 

115501 AACTGCAGCT CCATCTTTTT CTGAAGTATT TTTTTGGAGG GTGACACTGG 

115551 CATTGTCAGT AAAAGTCGCA GTACCTCCCT CTGTATTTGT CACACAAATA 

115601 GCACCCTTGC CGCCCGAAGT TCCTGTTGCT GGAGCTGAGT CGATTAAGAG 

115651 TGACGAGAAT CCTGAGAAAG AAAGAGCTGT GTTGGTATTG TTAATTGCAG 

115701 CACCATCATG CGTAAGCGCT ATGGTTTGCA GAACCAATGA GTGATCAGCT 

115751 CCAACAAAAC TCAATGCTCC TCCTGTGTTT GTAAAACAGC TTTTATCTGC 

115801 AGGAGTAATT 'GCAGATACAT TCGTAATAGA AACATCGCTA GTGAGAGTGT 

115851 AGGTAGTTCC TGAAGCATCC GAAGTTTCCT TGGCAGTGAA TGCTGCGCTA 
115901 CCACTACTAC CATTTTCATA GTTATCGGAT GATGAGAGAT CCGTGTTAGC 
115951 AGCCATTAGT GGATGTAGGG AGAAAACTAA AGCCGAAGAG GTAAGTAGCC 
116001 AAGGTAAAGA ATATTTCATG TGTCTTTGGG GAAAAGCTTT TTATCAAAAA 
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116051 TACTCCCATA GCATGTGGCT TTAGGAGCAT GGTGCACCAA TAGAGAATAC 

116101 AGTTAAATAA ATCAAGTAAA TGCTCTGGAG AAGACTCTCA GTTATAGAAG 

116151 TTTCAATCTT. GGGAGAGAAG CATTTAAGGT ATTTTTCTAT ATTTAAGAGT 

116201 CCCTTAAAAC ATAAGGGAAA TGCTTAAGGG TAGGGGAGAA GGTGTACAAG 

116251 CGGTTTTGCT TTTAGACCTT CTTGAATTTT AGAAGGAGAG AGTAAGGAAG 

116301 ATGGTTATCT AAACCACGGA TCCTTATTTG TTCTTTCTGT GTTTCCGTTT 

1163 51 TTCTCTTTCG TCATAACCGT CAGAGAATGG ATTGGAGGGG CTGAGGTTAG 
116401 CCGTGCTTTT TCCGTCTGGA GGCGGAGTTT TAAGAGATTG ATTCGATTTC 

1164 51 CCGCTTTTCT TCTTAGATTG CTGTTTCTGT TCTTCATCTT GATTTTGCTG 
116501 TTGCTGCTGT TTATCTTGGC GATCACGAGG GGAACGGCGG GAAGATGAAG 
116551 AAACTCGAGG AATTGAGGGT TCTTTTCCTC CCTTAGGGTA GACCGGCTCA 
116601 GGGTGTAGAA CCGTCCCCGT GGAGAAATTA GGAGAGCGGG AACTTGCAGG 
116651 CATTATGGGT GTAAACGCAC TGCTCGCTCC ACTACCGAAT GAACTGCTTC 
116701 TTAAATCTTT GAAGTGATAA GGCTGAAAAT CGTCCTTAAA TGGAGGACTT 
116751 ATAGATGCGG GTCGTGTAGA AGCCGCATCT GAGGGTTTTG TTTTAAAACG 
116801 TGAGAGGCTG CCGAAGGAGC GTCTATGCTC AGGATTTCGT GACGAATAGA 
116851 AGAAAGATCC TTGAGCCTCC CGACCGATCC TACGATGACG GGCATCTTGG 
116901 CGTTGTGAAG CTTCGTGTTC CTCCATGCGG GCAGATGCAG AAAGATCTTT 
116951 TTTCTTTTCC ACTGCGATTT TTTTTGCATC GTCGGCGGAT GG AGTCACTA 
117001 GAATAGGCTC CGCGGTCTCT TGGGTTTTCT TTTCCATCCT CAAGAGGTGT 
117051 TCCCTACGGA AATAATCTAA GGTGAGCTTA TTCAAGGACA TGAAGTATAG 
117101 CCCCACCGTA ATCAATCCCA TGACAAACAT AGGGTTGGCA AAGACTAACA 
117151 TCGTCCCACT AGAGGCAACG AAAGCCCCTG CAATAAGACC CGCAGCAATC 
117201 GCTAATACAA TGATAGGAAC GACGATAGCA GTGATTGCCT CACCGATTTT 
117251 CTTGGCCTTC GGACTGTCCA GAATATCAGA AATAAGTAGA GTAACTCCCA 
117301 AAGCTCCCAG GGCAAGTGCC GGAGCGAGAG CATAAAGCAT AAGGCTGTTT 
117351 CCTGAAGCAG TCAGGAGAAT CGAAAGGATC GCAATTGCCG CAAGGGCAAT 
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1174 01 AATGCCTGTG TCATAGACAT AGCGCATTTT AGGATATTTA TCAGGAATAG 

1174 51 TAATAAATCT TTTGAATAAC CCTGTGGATG ACGTTTTTAA ACGTTTTACC 

117 501 ACGCTTGGCT GCCCTGATGG GGATGCTGCT GCAGGAACTT GAGGTTGACC 

117551 TAAAGGGTTT ATAGGGGGTT GGCTCATACT ATCAACTTAC TGTAATTATC 

117601 ATTAGGCCCA TGAATTTTCA TTCATAGGAT ATATTTCATA CTATTATAAG 

117651 ATTTAATAGG ATTTAGTTAG TTCTCTTTTC TTCTGAGTCT TAACTTTTTT 

117701 ATTAAATAAA GTTTATTTGT TAAAATCTTA ACAGATTTTT AACTAAAACT 

117751 TTAAGTTATT TTTATTTGGA ACTTTTAGTC GAAATAAGAC TCGCTTATGA 

117801 GAGGGACATA CTCATCAGCA AATGGAGGGG GCGTGTGAGG TCGTGAGGGT 

117851 GGAGGCTCAT ACGGGGGGAA AAGATTTTTA TGCACTTGGA AGAGGGTAAC 

117901 GCTAGTAGTT ACAGACATGA GAAGCCCCAA GGAGATAAAG ACAACTGGAG 

117951 GGGGGATAAA GACCAGACTA AAGACCAGAC CGATAATCAC AGCAATCGTA 

118001 AGAATGTGGA GGAGCCAAGC CATAGCATAC GAAGCAAGAT GTTGGCAGCT 

. 118051 TTTTGTTTTT ATAGCGTTAA AGAGAGCCCG TAGGGGTAGG GTCAAGGCAG 

118101 CATATGTAGA AGCGCAAGGG ATCAGGATAA CCTTAAGAGC TCCTAAGACC 

118151 GAGGATACCA GTACTTCTAT GGTAAGAGCA GTTCTGGGAT AGCTCTTCGC 

118201 ACAGTTTTTT AATTTTCGAG CTAGTATTCG TTCCGGAAAA ATGCCATTCA 

• 118251 GGTATAGCTG AGAGCCTTGT TTGCAGATAT TTTTGAATCC CATATCCGTT 

118301 TGAAAGAGAA TATTTTATGA AAAATTATGT AAAAATTCTA AGAGGATAGT 

118351 GGTTTTTAGA CAATCGAAAT TCCTGAAAAG GCAGGAAAAT GAGAGACACA 

118401 AGTAGACAAA ATCTCCTGAA GTTTTTGTAT GGGCCTGTAA AAAAATCTTT 

1184 51 CTGGAAACTG GAAATTAGAA GTTCATTACA GCGGAACCAC CGAAAAATGC 

118501 GGAGGTTTTA AAAGTATGGG ACGTTTTATT ATTGTTGTGA TCATCAGCTA 

118551 TCGAGATATC ATTCCCAATA GACCAACCGT AAAATCCCTT AATAAAACTT 

118601 CCCGGCCAAG GGGTGAGCTT CACGTTAGCT TCGATTTCAC GTCCTTGATA 

118651 TTCAAAGATG CCGCGAGAAG AAATTAGGTG ATTTTTGTGA AGTTTTTTTC 

118701 GGAATCTTGT AAGGCGGTAA GCAAGCCCTA AGTTACAGAC AGATGTCGAG 
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118751 CGGTAATCAA TAGAAAAATT CACAGGATAG ATGCAATTGA GAGTTAGTTG 

118801 GTCGGTAGCC TTGTAACTAA CACCTACTAA AGGCCAAGCC TTCTCTTGAT 

118851 GGAGGCCTGT TTCATTAATG ACGCCAAAAA TAGCAGAAAG CTTCTCAGTG 

118901 GCCTGGTATT TTCCAGAAAG AACTCCTTGA TAGAGTCCAT AACCCATCTC 

118951 AATATTTTTA GG ATCCACAA GCCCAGAAAG AATGATAGAC CACTGCCAAT 

119001 TTTTTAAGGA GAGTGTATAA GCTCCTAAAG AGAGGAGAAC ATAGTTATAA 

119051 AAAGAAGTAT CTTGGAAAGT CGCCCACCCA AGTCCATTAG GATCTGTCTC 

119101 CGAAATAGGA AGTGAGCTTT TCCATTGAAT ATCCGCACCT ATATAGCCAG 

119151 TAGAAAACAG TAGCCCAGAA TGCTCTGTAA TCGGAAGTGT GCAGAGAAAC 

119201 GTTCCATCGT ATTGACGATA GCCTATAGTT TGATGAGGCA GCTTTTTAAA 

119251 TTTAGCATCG TTCACCTTTA GGTATTGTAC CTGAGCAGAG AAAGGACGTG 

119301 GAGGAGGATT TTTACATGCT TCTTCATCAA TTCCACAAGC ATCTTGAACA 

119351 ATAAAAATAG GAGTCGAGAG TACGTGTCCC GCAAATGCAG CGATGTGGAA 

119401 GAGCAGTTTG AACATGTTCT GTAAGATTCT CCAACGTTAC TAGAGATTGA 

1194 51 AATGGAATAT ACGTAATTTT TAAATTACTA TCTAATAATT TTCCTACTCA 

119501 GGAGGGACTT TCAAGAAGAT TTCCCTAGAT TTAGGGGGAT GAAAGGACTA 

119551 GAATTTTTCT AATCAGCAAT CGAACAAAAT TACAGGGTTC TTGGAGAGGT 

119601 GCTTAATACT ACGCATAAAT GATTTCTGAG ATGATTTCCG TTTCGGCTGT 

119651 TTCTGTGCTA ACTTGTTTTG GCACTCAGTA ATTTTAAGCT CAAGATTTTG 

119701 GATCGATTCC TTTTTCGCTT CAATTTCATT TAAAATTCTT GTTTTTTCTT 

119751 GATTTTGTGT TGTCAGCTCT GGAAGATGCA CGCTTTTAAT TGTAGAGAAC 

119801 ATCAACCCGT ATCTCTGAAC AAATCCTGAT CCTATAGAGG AAATGTCTTG 

119851 AGATATCTCA TCAATCAGTT TTGTTCCTGT CTGCTTATTT TTAAGAGGGA 

119901 TTGCTAGAAG AAGAGCAAGA ATCAGTGTGA GTACGATAAT TCCCAAGCCG 

119951 ATGCCACAGA TGATCCAGTT TCCAGTATAT CCTGCGTAAC CTGCTGAGAT 

120001 CGTTCCCCCT AAGGTTAACA GTAGTGTTAA GGCAAAACAG AGCGTATGCT 

120051 TTGTGGAAAG TTTGGAGCGA AAGGTCTCCC AAGGAGCCGC TGGAATCGGA 
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120101 TGAGGCTGCG TAATATAAGC ATTAGGAAGA TTGAGAACTT CTGTAGCATG 

120151 AGAGAGAGGG CTGCTCTCGG GGACTGGTGA TGGGGCTACG GAAGTTGCCA 

120201 TGTTGTTTCC TCGAATCGTT GCTAACTAGT TTTGATTTGT CTTTTCATTC 

120251 TTGTGAGAAA GCTCAAAACG TTTTTGATAA AGAGTTGATT GCAGTTTGAA 

1203 01 CTCTTTTTGT GAGAGGTCGC AGAGTTCTTC GTATAACTTT AGCGTATCTT 

1203 51 GAGTGGTTTT TCTCTGTGCA TTGGTTACAA GCTGTAGAGA GGATTCAGGT 

120401 CGAATCTTCT CCCTGATAAA TTGTACTGTC GTATGGAGCT GCTGAGGGAG 

120451 CTGCCGGATG AATTTAATAA ATCCTACCAA GGCTTGTAGG CAGAGAAGAG 

120501 TCAAAATAGT AAGAACAATG CCAACGGCAA TCAACAGAAT GCTTTGGCTA 

120551 TAGCAACCCA AACAGATAAT AGCGATTCCA GCAAGAGCTA CAATCACTAA 

120601 GATCGTAATC GCAGCAATAT GCATAGGAAT GGAGTGGGTG AGAAAAACAT 

120651 TGCCCTTCTC CCCCAAAGCT ACGATCTCCT TATTCGTAAT GAATAAATCA 

120701 GCAGACTCTT CCGGAAGGGA TGAGGGAAAT ACCCCGTTTA AAGTACTAGA 

120751 CACAAAGAGA ACTCTATTAT TTGAGGAAAT AATTTAAGAA AAATGGTATT 

120801 TTTAGTCAAT TAGTAAGCGA GTCATGCCTC TTAGTTATTC AAATTTTTAA 

120851 AACCTTACCC TTCCTATGAG GAGACAAGTA AGAGAAATTA TGCAACAAAC 

120901 TGTAATTGTA GCAATGTCAG GAGGCGTGGA TTCTTCTGTC GTTGCCTATT 

120951 TATTCAAAAA ATTTACCAAT TATAAGGTTA TTGGCATCTT CATGAAGAAT 

121001 TGGGAAGAGG ATCGCGACGG CGGTCTCAGC TCGACTACTA AAGATTATGA 

121051 TGATGTCGAG AGGGTCTGTC TTCAGCTCGA TATACAGTAT TACACCGTAT 

121101 CTTTTGCTAA AGAATATAGA GAAAGAGTGT TCGCTCGTTT CCTCAAGGAA 

121151 TACTCTTTAG GCTACACTCC TAACCCCGAC ATTCTTTGTA ACCGAGAAAT 

121201 CAAATTTGAC CTTCTACAAA AGAAAGTCCA GGAACTTGGC GGAGATTACC 

121251 TCGCTACAGG GCACTACTGC CGATTAAATA CCGAGCTCCA AGAAACCCAA 

121301 CTCCTTAGAG GTTGCGATCC TCAAAAAGAT CAGAGCTATT TTTTATCAGG 

1213 51 AACTCCTAAA AGTGCTCTTC ACAATGTGCT CTTTCCTCTT GGGGAAATGA 

121401 ATAAGACTGA AGTTCGTGCG ATTGCAGCTC AAGCAGCTCT TCCCACAGCA 
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121451 GAAAAAAAAG ATAGTACAGG CATTTGCTTT ATAGGGAAGC GCCCTTTTAA 

121501 AGAGTTCCTA GAGAAGTTTC TTCCCAATAA AACAGGCAAC GTTATCGATT 

121551 GGGATACCAA GGAAATTGTA GGGCAACATC AGGGAGCTCA CTATTATACT 

121601 ATAGGGCAGC GGCGAGGACT TGATCTTGGA GGATCCGAGA AACCCTGTTA 

121651 TGTTGTGGGA AAAAATATAG AGGAAAATAG CATTTATATT GTGAGGGGGG 

121701 AAGACCATCC CCAGCTCTAC CTACGGGAAT TAACAGCTAG AGAGCTCAAT 

1217 51 TGGTTTACCC CTCCTAAATC CGGATGTCAC TGTAGCGCTA AAGTCCGCTA 

121801 CCGTTCTCCT GATGAAGCTT GCACGATAGA TTATAGCTCA GGTGACGAGG 

121851 TCAAGGTGCG ATTTTCACAA CCCGTCAAGG CGGTAACTCC AGGACAAACA 

121901 ATAGCGTTTT ATCAAGGAGA TACCTGCCTT GGTAGTGGAG TTATCGACGT 

121951 TCCTATGATT CCAAGTGAGG GCTAGGGAGA GCAGCTTCCT GCTCCTCTTC 

122001 TTCCCTTTCA AAGGCAACGC GATTTTCAAC CAAGGTTGCT CGTAGCTTGC 

122051 GAGCTTCTTG ACGGCAGGAC TCTTTAAGCA AGAGCTCCGC TAGAGGATCT 

122101 TCAAGGTACT GCTCAATGAC ACGGCGTAGA GGACGTGCTC CCATTTCTGG 

122151 AGAATGCCCC TTCGTTACTA GGAAGGAAAT CACAGAGTCT GGGATGTTCA 

122201 AAGCCATTTG GTAGTTTTTC AGTCTCGAGT CCAGTTTGTT GATCTCTAAA 

122251 TGGATGATCT CCGATAGAGA TTCTTTCTCG AGGGGACGGA AAATCACACT 

122301 TTCATCCAAA CGGTTAATGA ACTCAGGCTT TAAGTGTTTC TTCATAGCAT 

1223 51 GTTCGATTTT CTCTTGGATG ACCTTATAGT CCATATGGGA CTTCAAGCCA 

1224 01 AAACCAATTT CTCCGCTTTT ACGAATGAGA TCAGCTCCCA AATTGGAGGT 
1224 51 CATGATAATA ATGGGATGAC GGAAATCCAC TTTGCGACCA AAAGAATCAG 
122501 TAAGACGTCC TTGCTCTAAA ATTTGCAACA TCAGGTCCAT AATGTCTGGG 
122551 TGTGCCTTTT CTATCTCATC AAAGAGAACA ACGCAGTAAG GACGGCGACG 
122601 TACCTGTTCC GTAAGGTGGC CCCCTTCTTC ATGACCTACA TATCCTGGAG 
122651 GTGATCCCAT CATCTTGGTA GCAGCAAATT TCTCCATGTA CTCTGACATG 
122701 TCTACCTGAA TCAGAGCGTC TTCACCACCG AACATCTCTA TAGCAATTTG 
1227 51 TTGGGCGAGC AGGCTTTTCC CTACACCGGT AGGCCCAAGG AATAGGAAGG 
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122801 AGCCCGTAGG TCGGTTAGGA TCTTTGATCC CTGTTCGAGA ACGTCGGATG 

122851 GCACGGCAAA TGCTGGTAAC GGCATCATTT TGACCAATGA CTTTTCTTCT 

122901 TAACGTGTCT TCTAACTTCA GAAGCTTCTC ACTTTCAGCT TCTGTGAGCC 

122951 TTGCTGAGGG AATTCCTGTT TGTAGAGAAA CTACCTGAGC GACTGCTTCT 

123 001 TCATCTACAG GAACTTGGTG CTCTTCTTTA TGATTTTCCC ATTCCTGTTT 

123051 CATACTTTGC AGACGTTCGC GAAGTTTTTT CTCTTCATCA CGTAAACCTG 

123101 CAGCTTTTTC GTATTCTTGA GTTCCAATGG CCTGCTCTTT GGCCAATTTT 

123151 GTATTTTCGA TTTCAGCCTC TAGCTTCATT AAATCTGTAG GCTGACCCAT 

123201 TGTATTCACA CGGACACGAG CCCCAGCTTC ATCTAAAAGA TCTATTGCTT 

123251 TATCAGGGAG GAAACGTCCA TGAACATATT GATCAGAAAG AGTCGCAGCT 

123301 GCTTTTAAAG CTTCTTCAGT AATGAAGACA TTGTGATGTT CTTCATACTT 

123351 TTTCTTGAGG CCACGTAAAA TCTCAATAGT CTCATCTACA CTAGGAGGGT 

123401 GAACCACGAT TTTTTGGAAA CGACGTTCTA AAGCTGCGTC TTTTTCTATG 

123451 TGCTTGCGAT ACTCATCTAT CGTAGTTGCT CCAATACACT GAATTTCACC 

123501 TCGCGCTAAC GCAGGTTTTA AAATGTTTGA AGCATCGATA GCACCTTCAG 

123 551 CTGCTCCTGC TCCTACAATC GTGTGGAGCT CGTCAATGAA GAGCAAGATG 

123 601 TTTCCATGCT TGCGAACTTC ATCCATGACA GCTTTGATCC GTTCCTCAAA 

123651 TTGCCCTCGA TATTTTGTTC CAGCAATCAT TAATGCTAGA TCTAGAGTAA 

123701 TCAGTCGCTT TTTCCGTAAG GCATCAGGAA CCTCATTCAG AATGATTTTT 

1237 51 TGAGCCAGAC CCTCAACAAT TGCAGTCTTA CCAACTCCAG CTTCTCCAAT 

123801 AAGTACAGGA TTGTTTTTTC TTCTTCGGCA AAGAATCAAA ATCAACCGTT 

123851 CGACTTCTGA AGAACGACCA ATGACAGGAT CGAGCTTAGA CTCTCGGACC 

123901 ATCTCCGTTA AATCATAACC ATATGCTTTC AGAGCAGAAA GCTTTTCGTT 

123 951 TTTGTCAGAA CCTAAGCTAT GACCTAAAGG AGATTTTGAA GATGAAGGGT 

124001 TGCTTCGAGA GGATGAGGAA GAAGACGACG ACGAAGGAGG AAGTTGTAGA 

124051 TTGAAGGTCT CTAATTCTCT AAGAATTTCC TTACGAACCT CTCTTGGATC 

124101 GATATGTAAG TTTTCTAATA CCTGAAGAGC GACACTATCT GATTGATGTA 
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124151 GGATCCCTAA GAGTAAATGC TCCGTCCCGA CATAATTGTG CTCTAAAAGG 

124201 CTGGCCTCTT CATTTGCTGA TTCAAAAGAT TTTTTTACTC TTCCTGTAAG 

1242 51 GGCAGGGTCT CCGTAGACTT GAATTTCTGG ACCATAACCA ATCAGGCGTT 

124301 CCACCTCTTG CCGTGCCGTA TCAAAATCTA TACCGAGGTT GCGTAATACA 

124351 TTAACAGCTA CCCCTTGACC AAGTTTGAGA AGACCAAGCA GGATGTGCTC 

124401 AGTACCCAGG TAGTTATGAT TTAAACGCTG AGCCTCCTTT TTCGCCAGTT 

1244 51 TAATGACTTG TTTTGCTCTA TTAGTGAACT TCTCAAACAT AAAAACCTAA 

124501 AAGACAGGGG TAGAACTTTC CTTAAGCATA TACGAAATTT AAAATAATGA 

124 551 TGCAACTCTT CGCTCTAAAC CAGCAAATTT GGTAAAATTC CTCTGAGTTT 

124 601 AAGGGAAAGT TATGCACAAA CCTTTTGTAT ATGATACAAT AGTTCAGCTT 

124 651 CTTTTGAAAC AGTCTTAATT AGTTTTATGT TTGTTATATG AAAGTTCGTA 

124701 TCGTAGATTC AGGAAAATCT TCAGCGGCCT CCCACATGGC TAAGGACAGA 

124751 GATTTATTAG AATCTCTGCA AGATGGGGAG CTCATTTTAC ACCTTTATGA 

124801 GTGGGAGAAT CCTTGTTCTC TGACGTACGG TCACTTTATG CGTCCAGAAA 

124851 AATTTTTACT TTCCAACTAT GCGGATCTAG GATTGGACGC CGCAGTGCGG 

124 901 CCTACGGGAG GGGGATTTGT CTTCCATAAG GGAGATTATG CTTTTTCTGT 

124 951 TCTTATGTCT GCGACACATC CTTCCTATTC TTCTTCGGTA CTTGAGAACT 

12 5001 ACCATACTGT AAACTCTTTT GTAGCGAAGG TTCTAGAGAA AGTATTTCGG 

12 5051 ATCCAGGGAA TGTTAGCTCC AGAAGACGAA AACTCTTCTT CCAGAGATTC 

125101 AGGAAATTTT TGTATGGCAA AAACTTCGAA GTATGACGTT CTTTTTGGGG 

125151 ACAAGAAGAT AGGGGGCGCT GCCCAACGCA AGGTGCAACA GGGATTTTTA 

12 5201 CATCAAGGAT CCTTATTCTT ATCGGGAAGT TCTTCTGAGT TTTACCAGAG 

12 5251 ATTTTTAAAA CCCGAGGTTC TTGAAGAAAT TATTGAACAA ATCCAGATTC 

125301 ACGCGTTTTT CCCTTTAGGT TTGGAAGCTG CTGATGAAGT GCTGCAGGAG 

125351 GCGCGTCAGC AAGTCAAAGA GGCGTTTATT AAATTGTTTT GTGGTGAGGG 

1254 01 GTTATGATGA GTCGGTTGCG TTTTCGCTTG GCAGCTCTTG GAATATTTTT 

125451 TATTTTGCTG GTTCCTAATT CTGTTTCAGC AAAGACAATC GTAGCTTCAG 
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125501 ACAAGGAGAA GGTTGGAGTT CTTGTTTATG ACAATAGTGT AGAGGCCTTT 

125551 CAACAGATAT TGGATTGCAT AGATCATGCA AATTTTTATG TAGAACTGTG 

125601 TCCCTGCATG ACAGGAGGCC GAACGCTTAA AGAGATGGTA GATCACCTCG 

125651 AGGCTCGTAT GGATCTGGTT CCAGAGCTCT GTAGCTATAT CATTATCCAA 

125701 CCCACGTTTA CCGATGCTGA AGACCAAAAA TTACTCAAAG CTCTCAAAGA 

1257 51 ACGTCATCCC AACCGGTTTT TCTACGTTTT TACAGGGTGC CCACCCTCAA 

125801 CAAGCATCCT CGCTCCTAAT GTCATTGAAA TGCATATCAA ACTTTCTATC 

125851 ATCGATGGGA AATATTGTAT TTTAGGTGGT ACCAATTTTG AAGAGTTTAT 

125901 GTGCACTCCA GGGGATGAGG TTCCTGAGAA AGTGGATAAC CCACGTTTAT 

125951 TTGTCAGTGG AGTGCGTCGG CCCCTAGCAT TTCGTGATCA GGATATCATG 

126001 TTGCGTTCTA CAGCATTCGG TTTGCAGCTC AGAGAAGAAT ATCATAAGCA 

126051 ATTTGCTATG TGGGACTACT ATGCACATCA TATGTGGTTC ATTGATAATC 

126101 CTGAACAGTT TGCAGGCGCC TGTCCTCCAC TGACTTTAGA ACAAGCCGAG 

126151 GAGACAGTAT TTCCTGGATT TGACAAACAT GAAGATCTTG TTCTTGTCGA 

1262 01 CTCTTCCAAG ATCAGGATAG TTTTAGGTGG TCCCCACGAT AAGCAACCCA 

1262 51 ATCCTGTGAC TCAAGAATAT TTGAAACTTA TCCAGGGAGC TAGATCTTCT 

126301 GTGAAGCTTG CTCACATGTA TTTCATCCCT AAGGACGAGC TTTTAAATGC 

126351 TCTTGTCGAC GTTTCTCATA ATCACGGTGT TCATCTGAGT TTAATTACGA 

126401 ACGGCTGTCA TGAATTAAGT CCTGCAATTA CAGGACCCTA TGCTTGGGGA 

1264 51 AACCGTATTA ACTATTTCGC CTTGCTCTAT GGGAAACGGT ATCCTCTTTG 

126501 GAAAAAATGG TTTTGCGAAA AGCTAAAACC TTATGAGCGG GTTTCTATTT 

12 6551 ATGAGTTTGC TATTTGGGAA ACGCAGTTGC ACAAGAAGTG TATGATTATC 

12 6601 GATGATGAAA TTTTTGTGAT CGGAAGTTAT AATTTTGGAA AGAAAAGTGA 

126651 TGCCTTTGAT TACGAAAGTA TTGTAGTTAT CGAATCTCCA GAAGTCGCTG 

126701 CAAAAGCTAA CAAAGTCTTC AATAAAGATA TCGGATTGTC GATTCCTGTA 

126751 AGTCATGGCG ACATTTTCTC TTGGTATTTC CATTCCGTAC ACCACACTTT 

12 6801 GGGACATTTG CAGCTGACCT ATATGCCAGC CTAGCGTCCC TGGGTGCGAA 
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126851 TCTACCAACA GGATCTCTTC TGCAGGCTCT GCAGGGATCC TGCCTGGTTT 

126901 TTTTCTCTGC TATCGTTTAC ACTACGCTTT TATTGTTTGG GTAGAGGGTG 

126951 GACCTTGTTA TCGTTCTTCT ATAAGCATCA AAAAAAATTT ATCGGCATTG 

127001 TCATTGCTGT AGTTTGTGTT TCTTGGTATT GGAGTGGGTT GGGGAGGATT 

127051 CTCTAGAAAA GGTTCTGCAG AGTCCAGCTC ACGTGGGACT GTTTTTACTA 

127101 CCGCTTCAGG GAAGCGGTAT GTAGAGAAAG ATTTCATGGC TATGAAGAAG 

127151 TTCTTTGCTC ACGAAGCGTA TCCATTTACA GGGAACCCTA GAGCTTGGAA 

127201 TTTTATCAAT GAGGGGCTAC TTACTGATTA TTTTCTAACG ACAAGGGTGG 

1272 51 GAGAAAAACT CTTTTTAAAA GTGTACCATC CGGGAGAGAA AATTTTTAGT 
127301 AAGGAGAAAG CTTACCAGCC GTATCGTCGT TTTGACGCTC CTTTTATTTC 

1273 51 CTCTGAAGAA GTTTGGAAAT CTTCAGCTCC CCAGCTTTTA GAGATCCTGA 

1274 01 AGGTCTTTCA ACAAATCGAG AACCCCATAT CAAAAGAAGG ATTTCTTGCT 
127 451 AGAGCCAAGC TCTTTTTAGA AGAGAGAAGG TTCCCTCATT ATGTGCTTCG 
127501 ACAAATGTTG GAGTACCGCA GGCAAATGTT TGCTCTTCCC CCAGATGAAG 
127551 CCTTATCTCG CGGGAAAGAC TTGCGGTTAT TTGGCTACCA GACGATTCAA 
127601 GACTGGTTTG GGGATGCCTA CCTTTCTGCT GCTGTTGAGC TCTTGATCCG 
127651 CTTTATTGAC GAGCAGAAAA AAGTACTTCC CAGGCCCTCA AAACAAGAAG 
127701 CTCGTGACGA CTTTTATGAT AAGGCGAAGC ATGCCTATAC TAAGATCAGT 
127751 AAGAATAAGG AATTTTCCTT AGGATTTGAA GAATTTGTAA ACTCGTATTT 
127801 TCAGTTTTTA GAGATCTCTG AGTCCGAATT TTTCAATATG TATCGAGACA 
127851 TATTGTTGTG CAAAAGAGCT CTTCTCCTAT TGCAGGGAGG CGTTTCTTTT 
127901 GACTTCCAAC CTCTAACTAC ATTTTTCGTT CAAGGAAAAG ATTCCATACA 
127951 AGTAGAGTTC TTTAGACTCC CTAAGGAGTA TAGCTTTAAA ACAAAACAAG 
128001 AGTTAAAAGC TTTCGAAGTC TATTTAAAGT TAGTGAGTTT ACCTAAATCG 
128051 GATAGTTTGG ATGTTCCTAA TGAGATCCTT CCTATAGCGA CCATAAAAGC 
128101 TAAAGAGCCT CGGTTAGTAG GCAGACGGTT TTCTATAGAC TATAAGAGAG 
128151 TCGCTTTGCA AGACTTAGCA GCTACTGTAC CTATGGTTGA AGTGCTGCAC 
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1 ?R901 


TGGCAACAAA ATTCTGAGCA CTTCCAGGAG ATTCTCCAGC AGTTTCCTGA 


1 9R95 1 


CGTTGAGACG TGTCAGTCGT ATAAAGACTT 


CCAACATCTT 


AAGCCTGCGC 




TGCGAGATAA AATTTCTCTT TTCACACGCA 


AGGAAATCTT 


AAGGGCCCGC 


-LZ O J Jl 


CCTGAGAGAA TTCTGCAATC GCTACAGCAA 


GTTCCTAAGC 


AGAGCCAAGA 




AGTTCTCTTA TCTGCAGGGA AGAATAGTGC 


TCTACCAGGA 


ATATCCGACG 




GTCAGCAATT AGCCAAAGTG TTGCTTGAAA 


ACGAGGTTTT 


AGATTTATAT 




AGCCAGGATG CAGAGACCTA TTATACTATT ATTGTTAATA 


GTTCTTTTGA 


1 9 R^l 


AAAAGAAGAA GTGCTTCCTT ATCGTGAGGT 


TTTAAAGAGA 


GATTTGGCCT 




CACAGTTACT TACTTCTCAT GGTCATCTTG 


TTGACATGGA 


GCGTCTAGAA 


liDDDl 


TCTGCGTTGC GTACACGGTA TCCAGGAGAA 


GAAGGCGCTA 


GCCTATGGCA 


1 0 rt n 1 


ACGACGTCTT TGGAAGGTAG TGGAAAACCA 


CAGATTGGGA 


AGGCATCTCG 


± Z O / _> 1 


AGGGGTCTTT CTCTTGGAGC TTAGATCGCT CATTGAAGAC TTTTTCCCGA 


1 *) D R fi 1 


GGAGACAAGG AGCTGCCCCA AGAGTTTGAT 


AGGATTTTCT 


CTATGAAGGT 




AGGAGACTAT TCTTCTGTAT TCATGAGTCC 


TAACGAAGGG 


CCCTGTTATT 


1 0 RQ 01 


ATCAATGCCT CTCTCATTTA CTGTATGATC 


GTCCTGCTAG 


CGTGGATAAA 




CTATTTTTAG CTAAAAGTCA GCTAGATGAA 


GAACTTTTAG 


GATCCTATAT 




GGAACGCTTT ATAGAACAGG GAGTCGTAAG 


GTGATGTGGT 


ATTCTGATTA 




TCATGTTTGG ATTTTGCCCG TCCATGAGAG 


GGTGGTGCGC 


CTCGGGTTAA 




CAGAAAAAAT GCAGAAAAAT TTAGGAGCCA 


TTCTCCATGT 


GGATTTACCT 


1 0 Q 1 ^ 1 


TCAGTAGGGA GTCTATGTAA AGAAGGTGAG 


GTTTTAGTCA 


TTCTGGAATC 




TTCTAAATCT GCTATAGAGG TGTTAAGTCC 


TGTATCAGGA 


GAGGTTATCG 


1 0 Q *5 R 1 


ATATCAACCT TGATTTAGTG GATAATCCTC 


AGAAGATTAA 


CGAAGCTCCA 


129301 


GAAGGTGAGG GATGGTTGGC TGTAGTCCGA 


CTAGACCAGG 


ACTGGGATCC 


129351 


TTCTAATCTT TCTTTGATGG ATGAAGAGTA 


AATTTTTTAT 


TAGATATACT 


129401 


CATTTTTTTC AGAAGATAAG AGGTATTTTT 


TTAAGGCTAA 


AACATTTAAA 


129451 


ATTTATGTCT AAGGTTTAAA AAATACATCA 


GAATTATTCT 


ATGGATCCAG 


129501 


CTAGTCCGGT AGCCCCTCAT GTCCTACAAG 


ATCATGTGCA 


ACTATCTTCT 
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129551 GAAGAATTGT CCGCATTATC TTCCGGGGTA TCTCGTGTGA AGAAGCTTAC 

129601 TATAGCCATC ATGGTCCTTT CATTGATAGC GATTTCTTTG GTAGCCTGTG 

129651 GCCTATTTTT AACGGGATCG GCACCTCTAC AGCTCTCGAT CTGGATTGCT 

1297 01 GCGAGTTGCA TTACCTTATC TATGTTAGTT TGTGCGTGTT GGCGTTATAA 

129751 GATTTCCAAT GCCTTAGAAA AAACTAAGGT AGCGCATGAA AGCTGAGTTG 

129801 GACATTTTAT TGATTGAAAA ATCATGACTA CATTACCTAA GTACGTTCCC 

129851 CGTTCTCGAC AAAATCCCGA TACTCTGACC TTCCTAAAAC GGTATTCTAG 

129901 TGTCCTTCTC CATTCGGAGA ATTCTTTATC TTATCGGATT TTTGCGAAAG 

129951 TGCTTGCTAT TCTCCTCACT TCGTTAGCTG TAGCTTTCGC CGTGACTTTG 

130001 TTTTCTTGTG AAGGTTCTCA ACTGAGACTC TGCGCTCTCT ATATAGGTAT 

130051 AGCTCTTGCT ATTTGTGTTT TACTGACGAT CGTTGTTTAT TGTATCGCAA 

130101 GTAAAATCGC CACAGCTTGC AAAAAGCCGC CTTCCATATC TCGAATTGAA 

130151 ATTGTTTAGA AGCATCTCTG TGTACAAAAG TTCACTAGAA ACTCGACTCT 

13 0201 AGGAAAGTTC CTAGAGAGAG CGACGCTGCG TTCGTGCTTT AGAAATACTT 

130251 GGCTGGTGTT TGGACGAAGA TTCTTTTAGT GGATTGGTGG TGTTTTCAAC 

130301 AACTTCAGTC TCTAGAGGAG CTCTTTGAAT CTTTGCTGAA GGTTTAGAAA 

1303 51 TATCAATACC TGTTAAGCTC ATAAAAGCCA TAGCAATGAG GCCTGTTGTA 

130401 ATGAAGGAGA TCCCCATTCC CTGGAGGTTT TTGGGAATAT CAGAGTAGGC 

130451 GAGTTTTTCT TTGATAGTGG CTAAAATAAC AATAGCGAGC CACCACCCAC 

130501 ATCCCGCTCC TAAAGAGAAG ATCATCATAG GAATAAAAGG ATAACTACGT 

13 0551 GTGATTCCGA AGAGCAGACC CCCTAGGATC GCGCAGTTCA CAGCAATCAA 

130601 GGGAAGGAAG ATCCCTAAGG AGAGATATAG ATTCCTGGAG ACCTTTTCTA 

130651 AAAGAAGCTC TAAGATTTGC GTGAATGCCG CAATCACCAC GATGAAAATA 

130701 ATCAGCTCCA GAAAACCTAG GTTTACAGAA GCTAAAGATG GAGAGATCCA 

130751 AGTTAGAGCT TTAGGGCCCG TGATGAAAGC ATGGACAAAC CAGTTGATGC 

13 0801 TCCCTGTTAC AGTGAGAACA AGGGCTACGG ACATCCCCAA GCCATTGGCT 

130851 GTAGAAACCC TAGTAGAGCA AGCAAGGTAA CTACACATCC CCAAGAAATT 
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130901 CGCAAGAAGG ATATTCTGAA TAAAGGCTGC TTGTAGAAGA ATACCAAAGA 

130951 CATTAAGCCA AGTATACGCA CCTAACCACA TAAACTACCT TTTTCTCTTT 

131001 TTAGAGTCTC GAATGTTAAC AAGCCAAATC ATAATACCAA GTAGGAAAAA 

131051 AGCCGACGGT GCTAGCACCA TAAGACTTAA ATTTTGGTAT CCATCGGGGT 

131101 GGGTTTCGGA AGCATAAACA AATTGAGGGA TGATGCGAAA CCCCATAAGA 

131151 GTTCCAAAAC CAAAGAGTTC TCTGATGACT CCAATGACAA GTAAGACCCA 

131201 GCCGTATCCT AAGCCAGAGG CAAACCCATC TAAGAACGCT GGAATAGGAG 

131251 TCACATGCCT AGCTAGACTT TCAGACCTTC CCATCACGAT GCAATTGGTG 

131301 ATGATAAGAC CCACAAAAAC AGAAAGTGTT TTGGAAATAT CAAAGAAAAA 

1313 51 AGCTTTTAAA AACTGGTCGA TAACAATCAC AAACAAGCTA ATGATAATTA 
131401 GCTGAGTAAT CATTCTCACA CTGTCAGGAG TGAACTTACG TAATAAGGAA 

1314 51 ACAAAGAAAG ACGAGCATCC TGTAACAATG CTGACAGCAA TTCCCATAGT 
131501 AATTGCCGTT TGTACTGTTG TTGTCACTGC CAGAGCCGAG CAAATCCCCA 
131551 AAATCGCAAT GAGAATTTGG TTGTTGCTCC ATAGAGGATC AAAGAAATAG 
131601 CTTTTATAGG ACTTTTTACT TGTCATTCGC CTGTTTTCTT TTCATGGGTT 
131651 AAATTAGAAA AATTTATAAG GAGCTGACGA TAGCAAGCCA GAGATTGTAC 
131701 ATAAGCTTCA GTGACACCGT TGCATGTTAA GGTGGCTCCA GAAATCCCAT 
1317 51 CAATAGCAGA AAGAGCTTTT GGAGAATCTC CCAAAGTAGT ACGCACGGAA 
131801 CCTTTAACTA CCTCAAGCCC TAGGTCTGTT GTTGCAAAAT TTGTAGTTCC 
131851 AGAAGAATCT TGTAGGAAGA TTTTCTTCCC ATAGAATTGC TCTTGCCATT 
131901 CGGGATTTGT AATATTTGCT CCTAAACCTG GAGTTTCTCC TTGTTGGTAC 
131951 CATGCGGTTC CCAATACAGT GTCACCGTCG TTTTTCACTC CTAGATAGCC 
132001 ATGGATGGGG -CCCCAAAGGC CGAATCCTGA TATAGGGAAG ATCAAAGCTT 
132051 GAACTGTAGA AAGGTCTTTC GCAACGTCGG CTCCTGACAT ATTTTCTGTG 
132101 CGAGAGGTAT TCTCTAAAAT GACATAAAAG GGGAGGGGGG ATTGCTGACA 
132151 CGGAGGGGTT TCTTGATATT TCTCAA/CAAA TTCAATGGGA TTCAGATTTT 
132201 TTTCTTCAAA AGAAAATACC TTGCCTTGGG CATCTGTAAG TAGAGGACGG 
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132251 ACAAAGCGCT CGGCATACAG CTCTAATTCA GGATAGGAAA CCTCAGAGAC 

132301 TTTTTTTGTA GCAACTTCAA GAAGTTGTGT TTTTTTATCG AAAGTCGCAG 

132351 GCACCCACTC TTTTTTTTCC TGAATTTGAA ATCTTCCTTT AAAATCTAAA 

132401 ATATGAGCAG CTAAAAGCAT TTGCTTATTG CGATCGAAAG TAGCAGCTTG 

132451 TTCCTGTATT GGGGAGAGCA CATAGTAGAT TGTGGATAAC AGCACTCCTG 

132501 CAAATAAGCT GAGGCCCAGG ATAAAGGAAA CGATGTACCA GGTTTGGTTT 

132551 ATGCGGACGG TATGTTTTGA AGAGCCTTTA GACATATTCT AGACTCCCCT 

132601 TTTTCTATAC TTTCTAACAG CAAAATAGTC GATAAGAGGG GCAAATACAT 

132651 TGCCCAGAAG GATCGCTAAC ATCACTCCCT CAGGATACGC AGGATTGATA 

132701 AGACGAATCA CAATAGTCAT AAATCCTATA AAGAATCCGT AAATCCATTT 

1327 51 CCCTAATTTC ATAGTCGGCG ATGATACGGG ATCCGTAGCC ATAAAGACTA 

132801 AACCAAAAGC AAGTCCTCCG AGGAAAAGCT GCCGATAGGC GGGAATGAAG 

132851 AATCGAGCAG GTGCCCAAGC TCCGTTTTGT CCCACGATGA GTACGCTGAT 

132901 AAACTTAAAG AGCCAGCCTG TGAGAAAGGC TCCTATCCCA AAGGCTGCCA 

132951 TGGTTCTCCA AGAGGCAATG CCTGTAACAA TAAGGAATAT TGCACCCAAC 

133001 AGACAGGCGA AAGTGGAGGT CTCCCCCAGA GAACCTATAA TGTTTCCCCA 

133051 AAAGAGATTC CCAGCTGAGA ACTTCCCAAT CCCATAGATC ACATCGGTAA 

133101 TAGCATAGGC AGAATCGAAC TGTGTGGGAA GCAGCCCCAA TCCTCCCTCA 

133151 GCAACAGGAG CTGTAACAAA CGTTTGAAGT TGTGTAAGAG TGAGATTATC 

133201 TAAAACCCAA CCAGGATGCG TCTCTGTCCA AAGAGAAAAT TGTGAGTGAA 

1332 51 TGACATCTTG AGTAGGGACG TGAGGAATGT GAAGCATATT TGCAGCAATC 
133301 GCATCGACAT GCAGACGCTT TACAGAGGGA GGTGTCGAAT TTAGAGTTTG 

1333 51 TAGGCAGGTA GACTGTGAAA ATCCATCAAT GAGTACTTTT CCTGTCGAGG 
133401 AGTTCATCTT CATGAGGCTA TCTTTAATCA CTCCGGGGTT GCTTCCTACC 

1334 51 CAAACGTCAC CACTCATCTT TGCTGGAAAC GTAAAAAATA AGAATGCCCT 
133 501 TCCTGATAGA GCAGGATTGA GGATGTTCAT CCCTGTGCCT CCGAAGAGCT 
133551 CTTTACTGAC AACAATACCA AAGGCGATCC CTAAGGCTGC CATCCAGTAA 
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133601 


GGAATTGTCG 


GAGGGAGAGT AAGGGGATAG 


AGGATTCCGG 


TTACTAGCAG 


133651 


TCCTTCTGCG 


ATTTTATGCC 


CACGAACTAC 


AGCAAATAGG 


ACCTCACAAG 


133701 


TACCCCCGAC 


AACATAGCTA 


ATCGTAAGTA 


GAGGAATAAA GATCTTAAGT 


133751 


CCTTCCCAAA 


GGATAGGAAC 


TATATGGATC 


TCTTTGTAAA 


CAAAGGATAA 


133801 


ATAACTACCA 


AATCCAGAAA 


TATGTAAGAA 


TTGCTCCATC 


AGCACAGGAT 


133851 


TGCCTGAGGT 


ATAAACGATA GATTGAAGTC 


CTGAATTCCA GATCGCAACA 


133901 


AAGGTCGCGG 


GAAACAAAGC 


GATAACAACA 


AGCATCATCC 


AACGCTTAAC 


133951 


ATCTACAGAA 


TCGCGGATGA 


AAGGAGGCTT 


GGAAGGGGTT 


TCAATAGGTT 


134001 


CGTAACAAAA 


TGTATCTATC 


GCATCGACAA 


TGGGAGTAAA 


GCGCTGATAC 


134051 


TTGTCTTGTT 


GACATAGTTT 


CCAAAGAGAA 


TTTATGAATT 


TTTTGAGCAT 


134101 


TGTGATTGAG 


AGAAAAGTTG 


AAGCTTCTAC 


ACGTTCAAAA 


ACGTAGCAAA 


134151 


CTGCTTAAAA 


TTTTAGGAAT 


AAAAATTTTC 


ATGATTCAAA 


AATAGATGGT 


134201 


ACTTTTTTCG 


TTGCTGTTTC 


CAAAGTTATG 


TTATGGCTGT 


CAAGCTCCAG 


134251 


GAGCCTACTT 


TTGTTCCAAC 


TGCTTGGAAA 


AACTTCTCGT 


AGAAGATAGA 


134301 


GAAGGGCGTT 


GTCTACATTG 


TTTTCGTTAT 


CTTGGTTCTT 


CCGAAACACG 


134351 


TCTATGTAGC 


CAGTGTTCAC 


CCTCTTCACA 


ACTTCAAGCT 


TTCAGCTTGT 


134401 


ACCTTCCTTC 


GCAAACGGCC 


CTCTCGGTAT 


ATGCTCGTGC 


TTGTGAAGGT 


134451 


AAGCGACCCG 


CTCTGCAGTT 


TTTTTCTAAG 


AGTATCGCCT 


TTGAGCTAGC 


134501 


TTCACTGGAT 


GAGACTCCGA 


GTTGTATTGC 


CTATATAACA 


TCGACAATTT 


134551 


CTAGGAAAAT 


CGTAGTAGAA 


GTTGCTAAAC 


TAGAAAAGCT 


TTTACGCATT 


134601 


CCCTTGTGGC 


CGTGGCTTCC 


TAAGAAAAGA 


CAAATAGAAA 


AACTTCCTAA 


134651 


AGGGGAAGGT 


ATCTGCTTTT 


TGTCGGCCTA 


TCCTTTATCA 


CAAAAATGGA 


134701 


TGCAAACTAT 


CGTTGGAGGG AGTGCATCAC CTCTAGTATC TATAAGTCTC 


134751 


TTTCTCTCTC 


AGAATGATCA 


GTAATTCCTG 


CAATTGCAAG 


GTAACCAAGA 


134801 


ATACGTACGC 


CCTCATCAAC 


ACACAACGTG 


TTCGCTTGGA 


TGTCTCCTTT 


134851 


AATGATTGCG 


CCTCCACGGA 


GTTCGACTTT 


TCCAGATACT 


GTGATATTTC 


134901 


CTTCTACAAC 


CCCTTCAATA 


ATGGCTTCTT 


GTAGCTGAAT 


ATCTGCCTTT 
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134 951 ACCACTCCTT TAGGACCGAT AATAATTTTT CCTTTTGAGA CTAAAATGCC 
135001 TTCAAAAGTT CCGTCAATAC GTAGGAGACG TTCAAAAGCA AGTTCTCCTT 
135051 TAAAGGTGAC GCCTTCTCCT AAGGTAGTTT CAGGTTCTTC AAGAGGGAGT 
135101 AGAGATTCTG TTCTTGGAGT TGAGGACCAT TGAGGAAGAG AAGATTCTTC 
135151 AGTTAAATTG TGATTCAAAG GGCGAGCTTC CGAAGCTTTA GGGTTGTCAA 
135201 AAAGACTTGG AGGGGTCTCT GGGCGCTCGG ATCTTGAATA TGGCGAGTAG 
135251 CTGGAAGGTG AAGAAGTTTC TTCTTCGTAA AGTGTTTGCA CATCTTCAAA 
135301 AGGACCTTTT CCTGTTCTAC GGAACATGGG ACACCCCCTA AATTAACCAA 
135351 CAATATGATT TTTACAGAGA TTTACTTGAC GCTTAAGAGT TTCGTCATTT 
135401 TGTAATTTAT GTTCTATAGT TTTACAGGCA TAAAGTACTG TCGAATGAGT 
135451 TTTACCAAAA GCAGCTCCTA TTGCAACTAA AGAATCTGTA ATAAGAGTTT 
135501 TTGCTAAATA CATAGCAATT TGCCGAGCTA ACACAAGATC TTTAGAGCGT 
13 5551 GAGTTTCCCT TAAGATCATT CAGCTTTACT TGGAATACTG TAGCAACACT 
135601 TTTTAAGATC GTTTCTACAG AAATTTTTTG TTTTGTTGGA GAACGGAAGA 
135651 GCTCTTTTAG AGTTTCTCGG ACTGTAGTTT CTGTAAGAGA CTTGCCGAAA 
135701 AGACGACAAT AGGCAGTCAG CTTGTTGATA GCTCCTTCCA ATTGACGGAC 
135751 ATTGCCATAG ATGTGATCCG CAATATAAAA TGCCATTTCA TTAGGAATGA 
135801 GCAATCCTTT TTGCTCCGCC TTGTGCTGTA AAATCGCAAC CCGAGTTTCT 
135851 AAATCAGGGA TGCCGACGTG AGCAACCAGT CCCCATTCCA TTCTAGCAAT 
135901 GATACGCTCG GAAAGTTTGA GCTGACTTGG AGGTTTATCA CTGGTAATTA 
135951 CAATTTGCTT ACTCAGGTTG ATCAAAGTCT CAAAGGTATT GCAAAACTCT 
136001 TCTTCAAAAT TTTGGCGATT CTGTAAAAAT TGAATATCAT CAACAAGAAG 
136051 TAAATCTAGG GAACGATAAA AATTTTTCAT TTTATCAACA GACTTGGATT 
136101 TGAGATGGTA GACAAGATCG TTGATAAACG CTTCTGTAGT GATGCAATGG 
136151 ATGCGTAGAT TTTTATGATG TTCTCTTACG TAGTGACCTA CGGCATGAAG 
136201 TAAATGCGTT TTGCCTAATC CCACACCCCC ATGGATGAAT AAAGGGTTGT 
136251 AGGAGCGGCC AGGTTTCCCA GCAATACCTA CAGCTGCAGA CTTCACAAAT 
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1 \jr\ XXX on\jo 


GACCTTCAAT 


GAAATTATCA 


AAGCGATAGG 


AGAGATTCAG 


U Oj j1 


v.1 1 1 AA1 J. \~ A 


AAATCTTTAG 


TTTCTTCAAA 


GACCTCAGAA 


ATTCCTTCGT 




1 X uA 1 1 U J. X I 


TTGAGAAGCC ACGGGGGCTG AAGGTTTCTT 


GTGTTCTGCA 




AC 1 AL. AAA 1 1 


CTAAAGCAGG 


CTCTCCATGA 


ACATCTAAGG 


GGACAAAAGA 


1 JODUl 


ALAGAGG I C J. 


CTTTTGTAGT TATCAAGAAG 


ATAATTTTGT 


ACAAAAATGT 


1 jODDl 


TGGGGAuTTC 


TAAGCGAATT 


TTCTCTTGAG 


TTTCTTCAAG 


AACTTGAATA 


1 *3 C C A 1 

1 JbbUl 


GGAGAAATCC 


AATTTTCAAA 


AGCCGTTTTC 


GAGCAACGTG 


TCTTAACATA 


1 *5 ^ C ET 1 

136651 


■» mmm * % Tv x TV 

ATTTaaaaaC 


TGTTCCCAAG 


TAGTGCACTC 


GTTACAGGTT 


AACATGCCGC 


1367 01 


TCTCTTTATT 


TATAAAGCTT TCCCAAATAC 


AATCGACCCA 


TCCCATGAGT 


136751 


TV m /■< OO/^ TV 7\ 

GATGGCGAGA 


AAATCTCATT 


GCATCTTGAC 


TATTCATTGC 


ACTGAGTGCA 


136801 


ATGACTCTTC 


GCGTTCAGAT 


TGGATCCTGC 


ATTCCCTGGG 


TGAAGTTCCT 


1 3 6851 


GTTAGGAATG 


GATCCTATAC ACCCTTCCTC 


CGCAGGTAAA 


CGCGGTACGC 


13or Ul 


TCTGCCGATA 


AAATAACTCT 


ACCGATTACT 


AGGTTTTAAG 


GCAAATTGGA 


13 6951 


TCGTTGGTTT 


CGTTAGGCAA 


TAAGGAACCA 


CAAATTCAGG 


AAAAAATAAT 


13 / UU1 


m > rnr* Tv Jv h mmm 

TATGAAAi 1 i 


TGTAATAAAA 


ATGGAAAAAG 


AACTAAAGAA 


ATCCGGAGTT 


137U51 


CTTCAATCAC 


GAAATACGTT 


TTCTATAGGA 


GAAAAAATTA 


ACGAACTAAC 


13 / 1U1 


f*f* 7v r^f* TV mmrpm 

GCAGCATTI T 


TTTTGGTTGC 


TTTACTATAA 


CTCATCAATA 


GAGCTTCAGC 


13 /lbl 


Tv rpo tv rr^m ji Tv 

ATCATTAGCA 


ATTGCTGGTA 


TGGGACAGGA 


TGAAAGGTAG 


GTG GC AATGG 


13 /^01 


C AG TAG CT TC 


TTCAATTCGT 


CCCAAACAGA 


AGAGAGCTTT 


TGTTTTATTT 

J. W * A * A. J* AAA 


137251 


TV TV > /"» Tl/~»m TV O 

AAGAGTGTAG 


GCAGATGATC 


TCCTTGCATG 


CGGAGTGCC I 


GATCTAAAAC 


137301 


AGCAAGCGCC 


TGACTATTTT 


CACCAATTTG 


GAGATAAAGA 


CCTCCAAGAG 


1373 51 


TTTGATGATC 


ATAGATACTT 


AAAGGATCTA 


AGATCACTAG 


AGCTTCAAAA 


137401 


AAAAGAATCG 


CTTTTGAATA 


ATGCCCTTGG 


CGTAGAAAAG 


AATATCCTGA 


137451 


GATTCTGAGT 


TCTTCTAACT CATCATCTCC 


CCAGCCTAAG 


ATTGCTTTCC 


137501 


ATTCATTATC 


CAACATACAT 


TATCCTTGAA 


CAAATTGAAA 


GATACGAGAG 


137551 


ATCACATAGT 


CTATCATCAC 


ATTTGTAGAC 


TTGACTCCAA 


GATCTAGAGC 


137601 


TTTTAAGAGC 


ACTTTCCCTT 


GTTGTACTCT 


AGGATCGTCA 


TCGTCTTCTT 
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137651 CAGGATCTTC GTCTTCCTCT TCTTCTTTAT CTTTGTAAGA CAAAAAGGGA 

137701 GTAATTTGAC TGCGATAGGA AAACTTCCCT CGAGTGAGAA CTTTTAAAAA 

137751 TGAGGAGATT TTTTCTATGT CTTCATCTTG TTGGTCTGGA GATCCTAAAG 

137801 AAGGTGCCAG GTAGGGTGTG GAAAAACGCT GTTTGTAAAA ATTATTTGGA 

137851 GGGGAAAAAC ATGCCCAGTG CGACTTTTGA TTTGTCTGTA AAAGAGACGT 

137901 CAAAGCCGAA GGCTTGGGGT TCATATCCAA AATTTGCGCA TGCTTGGCAA 

137951 CATCACGAAT GGAGATGCCT TCCATCTGGA TTTCTTTGCG AAAGTCGCTG 

138001 ACTATCCTAT TATTGGAAGC ATGTTGCTCA TATATAGACG TGCTATAATT 

138051 AAAAATTTCT ACCATGGCGA GGCCTTAAAA AACCGTCTCT ATCTCTAGAT 

138101 GATAGTGCGG TCTAGGAAAA AGTCAATAGT CTTGATCAGA AGCCTGAACC 

138151 TTTTCTCCTT ATCGAAGGCA CTTAAGAAAA AAGCTCACCC CTATCAAAAA 

138201 ATTTAGAGTG GGGAACTAGC GAGAATTTAA GATAAGGGAA GGCTTTCTTC 

138251 TTTCTATGGA ATCCTGTATC TTTGTCCTTA TTTGAAGCCA AGAGCTTAGG 

138301 AAATTCTTAT GTCCGAACGT GCGCATATTC CCGTATTAGT TGAAGAATGT 

138351 TTAGCTTTAT TTGCTCAACG TCCTCCACAG ACTTTTCGAG ATGTCACCTT 

138401 AGGAGCTGGA GGACATGCGT ATGCTTTTCT TGAGGCGTAT CCCTCTCTAA 

1384 51 CTTGTTATGA TGGCTCCGAT CGAGATCTTC AGGCTTTGGC AATTGCAGAA 

138501 AAACGTTTGG AGACCTTTCA AGATAGAGTC TCCTTTTCCC ACGCCTCTTT 

138551 TGAAGATCTT GCGAACCAAC CCACTCCACG TCTTTATGAC GGAGTTCTTG 

138601 CAGATTTAGG AGTCTCTTCT ATGCAGCTGG ATACTCTATC CCGAGGGTTT 

138651 AGCTTTCAAG GGGAAAAAGA AGAGTTGGAT ATGCGTATGG ATCAAACGCA 

138701 AGAGCTTTCC GCTAGCGATG TCCTGAACTC CCTAAAAGAA GAAGAACTAG 

138751 GGAGAATTTT TCGTGAATAT GGAGAGGAAC CACAATGGAA ATCTGCAGCT 

138801 AAAGCTGTTG TCCATTTTCG TAAGCATAAA AAAATTCTTT CGATCCAGGA 

138851 TGTAAAAGAA GCTCTTCTTG GCGTTTTCCC TCACTATCGT TTTCATAGAA 

138901 AAATACATCC ACTCACCTTG ATTTTTCAAG CTCTACGTGT TTATGTGAAT 

138951 GGAGAGGATA GACAATTGAA AAGTTTACTA ACATCTGCTA TATCTTGGCT 
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139001 GGCTCCTCAG GGACGGCTTG TCATTATTTC TTTTTGTAGC TCTGAGGATC 

139051 GTCCTGTGAA GTGGTTTTTT AAAGAGGCGG AAGCTTCTGG CCTGGGGAAG 

139101 GTAATCACAA AGAAAGTGAT CCAACCTACC TACCAAGAAG TACGAAGAAA 

139151 TCCTAGATCG AGATCAGCAA AACTACGGTG TTTTGAAAAA GCTTCCCAAT 

1392 01 GAACAAAAGT CGTTTTTTAC GTTTATGCTG CTGTCTATGC TTTTGTGGAA 

139251 GTCTCTTTTA TTTCTATATT AATAAGCAGA ACTCGCTGAC GAAATTACGC 

13 9301 CTCGAAATTC CTTGTTTATC TGTACGCTTG CGTCAGCTTG AGCAGCAAAA 

139351 TATTTCTTTA CGTTTTTTAA TTGATAAAAT AGAAAGACCT GATCATTTGA 

1394 01 TGGAAATAGC AGCTCTTCCC GAATACCAAT ATTTGGAATA TCCCTCAGAA 

139451 GAAAGTATCA GTCTTTTATC CTATGAGCTA CCGTAAACGT TCGACTCTAA 

139501 TTGTTCTAGG AGTGTTTGCT CTTTATGCTC TTCTAGTATT GCGTTATTAT 

13 9551 AAAATTCAAA TTTGTGAAGG AGACCACTGG GCCGCAGAAG CTCTCGGGCA 

139601 ACACGAATTT TGTGTCCGTG ATCCTTTTCG AAGGGGCACC TTTTTTGCTA 

139651 ACACGACAGT ACGTAAGGGA GACAAAGACC TTCAGCAGCC TTTCGCTGTC 

139701 GATATTACAA AATTTCACCT TTGTGCAGAT CCTTTAGCTA TTCCCGAATG 

139751 TCATCGTGAT GAGATCATCC AAGGGATTCT CCAATTTATT GAGGGGCAGA 

139801 CCTACGACGA CCTCTCCCTA AAGTTAGATA AGAAATCTCG GTATTGTAAG 

13 9851 CTGTATCCTT TATTAGATGT TTCTGTCCAT GACCGGCTAT CCCTTTGGTG 
139901 GAAAGGATAT GCAACAAAGC ATCGCTTACC AACAAACGCC CTATTTTTTA 
139951 TTACGGACTA CCAACGCTCG TATCCTTTTG GGAAGCTCCT TGGACAAGTT 
140001 CTCCATACCT TAAGAGAAAT TAAGGATGAG AAAACAGGAA AAGCCTTTCC 
140051 CACAGGCGGG ATGGAGGCGT ACTTTAATCA TATTCTGGAA GGGGACGTTG 

14 0101 GAGAGAGAAA -GCTGTTGCGT TCTCCTTTGA ACCGTTTAGA TACGAATCGT 
14 0151 GTTATCAAAC TGCCTAAAGA TGGCTCTGAT ATCTACCTTA CGATCAATCC 
14 0201 TGTGATCCAG ACCATTGCAG AGGAAGAACT CGAACGGGGC GTGCTAGAAG 
1402 51 CTAAAGCGCA GGGGGGTAGG CTCATTCTAA TGAACTCCCA AACAGGAGAG 
140301 ATTCTTGCAC TGGCTCAATA TCCGTTTTTC GATCCCACAA ATTATAAGGA 
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140351 ATACTTCAAT AACAAAGAGC GCATCGAACA TACGAAGGTA TCTTTTGTGA 

140401 GCGATGTTTT TGAACCCGGG TCGATCATGA AACCTTTGAC TGTGGCGATT 

140451 GCTTTACAAG CTAACGAAGA GGCTAGCTTA AAATCGCAGA AAAAGATTTT 

140501 TGATCCTGAA GAACCTATCG ATGTGACCAG GACACTCTTC CCTGGACGAA 

140551 AAGGATCTCC GCTTAAGGAT ATTTCTAGAA ACTCTCAATT GAATATGTAC 

140601 ATGGCTATCC AGAAATCTTC GAATGTCTAT GTAGCTCAGC TGGCTGACCG 

14 0651 CATCATACAA TCTTTAGGAG TGGCCTGGTA CCAACAGAAG TTGCTAGCTC 

140701 TGGGATTTGG AAGAAAAACA GGGATCGAGC TTCCCAGTGA GGCCTCTGGT 

140751 TTGGTGCCTT CTCCCCATCG TTTCCATATT AATGGTTCCC TGGAATGGTC 

140801 CTTATCTACT CCATATTCTT- TGGCTATGGG ATATAATATT TTGGCAACAG 

140851 GGATACAAAT GGTTCAAGCC TACGCTATCC TTGCAAACGG AGGTTATGCC 

140901 GTCCGGCCCA CTTTAGTAAA AAAGATCGTC TCTGCTTCAG GAGAGGAATA 

140951 TCATCTTCCT ACTAAAGAGA AGACACGACT CTTTTCAGAA GAAATTACTA 

141001 GAGAAGTTGT TCGTGCCATG CGTTTTACAA CGTTACCCGG AGGTTCGGGA 

141051 TTTCGAGCCT CTCCTAAGCA TCACTCTAGT GCTGGGAAAA CAGGAACTAC 

141101 AGAAAAGATG ATTCATGGAA AATATGATAA ACGCCGTCAT ATTGCTTCTT 

141151 TTATAGGTTT TACTCCCGTA GAGAGCTCGG AGGGAAATTT CCCACCTTTA 

141201 GTGATGCTCG TCTCCATAGA TGATCCTGAA TATGGTTTGC GAGCCGACGG 

141251 CACGAAAAAT TATATGGGGG GGCGTTGTGC GGCACCCATT TTTTCTAGGG 

141301 TTGCTGACCG CACACTCCTC TATTTAGGGA TTCTTCCAGA CAAGAAGCTA 

141351 AGAAATTGCG ACGAAGAAGC TGCTGCATTA AAGCGTCTCT ATGAAGAATG 

141401 GAATCGTTCT CCGAAACAAG GGGGAACGAG GTGAGGATCT CTATTTCCAT 

141451 CTTGCTATAG ACTTTTACCG TTGAGCAAAG ACTCTCTATC AGAGAGCCCG 

141501 TCTCCTCTTT ATCCTCTATG AGTAGTTTAT GTTATGGCTA GGGTAGGTCC 

141551 TAAACTATAG AAATAACTTT AGCTTTCTTC CCCTAAATAA GAGACCAAAG 

141601 TCTTGATGAG ACGGTCTATT GAAGTTTATG GAAGGGGGAG GTAAGGCTGT 

141651 GTGTTTGGGG ATTTAGATTT GGGATAAAGG AGGCTTCTGT TCGTAGAAAC 
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1417 01 AGGAGAGCGA AATTTTATAT TTCAGAGAAG AGTAAGAACT TTATGGACAG 

141751 TTTTTTGTGA TTGCTTGTAT ACTATCTTGA TTGAATTTTT TGTCGACCTA 

141801 CGAGTAAAGA AATCCTTTAA GCATTTTTTA AAAATCAGAG TGAGAGCATG 

141851 CCCCTAGAGG GCTTTTTATG AAAAAAGTTG TTTTTCAATA GTCCCTGGAG 

141901 CGTAAATGGA TTTAAAAGAG TTACTCCATG GGGTTCAAGC TAAAATCTAT 

141951 GGGAAAGTTC GCCCTCTTGA AGTGCGCAAC TTGACACGTG ATTCCCGTTG 

142001 TGTGAGTGTT GGCGACATTT TTATAGCCCA TAAGGGACAG CGCTACGACG 

142051 GAAATGATTT TGCTGTCGAT GCTTTAGCTA ATGGAGCAAT TGCCATTGCT 

142101 TCTTCACTAT ACAATCCGTT TCTTTCCGTT GTTCAGATCA TCACTCCTAA 

142151 TCTCGAAGAA TTAGAGGCTG AGCTTTCTGC AAAGTATTAC GAATACCCTT 

142201 CAAGTAAGCT CCATACCATT GGGGTGACTG GAACCAATGG GAAAACTACA 

142251 GTTACATGTT TGATTAAAGC TTTATTGGAT AGCTATCAAA AACCTTCAGG 

142301 GCTTTTAGGA ACCATAGAGC ATATCTTAGG AGAGGGGGTG ATTAAAGATG 

1423 51 GGTTTACTAC ACCTACACCC GCTCTTTTAC AGAAGTATTT AGCCACTATG 

142401 GTACGTCAAA ATAGAGACGC TGTTGTTATG GAAGTCTCTT CTATAGGACT 

142451 TGCCTCTGGA AGAGTAGCCT ATACCAATTT TGATACAGCA GTTCTGACTA 

142501 ATATTACCTT AGATCATCTC GATTTTCATG GCACATTTGA AACCTATGTT 

142 551 GCGGCGAAAG CCAAGCTTTT CTCTCTCGTG CCCCCTTCGG GAATGGTTGT 

142 601 TATCAACACA GACTCTCCCT ACGCTTCTCA GTGTATTGAG AGTGCAAAGG 

142651 CACCGGTCAT CACTTATGGT ATAGAGAGTG CTGCTGACTA CCGAGCCACC 

142701 GATATCCAAC TTTCTTCCTC GGGAACAAAG TATACCTTGG TGTACGGGGA 

142751 CCAAAAAATT GCGTGCTCTT CCTCATTTAT TGGAAAGTAC AACGTCTATA 

142801 ACCTACTTGC TGCGATCTCT ACAGTACATG CAAGTTTGCG TTGCGATCTT 

142851 GAAGATTTGC TAGAAAAGAT AGGCTTGTGT CAACCTCCTC CAGGTCGTTT 

142901 GGATCCTGTA CTTATGGGTC CCTGCCCTGT ATATATTGAT TATGCACACA 

142951 CCCCCGATGC TTTAGACAAT GTCTTAACAG GATTGCATGA GTTACTTCCT 

143001 GAGGGGGGAA GACTGATTGT TGTTTTTGGT TGCGGTGGAG ATAGAGATCG 



106 



WOQ027994 fn ie:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\WO0027994.c pc] 



Page 234 of 330 



WO 00/27994 PCTAJS99/26923 

143051 CAGTAAACGG AAGTTGATGG CCCAGGTGGT AGAGCGTTAT GGTTTTGCTG 
143101 TTGTAACTTC AGATAACCCT AGGAGCGAGC CTCCTGAAGA TATTGTGAAT 
143151 GAAATTTGTG ATGGGTTTTA TTCAAAAAAC TATTTCATCG AAATCGACAG 
143201 AAAACAAGCA ATTACATATG CTCTGTCTAT TGCCTCAGAT AGAGATATAG 
143251 TGTTAATAGC GGGAAAAGGG CATGAAGCTT ACCAAATATT TAAACACCAA 
143301 ACAGTTGCGT TCGATGATAA GCAGACTGTT TGTGAGGTAC TCGCTTCCTA 
143351 TGTCTAAGCA ACTGTCGTTT TTTGCTTTAT GTGTGTTAGG AAGTCACCCG 
143401 ATTTTTGCTC AAACACCGAA TCCTCCTCAG CGTGTACGAC GCAGTGAGGT 
143451 TATATTTATA GATCCTGGAC ACGGGGGAAA AGATCAAGGC ACGGCAAGTA 
143501 AGGAACTTCA TTATGAAGAG AAGTCCCTGA CCCTGTCTCT TGCTTTGACG 
143 551 GTTCAAAGTT ACTTAAAGCG GATGGGTTAT AAACCTCAGC TAACCCGATC 
143601 TTCTGATGTA TACGTTGACT TAGGGAAACG CGTTGCTTTG TCGAACCGTG 
143 651 GGCAGGGGGA TGTCTTTATC AGCATCCACT GTAATCATTC TTCAAACGCA 
1437 01 GCAGCCTTTG GCACCGAAGT ATATTTTTAT AATGGTAAGG TCGGATCTCC 
143751 GACTAGGAAT CGCATGTCAG AAGTACTGGG AAAAAACATT TTAGCTGCTA 
143801 TGGAAAAAAA TGGCATTTTG AAGTCTCGAG GTTTGAAAAC TGCGAACTTT 
143851 GTTGTGATTA GAGATACTTC TATGCCTGCA GTTTTGGTGG AAACCGGGTT 
143901 TTTATCCAAT AGTCGTGAAC GTGCGGCCCT GCAAGATGCT CGCTATCGTA 
143951 TGCATGTAGC GAAAGGCATC GCCGAGGGAG TTCATAATTT TCTTTCTGGA 
144001 CCTAGTTTTC AGAAACCAAA ACAGAATATC GCTAAAATAC GTAAACCACA 
144051 GATACAAGCA AATTAGTACT TTAGGAGTTA AAGGCAAAAA ATCGTCCTCG 
144101 ATGCGATTCG AACGCATGGC CTGCTGCTTA GGAGGCAACC GCTCTATCCT 
144151 GCTGAGCTAC GAGGACGCAA AGACCAGCAC TTTACCAAGT TAGATTAAAG 
144201 AAGTCACGTG TTAGATGCAC GCTAGCAATT TAGGGGAAGT TTTTCTCAAG 
1442 51 ATGTGGGAAT GATTTTTCTA GGTTCTAGAA AT AT AG TT AT TCGCATTAAT 
144301 CGATATGGTT TATAGTGATT GCGCATTTTT TAAAAAATGT CTTGAATCCA 
144 3 51 AAGGATGAAT AGATATGATG AGCTTAGACT TCAAAAGTTT ATTATTTACG 
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144401 


GTTACTCAAC 


GAGCTACTTT 


AGGGCACTTT 


AATAGGAGGC 


ATTGTCTAAT 


144451 


ATGGCTACCA 


TGACAAAGAA 


GAAACTAATC 


AGCACGATCT 


CACAAGATCA 


144501 


CAAAATTCAT 


CCTAATCACG 


TACGTACCGT 


GATTCAGAAT 


TTTCTAGATA 


144551 


AAATGACCGA 


CGCCTTGGTT 


AAAGGTGACA 


GGCTTGAGTT 


TAGAGATTTT 


144601 


GGTGTGTTGC AAGTAGTAGA AAGAAAACCA AAGGTAGGAC 


GTAATCCTAA 


144651 


GAATGCAGCA 


GTCCCCATTC 


ATATTCCTGC 


TAGACGCGCT 


GTAAAGTTTA 


144701 


CTCCAGGGAA 


AAGAATGAAG 


CGCTTGATAG 


AAACTCCGAA 


TAAGCATTCT 


144751 


TAATTCTTGT 


AGTCTTCTTT 


GTCTCAGTTG 


TTAGAGTCAG 


ACCGGTTTTT 


144801 


TACCGGGCTT 


GACTCTAATT 


TTTGTTATTA 


TTATCGTTTG 


GTGCAATGCT 


144851 


TTTCTGATCA 


AATTGTGCGT 


GATAATGGGG 


CTGCAATCCA 


GGTTACAACA 


144901 


TTGTATAGAA GTGTCCCAGA ATTCGAACTT TGATTCACAA GTAAAACAGT 


144951 


TTATCTATGC GTGCCAAGAT AAGACATTAA GGCAGTCTGT ACTCAAGATT 


145001 


TTCCGCTACC 


ATCCTTTACT 


AAAAATTCAT 


GATATTGCTC 


GGGCCGTCTA 


145051 


TCTTTTGATG 


GCCTTAGAAG 


AAGGCGAGGA 


TTTAGGCTTA 


AGCTTTTTAA 


145101 


ATGTACAGCA 


GTACCCTTCA GGTGCTGTAG 


AACTGTTTTC 


TTGTGGGGGA 


145151 


TTTCCTTGGA 


AAGGATTACC 


TTATCCTGCA 


GAACATGCGG 


AATTTGGCCT 


145201 


ACTCCTGTTA 


CAGATCGCAG 


AGTTTTATGA 


AGAGAGTCAG 


GCATACGTCT 


145251 


CTAAAATGAG 


TCATTTTCAA CAGGCACTCT TTGATCACCA AGGGAGCGTC 


145301 


TTTCCCTCTC 


TCTGGAGCCA 


GGAGAACTCT 


CGACTCCTAA 


AAGAAAAGAC 


145351 


AACTCTTAGC 


CAATCGTTTC 


TCTTCCAATT 


AGGAATGCAA 


ATTCACCCAG 


145401 


AATACAGTCT 


TGAGGATCCT 


GCACTAGGGT 


TCTGGATGCA 


AAGAACGCGT 


145451 


TCTTCATCCG 


CTTTTGTAGC 


CGCTTCAGGA 


TGTCAAAGTA 


GCTTGGGAGC 


145501 


GTATTCCTCA GGGGATGTCG 


GTGTTATCGC 


TTATGGACCT 


TGCTCTGGAG 


145551 


ACATTAGTGA 


TTGTTATTAT 


TTTGGATGTT 


GTGGAATCGC 


TAAAGAGTTC 


145601 


GTGTGCCAAA 


AATCTCACCA 


AACTACAGAG 


ATTTCTTTTC 


TCACCTCTAC 


145651 


AGGAAAGCCT 


CATCCCAGAA 


ATACGGGATT 


TTCCTACCTT 


CGAGATTCCT 


145701 


ATGTACATCT 


GCCGATCCGC 


TGTAAGATCA 


CTATTTCCGA 


CAAGCAATAT 



WO0 027994 [file :/A\dcwas03\firmdata\lp\FolevPat\PatentDocuments\WO0027994 .cpc) . . . . . Page 236 cf 330 

WO 00/27994 PCT/US99/26923 

145751 CGCGTGCACG CTGCGTTGGC TGAGGCCACC TCTGCCATGA CGTTTTCTAT 
145801 TTTCTGTAAG GGGAAGAATT GTCAGGTTGT TGACGGCCCT CGCTTGCGCT 
145851 CCTGTTCCCT AGATTCTTAT AAAGGTCCCG GAAACGACAT TATGATTCTT 
145901 GGGGAAAATG ACGCAATCAA CATTGTTTCT GCAAGTCCCT ATATGGAAAT 
14 5951 TTTTGCTTTG CAAGGCAAAG AAAAATTTTG GAATGCAGAC TTTTTGATTA 
14 6001 ATATTCCTTA CAAAGAAGAG GGCGTCATGT TAATTTTTGA AAAAAAAGTG 
146051 ACCTCTGAGA AAGGAAGATT CTTTACGAAG ATGAATTAAT TTTGGGTCTG 
14 6101 TAATTGTGTT TAAGAATTGT TTGTATTAAA ATGATTCTTT TTATACGAGG 
14 6151 AGAGCACATT CTAATGGAAC TTCTTCCACA CGAAAAACAA GTAGTTGAAT 
146201 ATGAAAAGGC TATAGCCGAA TTTAAAGAAA AAAATAAGAA AAATTCTCTC 
146251 TTATCTTCTT CAGAGATTCA GAAATTGGAA AAGCGTTTAG ATAAATTAAA 
146301 AGAAAAGATC TATTCGGATT TGACTCCTTG GGAGCGTGTA CAAATATGTC 
146351 GCCACCCTTC GCGTCCCCGT ACTGTCAACT ATATTGAAGG GATGTGTGAG 
146401 GAGTTTGTCG AGCTTTGTGG AGATCGCACC TTCCGAGATG ATCCCGCAGT 
146451 TGTTGGTGGC TTTGTAAAAA TCCAGGGTCA GCGTTTTGTC CTTATTGGCC 
146501 AAGAAAAGGG ATGCGATACA GCGTCACGCC TTCATAGGAA CTTCGGTATG 
146551 TTATGTCCCG AGGGTTTCAG AAAAGCCCTT CGCTTAGGAA AACTCGCTGA 
146601 AAAGTTTGGC TTGCCTGTGG TCTTTCTTGT CGATACCCCA GGAGCATATC 
146651 CTGGATTGAC TGCTGAAGAG AGAGGACAAG GATGGGCAAT TGCCAAAAAT 
146701 CTTTTTGAGC TCTCAAGACT TGCCACTCCC GTGATTATTG TCGTTATCGG 
146751 TGAGGGATGT TCAGGTGGAG CTTTGGGCAT GGCTGTAGGT GATTCTGTAG 
146801 CTATGTTAGA GCATTCCTAT TATTCTGTAA TTTCCCCAGA AGGATGCGCC 
146851 TCCATTCTTT GGAAAGATCC TAAGAAAAAT AGCGAAGCAG CTTCCATGTT 
146901 GAAAATGCAT GGAGAAAACT TAAAACAATT TGGCATTATC GATACTGTTA 
146951 TCAAAGAGCC CATTGGGGGA GCTCACCACG ATCCTGCATT GGTATATAGC 
147001 AATGTTCGAG AGTTTATCAT CCAAGAGTGG TTACGATTAA AAGATCTAGC 
147051 TATAGAAGAG CTGTTGGAGA AACGGTACGA AAAATTTCGC TCTATAGGTC 

109 



WO0027994 [file:/A\dcwasQ3\firmdata\lp\FoleyPat\PatentDocurnents\WO0,027994. cpc] 



Page 23? of 330 



WO 00/27994 PCT/US99/26923 



147101 


TTTATGAAAC 


TACTTCTGAA 


AGCGGTCCTG 


AGGCATAAAA 


ATCATCTCGT 


147151 


TATATTAGGC 


TGTTCTCTAC 


TCGCAATTTT 


AGGACTTACC 


TTTTCATCTC 


147201 


AGATGGAGAT 


TTTTTCTTTA 


GGGATGATTG 


CTAAAACAGG 


CCCCGACGCC 


147251 


TTTTTACTTT 


TTGGACGTAA GGAATCTGGA AAACTTGTAA 


AGGTTTCAGA 


147301 


ACTAAGTCAG 


AAAGATATTT 


TAGAGAATTG 


GCAGGCAATT 


AGTAAGGATT 


147351 


CAGAGACACT 


TACAGTCTCT 


GATGCCACGA 


CATACATCGC 


CGAACATGGG 


147401 


AAAAGCACAG 


CCTCTCTGAC 


GAGCAAGCTC 


TCTAAGTTTG 


TCCGTAACTA 


147451 


CATCGATGTG 


AGCCGCTTTC 


GAGGACTGGC 


AATCTTCTTA 


ATCTGCGTTG 


147501 


CTATTTTTAA 


AGCAGTCACC 


TTATTTTTCC 


AACGTTTCCT 


TGGGCAAGTC 


147551 


GTTGCTATAC 


GGGTAAGCCG 


AGACTTACGT 


CAGGACTACT 


TTAAGGCCCT 


147601 


ACAACAACTC 


CCCATGACCT 


TCTTCCATGA 


TCATGATATC 


GGTAATTTAA 


147651 


GTAATCGTGT 


CATGACAGAT 


TCTGCAAGCA 


TTGCCTTAGC 


AGTAAACTCT 


147701 


TTAATGATTA 


ACTACATTCA 


AGCCCCAATT 


ACCTTCATAT 


TGACATTGGG 


147751 


AGTCTGTCTG 


TCGATTTCAT 


GGAAGTTTTC 


AATTCTTATT 


TGTGTTGCCT 


147801 


TTCCTATCTT 


TATCCTTCCC 


ATTGTCGTGA 


TCGCTAGAAA 


GATCAAAAAT 


147851 


TTAGCAAAAC 


GTATTCAAAA 


GAGTCAGGAT 


TCATTTTCCT 


CCGTTCTTTA 


147901 


TGATTTTCTT 


GCTGGGGTTA 


TGACAGTAAA 


AGTCTTTCGT 


ACAGAAAAAT 


147951 


TTGCCTTCAC 


AAAATATTGT 


GAGCATAACA 


ATAAGATTTC 


TGCTTTAGAG 


148001 


GAGAAAAGTG 


CTGCTTACGG 


TTTGCTTCCA 


CGACCCCTCC 


TGCATACCAT 


148051 


AGCTTCTTTA 


TTTTTTGCTT 


TTGTCGTCGT 


TATCGGAATT 


TATAAATTTG 


148101 


CTATTCGTCC 


CGAAGAACTT 


ATCGTATTTT 


GTGGTTTGCT 


CTACCTAATC 


148151 


TACGACCCTA 


TTAAGAAGTT 


CGGGGATGAA 


AATACCTCCA 


TCATGAGGGG 


148201 


ATGTGCTGCT -GCGGAGAGAT 


TTTATGAAGT 


CTTGAATCAC 




148251 


ATAGTCAAAA 


AGAAAGAGAA 


ATCGAGTTCC 


TTGGACTTTC 


TAATACAATC 


148301 


ACATTCGAGA 


ATGTTTCCTT 


CGGCTATCAG 


GAAGATAAGC 


ACATCCTCAA 


148351 


AAATCTAAGC 


TTTACCTTAC 


ATAAAGGCGA 


AGCTCTAGGC 


ATTGTAGGAC 


148401 


CTACAGGATC 


TGGAAAAACA 


ACACTTGTTA 


AATTACTTCC 


TAGGCTCTAC 
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1484 51 GAAGTCTCCC AAGGAAAGAT TCTTATCGAC TCTCTTCCTA TTACGGAATA 

14 8501 TAACAAAGGG TCCTTAAGGA ATCACATCGC CTGTGTATTA CAGAATCCTT 

14 8551 TCTTATTCTA TGATACTGTA TGGAATAACC TTACCTGTGG TAAGGATATG 

14 8601 GAGGAGGAGG CTGTTTTAGA AGCTCTAAAA CGTGCCTACG CTGATGAGTT 

148651 TATTTTAAAG CTCCCTAAAG GAGTCCATAG CGTGCTCGAA GAATCTGGGA 

148701 AGAATCTCTC AGGAGGACAG CAGCAACGTT TGGCAATAGC ACGTGCTCTG 

14 8751 TTGAAAAACG CCTCCATCTT AATTTTAGAT GAGGCAACGT CAGCTCTAGA 

148801 TGCCATTAGT GAAAATTACA TTAAGAATAT CATTGGAGAG CTTAAAGGAC 

14 8851 AGTGCACACA AATCATTATT GCGCACAAGC TGACCACTCT TGAACATGTA 

14 8901 GATCGCGTGC TCTACATAGA AAATGGTCAA AAAATTGCCG AAGGCACAAA 

148951 AGAAGAACTC TTACAGACGT GTCCTGAATT TTTAAAAATG TGGGAGCTCT 

149001 CAGGGACTAA AGAATATAAC AGGGTCTTTG TTCCTGATCA CAAATTAGTC 

149051 GCAAATCCTA OGGACATGGC AATAACAACT TAGGTGGGAT CGCTCTCTCC 

149101 ATGAGCTCAG GCAACAACTC TACAAGTGTC TGAGTTAGCT TTTGTGATAC 

149151 CTCCTCCAAT CTGCTGAAGG GACAGTCTCC TGGAACAGTA TAATCAGAAG 

149201 TGATCTTGAG AAAAGAACAG GGGATGTGAT GTTCTGCTGC TTGTGAGGCT 

1492 51 ATAGCATAGC CTTCCATATC TAGAAGTTTA AACGTCTTAT GAAACCCATA 

14 9301 ATGGTACAAT ACTGGAGAGG TAACCAGAGA GCTTTTAGGT AGAGAATCCG 

14 9351 GTAGAGCGTC AAAGATATAA GGGGGATCTT CAGAGAGAAC AGGAGGTGTA 

149401 TCCGTAGTGA GGTTTGCAAT TTTCTCAATA GTGTAACATT GACCTAAAGG 

1494 51 AATCTCGGGA GAACATGCCC CCACAAAACC TGGATTGATC CACAGATCGT 

14 9501 AATCTGTATA TGCTTGGCAA TAGCTTTGAA GAGCATTTAA AACGGCTGTA 

149551 CTTCCCCAAA CATGGACAAT ATAGAGATQT AGATGGTAGT CAGTACAACG 

149601 ATAACTATAG AGATGCTCGT TGATCTGTGT AAAATCAAGT TGTTCAATTA 

149651 GAGGAGAAAT TTCTCTATAG TCTGCAACAA TGCAAAGGAT TTTTTTAGGT 

14 9701 GTATTGACAG CATTCATTGG CCTTCCAGAG CATATGTAAA GCTTTTTTCC 

149751 CAGTTTTAGA TAGTTGAAAG GTTTCTTTGT TGATATAGGT TCCTATGAAT 
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14 9801 CTATGAATCA CGGTCACGTT TTTATTTTTA GAGTATTCTA CTGCTTTTGC 

149851 TCCCGCAGTT ATAGGATCTT TCAGGGAGCA AATTAAAGAC TTTCTTAATG 

149901 CTGCTGTTAG AGCATCCACT GTAGCCATAG GAACATATTT CGCAATGGCT 

149951 AAACATCCTA AAGGAAGGGG AAAGATGGTC TTACGGCGCC ATAGCTCTCC 

150001 AAAGTCTGCC CGCAATGTCA ATTGGAGATC GTAGCTGAAG CGCTCTTCAT 

150051 GAATCAGAGC GCCTCCATCG ACTTTCCCTT GCAGTATCGC GGATAGAATT 

150101 TTGTCATAAG GCATGGGAAT GAGTTTTGCC TTGGGATAGT AAAGTTTACA 

150151 GAGAGCATGA GCGGTTGTCA TCTCTCCAGG AGTTGCCAAG GTATCTAGAG 

150201 AACATTCAGG ATCTAAGGAG AGGACGATAG GACCGCTGTT GTATCCTAAG 

150251 GTATTTCCTA CGTCCATAAG ATTATAATAA TCAGAAACTA GAGGGAAGAG 

150301 CGCTGCTGAC ATTTTCATTA GGGAGAGCCG TCGCTGCAGA GCTAGGGTAT 

150351 TCAAAGTTTC AATATCCGCA ATTGTTACCT GGTTAAGAAG AGGCCTGAAT 

150401 TGGGGGTCTT TTAAGAAAGA ACGAAAAAGG AAAATATCAT TCGGGCAAGG 

150451 AGAAAAGGCA GCAGTCAGTA TCATGTCGGT TGATGTAATA GAGCTATAGC 

150501 GGCTTTGATG TCTTTATTTT CAGGCTTATC CAAAGCTCCT TGGTTTTCCA 

150551 GCCATTCGAA GTAAGACTTA GGAATATCCA CAAGAGGCTG CCCTTTGTAT 

150601 TTGCCAAAAG GCATTTTGAA GACTTTCGGG TGATAGCTCT GTTGCAGCAA 

150651 GTCGAGGACT TGCTGGGGCG GTAAATCACC GATTAAAGAA GTAAATACCT 

150701 TGTGCAATAT CACTACGTCA TCTAGAGCTC GGTGTGCTTG ATTTTCAGCA 

150751 AAACCGTAAA CTTGTCTTAG GTATTGTAAA TTATGTTTTG GTAGATCGGG 

150801 GCGATATTTT TGTGCCCATT TTAGAGAGTC TATTGTACGG TTTGTCAGAG 

150851 GCTCTAAGGA ATGTCTGCGA CATTCCTTAC CGAGTAGGGG GAAATCAAAA 

150901 CCGTCATTAT TATGAGCCAC TAAGATGCTG TCCTCTCCGC AAAATTTCCT 

150951 AAATCCCTCG TAGGCTTCAG GAAATTTGGG AGCAGAAAGT ACCGCATCCG 

151001 TAGTGATTCC ATGAATTTTG GATGCCTCAT CAGGAATGGG AATTTCCGGA 

151051 TTCACATAAG TAAGAAAGGA CTCATCTGTG ACACTATTGT AGGCAGCAAT 

151101 TTCTATAATG CGATCTCTTT CTATTTGTGT TCCTGTGGTC TCCGTATCAT 
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151151 AGAAAATAAG AACATCCATA GTTTGACTAC TCATTACGTT TTTCTTGTTG 
151201 CTCTTGAAGA GCCTGACGTC TTAGTTCATC CAAATTCATA TTCCCAGAAG 
151251 AGATCAACCC AATAGCATGA GAAAAACTAT CACAGACTAG CTTTATTGTA 
151301 TCGATATATA TCCGTAATAG TGTGTCATGA ATTTCTCCGT TTAGGCAGGG 
151351 CAACACAAGC CGATAAAATA TCAATCCCTG TTCTTCATCC ATGCCAAAGC 
151401 CGGGAATATC AATGTCCCTA TTTAAGAGAT GGAGTAAACG AGCTGTTGAT 
151451 GCCTTATGAG ATTCATGCAA TTGGTAGGGA AGGTAACAAA TCAACTGCAG 
151501 TATTTCTCCC TCACTGCGGA TTACAAAAAA TAAAGGGAGT TCATTGCCAT 
151551 TAGCTTGAAT GTTAATGTAA GTAAGACCGC TTTCTCTTTC TAAGAAAGGT 
151601 TCTTCATCCG AACTTTTAAG AAATTTTGTG AGATTATTTT GATTTAATGT 
151651 CCATGTCGTC ATTTAGGAAA TACTCCAAGT TGTTCCTAGA GCCTGCATCA 
151701 TTGCTGGCTG ATAATACTAG ATCTAATCTT GATTCGTCTG TTGTTTTTTT 
151751 TGTATCTTTT GTATTGCTTT ACTGAGGAAA TCAGAGGTTG CAGAGGAAGA 
151801 AGACTGCTTA TTATTTTCTT GAAGTAGATG ATTGATATCA GTGTCGGGAG 
151851 GAAACTCTGT CAGTAGCGTA ATTTTTTCTG GTGTTTCTGC AATCACAAGA 
151901 GTTTTATTCA CAACTCGAAT GAGGTAAATA GAAGTTTTCG GCGTTAGGGA 
151951 ACGTCGTTCT AGGATTTTGA TTTGAGACGA GCCTCCAAAA CCGTGACTTC 
152001 TTGATCTCAC AAACTTTTTA AACGCCCAAA CTCCAAAGCC AAAAATTGTT 
152051 AAAAGTAGAA TCAAAGATCC TAGCATTTTA AACATTTCTA ATTTCATGCT 
152101 TCCTGGGAAC ATTTCATGTA CAGAAATGGG CTCTTGGATC GTTTCTGCAA 
152151 GAGCAAGCTC ATCAGAAAGC TTAAAAACTA AAGAAAAAAG ATTAAAAAAC 
152201 ATGTGTAAGA CCCGCGATCA TCTCTATAAA ATTATAGTGG TAGCCCGATT 
1522 51 TGTATCCAAC TACATACAAG TAATAATGAA GTATAGTTTT AGTCGATGCT 
152301 ATATAAATTA TAGTACAATG ATTTCCAAGT ACAGATTAAA CCGTAATCAT 
152351 GTATATTCCT GCAAGTACCG TCTAGAGAGC TCCCCTAGAT GATTTGGGTA 
1524 01 ATTCACAGAC TCCTTTATAC CCTTCTAGGG TGTGCTCGTT CCACAGAGCC 
152451 CAAGCTCTTG TCTTTCATAG ACAAAACGAC AGCAGTCTGT CGGTGGATGC 
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152501 AAAGAATTTT TTCAAGTTGC CTGAGAAATT CCTTGGCAGC GCTTAATAAA 

152551 ACTATGGTGA TGCTATGGAA AAGTTACTAG TGACTGATAT TGACGGTACA 

152601 ATTACCCATC AATCTCATCA TTTAGATAAA AAGGTGTATG AGCGGCTCTA 

152651 TGCGCTGCAC CAAGCTGGTT GGAAGTTGTT TTTCTTGACG GGAAGGTATT 

152701 ATAAATATGC TGCACGCTTG TTTTCTGATT TTGATGCTCC ATATTTATTA 

152751 GGATGCCAAA ACGGCGCTTC TGTATGGTCT TCAACATCAT CAAATCTTCT 

152801 CTATTCTAAA AGTTTACCCT CAGATTTATT ATGTATTTTA CAAGATTGTA 

152851 TGGAGGGGGC AACGGCTCTT TTTTCCGTGG AATCAGGAGC TCCTTACGGG 

152901 GATCACTACT ATCGCTTTTC ACCGACTCCT ATAGCTCAAG ATTTACACGA 

152951 ATATGTAGAT CCTAGGTACT TTCCTAATGC TAAGGAAAGA GAGATCCTAT 

153001 TTGAAACGCG CTCTTTAAAA GACGACTATG CTTTTCCTAG TTTTGCTGCA 

153051 GCAAAAGTCT TTGGACTGCG AGATGAGGTC ATCAGAATTC AAAAGGAGCT 

153101 GGAACGCCAA GAAGCACTGA CTTCAGTCGC GACGATGACG TTAATGCGCT 

153151 GGCCCTTTGA CTTTCGCTAT GCCATCTTGT TTTTAACAGA TAAAAGCGTC 

153201 TCTAAAGGCA AAGCCTTAGA TCGTGTTGTC AATATACTTT ATGATGGAAA 

153251 GAAACCCTTT GTCATGGCTT CAGGAGATGA TGCTAATGAT CTCGATCTTA 

153301 TTGAGAGAGG AGATTTTAAA ATTGTGATGA GTTCCGCACC TGAAGAGATG 

153351 CACGTTCATG CGGACTTTCT AGCTCCCCCA GCAGATAAGA ATGGCATTCT 

153401 TTCAGCTTGG GAAGCTGGTG TCCGCTATTA TGACGACCTT ATGAGTCTTT 

1534 51 AGGGAACATC TCAGGACCAA TTCCCATCAC ATTGGCTCCG TGATCTACGT 

153501 ATAAGGTCTC ACCAGTAATT GCTGAAGCTA GAGGTGATGC TAAGAAAGCT 

153 551 GCAACGGCAC CCACCTGCTC GGCATTCATA GCCTCGGGAA TAGGCGCCCA 

153601 CTCTTGGTAA TAGTCTACCA TTCTTTCAAT AAAACCAATT GCTTTTCCAG 
153 651 CTCGGCTTGC TAAAGGTCCT GCAGAGATGG TATTGACACG TATGCCCCAA 
153701 CGGCGTCCCG CTTCCCAAGC AAGAGTTTTG GTGTCACTTT CCAAAGCTGC 
153751 TTTTGCCGAA CTCATGCCCC CTCCGTATCC AGGAACAGCG CGCATAGAAG 
153 801 CCAAATAGGT GAGCGATATT GTCGATCCAC CACGGTTCAT GATACTTCCA 
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153851 AAGTGAGAGA GAAGGCTAAC AAAAGAATAA CTAGAGGCAC TGAGAGCCGC 

153901 TAAGTAACCT TTTCTTGATG TTTCTAATAG AGACTTAGAA ATTTCAGGAC 

1539 51 TATTTGCCAG CGAGTGGACA AGAATGTCAA TATGACCAAA ATCTTTTTTT 

154001 ACCTGTTCTG CGACTTCTGA TATCGTGAAT CCCGTAATGC CCTTGTAACG 

154 051 TTTATTTTCA GCAATATCTT CAGGAACATC TTCAGGGCTA TCAAAACTTG 

154101 CGTCCATGGG ATAGATCTTA GCAATCTCTA AGAGAGTGCC ATTCGATAAT 

154151 TTTCTAGATT CATTGAATTT TCCTAATTCC CAAGACTGAG AGAAAATTTT 

154201 GTAAATCGGT ACCCATGTTC CTACAATAAT CGTAGCTCCT GCTTCTGCAA 

154251 GAAGTTTAGC AATACCCCAG CCATATCCTT GGTCATCACC AATGCCCGCA 

154301 ACAAATGCTA CCTTTCCTGT TAGATCAATC TTTAGCATGA ATCCGCCTTA 

154351 TACTTTTGAA GCTTATTGGA AGGAGAGTAA CAAATCTTTC GATTATTAAG 

154401 AAAACCTTTT GGTGCCTCAA CAGGGGAGAT CCTGCCTCCA ATGTAAATAG 

1544 51 AAACGTAAAT TCTTTAAATT TTTTTCTTTA CATATTTTAT AGAATATCCA 

154 501 AACTTCTCAC TCCCGCGTAC TGCTAAAAAA ATTTTCAAAA GAATTTACGA 

154551 TCCGAACTTA TCGTAGTTTG GGTTTCACTG ATTACTTAGG AGGTTGTTTG 

154 601 ACGAATCCTT TAGGGAAATT CCCCTCACCA CAGAATCCAC AGGTTGTTAC 

154 651 GATAGCGCCT TCTTCCACAA CACCACAAGC AGTCTCATCT GCAGTTCAAG 

154701 GTTTTCTTCA AACTGGAGGA GCTGCCTCCT CTACAGCGAC AACTACTACC 

1547 51 GCATCCGGAG CCTCTGCATT AGGACTTTCA CCTGATCAAG TGCAAGCGTT 

154801 GCTTACTAAT TTATTAAATG TGGGACAACC ATCAGTGGGA CAACCATCAA 

154851 CTTCAGCAGG AACTTCGGGA GCCTCCTCTT CCAGTGCAAG TATGCAGCAA 

154901 CAGCTTTTGC AACTTATCTT AGACAAGACA ACAGGAAGTG GCGGATCGTC 

154951 CGTGAGTTCA GAGCAATTAC AGCAACTCCT TAGCTTGGTG AGCCAGATGA 

155001 CTACGTCTCA AGGAGGAAGT GGTGGAACTC AGGCAGGACA GGCCGCTTCG 

155051 GTACTGTTGA ATTTGTTATC GGC AACAGG A TCTGCAGCAG CAAATCCTTT 

155101 AGGGACAGCT GCATCGTTGG CACAGATCAT TTATGCAGCA GTAACAAGTC 

155151 CTGGAGCAAA GAAAACTAGC GAATTTTGTT ATAATTATTG TGGAGAGACC 
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155201 TGCCAAGGCA ACTGCGGTTG 

155251 CGGTTGTGGA GGATTTGGCC 

155301 GCGGGATAGG AGAGGGATCC 

155351 GAGGCTTTAG AACAGAAATA 

155401 TGAGCTTGGC ATTGATACCA 

155451 GATTTCCTCC AATCGCGGAG 

155501 GACTTTTGTG AGATCTTGAA 

155551 GGCGAGTTGT GTGGATGGTT 

155601 TTGCCTCAGG GATTGCTAAG 

155651 GAAAGCAAAG TGATGGTTCT 

155701 TTGTAGCCCT TTTAACCTAG 

155751 AGGTCTTCTC CCTTAAAAAG 

155801 TCTGCATCCT GCATGAATCT 

155851 TGGAGGAAGC ACTCCGATTT 

155901 TTGCTTTGGT TTTATCTCAC 

155951 GATCCCCAAC GTGGCAACAT 

156001 GGCTCGGGGG ATGGGCATGC 

156051 AATCTTCCTC GTCACGCTAT 

156101 TCTCTTACGA AAACAAGTTT 

156151 AGATAAGTGC CTACTACAGA 

156201 TAAATTTACA ACGCCCCCCT 

1562 51 CCGCAACCTC AACCAGATCC 

1563 01 TGCTAGAGAG CGTTCCCCTG 
156351 TACCTTTAAG TGTGATTTCT 
156401 CGGGATTTAG AAGATGAAGA 
156451 CGACTCCAAT TACAGTTTTA 
156501 TTTCTTGAAG TAAGTGCCGA 
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TCCTACCTGT GGCTGTCCAG ACGGACAGTG 
GTTTTTTCTG TGGTGTATGG AAAAATTGTT 
CAAGAACCCG CAATCCCTTT ATAAGAACTC 
TGGCAAGGCT GTTCTTTTAA TTGCGTTAAG 
TGAGCTTATT ATCAGGACAT CGACTCGAGG 
GTCATGGCTG CATGTGACCG GTGTTCTATG 
GTCTCAAAGC ATGGATCTGT GGGCGGATGC 
TATTACAGGA TCCTTTTTGG AGTACAGCAA 
TCTTCTCTTC AGGAAACGGA ATTCGAGTGT 
TTCTTCATGG GGAGAGCAAG GAGCACAGGT 
AGAGGATATG TATGTCTTTC CCATCACTTA 
AACGGGTGCG AGAACATGGG AATCCAGTTG 
ATTAATGTCT ATTTTCTTTG TAGCTACCAA 
GGATCACCAA AGAAAATCTG ATGGCGTTAG 
TATCAATGTT ATTTTGTCCC AGCCACAGGA 
TTTAGGTAAT CCAGAAGTCA ATGCTATTTT 
GTGTCGATCT GGAAAGGAAG CGAGGGGGAG 
TTAGAATTAG CTGCACGATG TTTTGAGAAT 
GTTAAGCGAT GCTAACAATG TTCAAGAAAG 
TGTCAACTTC ATTGATGCAT ACGGCGGGAC 
GTACCCACAC CTTCTGGAGT CACGGCACAT 
TGTGGTTACG TCTCAACCTT CTTTATTAGG 
TGTCTTCTAG AGGGCGTTTT CCTGTAGTTT 
CCTAGGTCGC ACCCCGGAAG GGTAGAAAGG 
AGAGGAGGTT ATGTTTTGAA GCAGTGTAAA 
TGAATCTCTA ATTGTAAAGT TCTAGGGGTT 
GCACATTCTC TAGGATCTTC GGTTGATGAC 
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156551 GCACAAATTT TTAGGGGAAG 

156601 GTGAGCATGG AGGAGAGGGC 

156651 CGTAGTCGAC ATCTCCGACA 

156701 CGGATTTGAT GGGTTCTCCC 

156751 GCTCCACACC TCCGGTATAC 

156801 TTTTTTTTTA GGATGACCAA 

156851 TTCTTTGCTT GAACAATTTC 

156901 CCCATGAGAA GACACCCAGA 

156951 AAACCGTGTC ATGTGTGCCA 

157001 CGTAGATAAT GCTATAGTCA 

157051 GTTGAGGGGA TCAGAGATAG 

157101 GGATTCAAAT CTTTCTATGA 

157151 TAGACGCCAA GATTTCTTGC 

157201 AAAGAAGATA ATCTTGAGAC 

157251 AAAATATTGT AAAGCAGCCC 

157301 AAGAGATTTT GTATGACAAA 

1573 51 TCCTGTAGAG GCATTAAAAA 
157401 CTTGGAGAGA TAACCCGACT 

1574 51 CTACAGCAAA CGCGAGCTGA 
157501 GGAGAGATTT CCTACCATAG 
157551 TCATTAAGTT ATGGGAGGGA 
157 601 TTAGAGGGAG CTCGCATGGT 
157651 TGATGCCATT TCCTTAGCTC 
157701 ATGCTATTCT AGCCTTTGCT 
1577 51 AATGTCTTGC GTGTTCTTAG 
157 801 CTTAGAATCA ACTCGTACTT 
157851 CTCATAAGAG TCCCGAGGTT 
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ATTTGTGAAT GGGGAGATAA ATTCTAGGGA 
GGAAGATCTG GGGAGGCTGT TCTTTAGGTC 
ATAGGATGAC CCAGCAATCC CATTTGTAAG 
TGTGATGGGC CTGCGGCTCC AAAAATCACA 
GGGGGCCGTA TAAGATTTTA CGGTTCCAAA 
AAACGAAAGC TATGTATTGT TTATGGATTT 
ATGAGCTCAG TAGCCGCTTG TTTAGACTTT 
GGTGCCTTTG TCTAACCTAT GCACAGTAAA 
TTTGTTCAGT AGTAAGATGG GGAGGTTTTT 
TCCTCCCAGA GGATGCTAGG TTGTTGTTTT 
GGAAACACGG TCGCCAGGTT GTACCTTGTA 
ACCCGTTCAC TCGACATCGA TGTTGGCGAA 
TTGCTATGAT TAGGCAGTTG AGATCTAAGA 
TTGTGTGGCA AGCCAGGAAA AATTTTCCAT 
TTTTATCATT TGATAATTGC ATAAAATTTT 
GATAGCTTTT TCTGAAAAGG CAAAGAATTT 
AATGGTTTGA AAAAAATAAA CGATCTCTTC 
CCCTATAGTG TGTGGGTTTC CGAAGTTATG 
AGTTGTTATA GATTATTTTA ATCAGTGGAT 
AGTCTTTAGC TGCAGCAAAA GAAGAAGATG 
TTGGGTTATT ATTCTCGAGC GCGCCATCTT 
TATGGAGGAG TTTCATGGAA AGATCCCTGA 
AAATTCGTGG AGTTGGTCCT TATACGGTTC 
TTTAAGAGGC GTGCTGCTGC TGTGGATGGC 
CCGGATATTT TTGATAGAAA CTTCTATAGA 
GGGTTTCTAG GATTGCTCAA GCGCTTCTTC 
ATAGCTGAGG CTCTGATAGA GTTGGGAGCT 
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10 l y \J ± 


TGTATCTGTA 


AAAAAGTTCC TCAATGTCAT CGTTGTCCTG TCCGTCAAGC 


T c *7 QCi 
1 D / y D 1 


ATGTGGAGCT 


TGGAGGGAGA ACAAACAGTT CGTATTGCCG GTACGTCATG 


lOOUUl 


CCAGAAAAAA 


GGTCATCTTT TTGCATCGTT TGGTAGCGAT 


TGTATTGTAC 


1 jou j1 


GATGGCTCTT TGGTTGTCGA GAAGAGACGT CCTAAAGAAA TGATGGCAGG 


1 col ni 
J.DO 1 Ul 


CTTATATGAA 


TTTCCTTATA TTGAAGTTGA ACCAGAGGAA GGTCTTCAAG 


1 DO 1 D X 


ATATAGAAGG 


ATTTACTAAG AAGATGGAGC TTTCTTTAGA 


AAGCCCTTTG 




GAATTCTTAG 


GTAACCTTAA AGAACAGCGG CATGCGTTTA 


CTAATCATAA 


1 jCSZ J 1 


GGTTCATTTG 


TGTCCTATAA TTTTTAAAGC CACTTCTCTG 


CCTCAGTTCG 


1 c o ^ r* i 


GGGAATTGCA 


TCTTTTGAGT GATATAGATC ACTTAGCTTT 


TTCTTCAGGA 


i c o *i c i 

158351 


CACAAAAAGA 


TTAAAGATGC TTTGCTAATC TACCTCGGGG 


ATGTCAGGTC 


158401 


TAGAGAATCA 


ATAGGAGTAT AGATGCGAGA TCACGCTTTT 


TCTAAATTGA 


158451 


TAGGGACTGT 


CCGTGCCATG GTAGTTGAAG GACGTTGTCC 


TTGGTCACTT 


lbobUl 


CAGCAATCCC 


TAGTCTCTAT GGTAGAGCAT ATTCTTGGAG 


AGTGTCAGGA 


1 COCCI 


ATTTCACGAG 


GCCGTCTTAC AAGGTAAGAC GGTACAAGAG 


GTTGGTTCCG 


IjODUI 


AAGCCGGGGA 


TGTCTTAACT TTAGTTCTAA TTTTATGTTT 


TCTGTTAGAA 


iDOODl 


CGAGAGGGCG 


TACTTGCTTC CGAAGACGTT GCCAATGAGG 


CTATGGAAAA 


1 C.Q7/1 1 


ATTGCGTCGC 


CGTGCTCCTT ATATATTCGC TGAAGATTAC 


AAGCCGGTCT 


1 DO / D 1 


CGATTGAAGA 


GGCCGATCGC CTTTGGGAGC TTGCTAAGCA 


L» V- o A oAVj nnh 


i cponi 
IjooUI 


AATGAATCTA 


CATAGTTGAA GTTTTGGTCT ATTTTTAAGC 


ATATGGTGCT 


lDobD 1 


TTTGAAAAAA 


CAGAATATAT GCTATCAAAG AAGGGTAAGT 


TGGGGGCCTT 




TTAAGAGAAG 


GAACCTGGGA ATCGGGTCAG GACTGGAAGG 


TAGCAGCCCT 


1589 51 


AAGGAGAGTT 


TTCTTTTGCT AAAAGAATGT TCTCCAACTT 


ACTCTTTTTA 


159001 


CTTTATTCCC 


AAAAATAGCA ATGAGGTGAG GTTAAACAAC 


CCGTGCAGTG 


159051 


CAATGGGAGA 


AAGAATGTGC CGATCTTTTT CATATAGAAA 


CCCTGCAGAT 


159101 


AAGGAAAAAA 


CAAAGAGCAC GGGGACAAAG ACCCAACTTC 


CTAAAGAGTG 


159151 


TTCAATGTGA 


ATGAAAGAGA AAATAATAGA AGAGCATAGT 


ACCGCAGCTA 


159201 


TGCGCGTCAT 


TTTGTTTTTC AAGAATGTCT GTAGAATTCC 


TCTAAAAAAT 
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159251 ACCTCTTCTC CAAATGGAGT GAGGACGCCT AAATTTAGAA TCATGCTAAT 

159301 GTAGTGTCCT GTTATAGGCA GAGAGTTCTG AACTTCTTGA GTGACTTCTT 

1593 51 GTGTGTGAAT CTCTTGCGTA GGAAGAACCA AAGTTAAAAA TTTACTCATC 
159401 ATAATCCCAA TCAGTTGTGT TACTGGGATG ATGATGATCC ACATTCTGAT 

1594 51 GGCAGATCCT AGAGCACGCC ATGAAGTTTT AACCGGTCTT TCTCCAGAGA 
159501 AAAGTATAGC ACGTGTGATA TCCTTGGGGA GAAAAAGCAG GTAGAACAGA 
159551 AATGCAAAGG CAAGGCTAAT TCCTGTCATG GTGGAAAGTA ATTCCGCAGT 
159601 TTGTGAGCTC ACGCTAAGAG CTACAAGGGA AGAAAAAACA AGAAGAGCAC 
159651 CACCAAATAA AACTTGGCGG AGCTTTAAAG GTGTTTTCCC AGAGGGTGCT 
159701 GGCCAGATAA AGAAGTTTTT GGAAGCTAGA GCAGCGACGC CAAGGGACAA 
159751 GAGAAGAATA AACTTGGACA TTTCCTTAGA CTACGAGTAG TTAGCACAAA 
159801 CATAGCCCTC AACTCTGGCA ACAACTTCGC GGAAAAGACG GCTATGCATT 
159851 AAGCCCATGG GCGTTGAATC AAAATGTTTT GAGTTCCAGC CATAGCGATG 
159901 ATAATCATTG ACTAGGGTAG TTAAAGGCTG GCTGCATTCG ATAATCTCTT 
159951 GATAAATGAG AGCTATTTTA TGATGACGGA TATCAAAAAC GCGAACACGT 
160001 ACAGACGCTG TTACAGAATC GACACCTGCT TCTTTCCCTG TCTTTTGTTC 
160051 TAACAGTTCT GTAGCAACAA TGAATTCTGC AGGAAGAAAT TGCTCAATAA 
160101 TTGTTTCGGG TAGACGATTC GCAATCGGAG CATAGAACTG AGAGACTGTC 
160151 TGAGGTGAAG CATTGTGCTT GATCAGGAAG ACCTTTTCCG AAGCATAAAA 
160201 CCTTTTGCTG ATCTCTTCAG TAAATTCTCC TTGGAGGTTC GAAGGTAAAG 
160251 GTTCAAGACT CTTTCCTGGG CGATGAAATA CAGGAAGCAT CGCAATCACA 
160301 CCTTTAGTTT TGCTCCCTGA AGTGTATAGC TTAGGATGAT AACTTCCTGA 
1603 51 AGAGCCTAAG TGAGTGCAGC TGGATAGGGT TGGGGATAGA AGTCCTAAAG 
160401 ATGCCAATAA TACCAACATT TTTCGCATAG TCACTGTCCT TAAATTGCTT 
160451 ATTTTGCAAA AGATTCTAGC CCTGGGAAAG TTTTTACTTT TAAGATCAAT 
160501 ACTTTCGCAA TTGAGAGATT TTCCATTTAA AACTCTCATT AGCTTATATC 
160551 AAAGAAAAAA ATAAAAACAA GCAAAGAGAC CGTCTCAGTT TTAGTTTAGA 
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160601 


AACTCAAGGT TGAGAAAGGG ATTCTGACCA AAGTTGTGAG GGAACTTTGG 


160651 


TAACTTTTTC TTTAGGAATC AATGTGCACC 


CTGGGAGGAA 


TACAACTGGA 


160701 


GCTGAGATCA CAGAAAAAAT AAACCACTTC 


ATGACTACCT 


CTGCAAAAAG 


160751 


AATACTACTA TTTTCTATTT GCATAGGCAG 


TTCTTCGATT 


TATAGCAATT 


160801 


TTTACTTTAT CATCATAAAA ACTATGATGA 


AAAGGTTCTT 


AGTGAACTTC 


160851 


TAAGGAACAC CATGAGTTTG TGATTAAAAC 


TCATGGTGCG 


TAGTTGAACC 


160901 


CTTATTGAGG AGGGACGCAA CCACTAAGAG 


CTAACAATAC 


TAAACAAGAT 


160951 


AATAGTAAAG AGAATGCTTT TTTCATTATT 


CATCCTTAGG 


TTAATTGACC 


161001 


TTTCCAGCCT AGCTCTAGGG CGATTATTTA 


TCAAATTTTT 


CTTTGTAATT 


161051 


AATGATCATG CGACCATTAA TTTAGCGATA 


AATTATGATT 


TCGTCAGGAA 


161101 


AATTCAATTC TTTATAATAA TGATATGAAA 


TTAGAGAATG 


TCTATAGGGG 


161151 


CGGACTCTAT TTGTGATCCA GGATCTCTTT 


AGGAGCACTT 


TGTGGATTTT 


161201 


GATTATTTTG GTCTGAGTGA TATTGGTAGG 


GTGCGCGCTA GAAATGAAGA 


161251 


TTTTTGGCAG GTAAACCTCA TGTCTCAAGT GGTTGCTATT GCTGACGGTG 


161301 


TTGGGGGGCG TCTTGGTGGA GACATTGCTT 


CTCAAGAGGC 


AGTGACTAGC 


161351 


CTTATGGAGC TGATTGATGA GCAACAGTCA 


AAATTGATGG 


GGTATGGGGA 


161401 


TGACCAGTAT AAGGAGACTT TAAAAAAGAT 


CCTTTTAGAG 


GTCAATGGTG 


161451 


TGGTCTATGA ACACGGCCAA ATGGAAGAGC 


ATCTCCAGGG 


TATGGGAACC 


161501 


ACTCTTAGCT TCATCCAATT CCGGAAGGAT 


AGGGCATGGC 


TATTTCATGT 


161551 


GGGAGATAGT CGAATTTATC GTATTCGTGA 


GGGAGAACTG 


CGCCGCCTTA 


161601 


CCGAAGACCA TTCTTTAGAA AATCAATTAA 


AAAATCGTTA 


TGGGCTTCCT 


161651 


AAACAATCAG ATAAGGTGTA TTCTTATCGC 


CATATTCTGA 


CTAATGTTTT 


161701 


GGGAAGTCGT CCCTATGTCA TGCCTGACAT 


TCGGAATCTT 


CCTTGTGAAA 


161751 


AGGAAGATTT GTACTGCCTC TGTTCGGATG 


GATTGACAAA 


CATGGTTCCA 


161801 


GATATCGATA TTCGTGATAT CTTGAACCAG 


CCCGCCACCC 


TAGAAGAACG 


161851 


GGGGAATGCA TTAATTTCTC TAGCCAATAC 


TCGTGGAGGC 


GATGACAACG 


161901 


CTACTGTCGT ATTAGTCCGA ATACAATAGT 


TCCTTTGCTA 


AGGATAGTAT 



120 



W O0027994 r file:/A\dcwas03\firmdata \lp\Fole yPat\PatentDocuments\WO0Q279 94xpc] 



Page 246 of 330 



WO 00/27994 PCTAJS99/26923 

161951 TCCATGATCT ATTTGGATAA CAATGCGATG ACACCCCCAG AGAGGGGACT 

162001 TTTGGAATTT CTCCAAAAAA CCTTCCTTAT AGAAGGGACG TACGCGAATC 

162051 CTTCGAGCGT CCATCAATTA GGTAAAAAAT CTCGTCAACT GGTTCTAGAA 

162101 GCTTCACACT GGATGCAAAA GGTCCTTTCG TTTCAGGGCC GTGTCCTCTA 

162151 TACCTCAGGG GCTACTGAGA GTTTAAATTT AGCAATAGCA AGCCTCCCTA 

162201 AAGACAGTCA TGTTATCACC TCAGGTAGCG AACACCCCGC CATCTTAGAG 

162251 CCTTTAAAAC ATTCCTCGCT TTCCGTTTCT TATTTAAATC CCGAAGAAGG 

162301 GAGATGTGTT CTTACTATAG AGCAGATTGA AAGAGCTGTG ACTCCTAAAA 

1623 51 CTTCAGCAAT CATCTTAGGT TGGGTCAATA GTGAGACTGG TGCCAAAGCT 

1624 01 GATATAGCTG CTATAGCCCA CTTCGCGCAA GAACGACAAT TGCAATTTAT 
1624 51 TGTGGATGCG ACTGCAAATG TAGGTAAGGA GAGGATAGTT CTTCCCTCTG 
162501 GTGTCACTAT GGCAGCATTC AGTGGACATA AATTTCATGC ACTCTCTGGA 
162551 ATCGGAGCTC TTCTGGTCTC TCCAGGAGTC AAACTACATC CTCAGCTGTG 
162601 GGGAGGAGGT CAGCAAGGAG GGCTGCGCGC AGGCACAGAA AATCTTTGGG 

162 651 GAATCGCCTC TCTGCTTTAT ATTTTCAAAT ACCTAGATCT TCATCAAGAG 
162701 CGTATCTCTC AGGAAATTCT TACCCATAGA AATGGTTTTG AAAAGGCAAT 
162751 CAAAGCACGC ATTCCTGATG TCCATATTCA TTGTGCGGAT CAACCACGGG 
162801 CAAACAACGT CTCAGCAATT GCTTTCCCTC CGTTGGAAGG TGAGGTATTG 
162851 CAAATCGCCT TAGATATAGA AGGAGTGGCT TGTGGTTATG GATCCGCATG 
162901 CTCTTCAGGT GCTACCGCAC CCTTTAAATC TCTTGTCAGC ATGGGTGTTG 
162951 ATGAAGAGTT GACCCTGGCA ACACTCAGGT TTTCTTTTAG CCATCTTCTC 
163001 TTGCAAGAAG ATGTTGAAAG AGCCGTTGGA ATTATAGAAA AAGTCGTAGA 

163 051 ACGTTTGAAA AATTCCTAAG TCTTAAAAGA GAACATGTTT CTAAGCTGAA 
163101 AGAACACTCC TGACTCTTAT TGCAGAATCT ATGAGAGTAA GTTTTTAATC 
163151 GATACGGTTT TTATCCCAGA TAAAGACATC TCTTTAACTT CTAAAAGCAA 
163201 GTTGTTGATA ATTACAGAAG TTCCTACTTC TGCAGGACTG TCTAGCAGTT 
163251 GCAATACCAA TTGGGCTAGG GTTTCTACAG GATATTGCGG AAATTGAATA 
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163301 TCGAGTTCTT TTTGCAGATC TTTTATGCGA GAGTTGCCAG GAAACGTTCT 

163351 TTCAATAACA GAGATGGTCT TGGGTTTTAA ATGAGCAATG TTTGTAGTGT 

163401 TGAATAAGAT TTTGAAAATT GCATTTAAAC TAAGAATACC TATAGGTTCA 

1634 51 CCAGAAGCAT TGAGGACAAC AGCAACACTC GAACGGTTGT CTCGAAACTC 

163501 TTTGAGGATA CGAATAAGTT TTGATTTTGC AGTGATAAAC CAAGGCGAGT 

163551 GTAGATTATT GATTAGGGGT TCATCAAGAG CTTTATTGAC AAAGTCTTTA 

163 601 GGATGGGCAA TCCCAATAAC GTTTTTTCGG GCCTTGTGAT AGACAGGAAT 

163651 AAAGTTGATA TCTGTATTTT TTATAGTCCG GCAAAAATCT TTAACATTTG 

163701 CAGAAGAAGG AAGCATGGTA ACCTGTTCTA AAGGTTGGCA TACCTGATCT 

163751 GCACAAGTCG CACTTAAAGA GAAAATATTT GTAGCAATTG TATTGAAATC 

163801 TTGTTCTTCA TGGTGAGTCT CTAAAGCTTT TTGGAACTCG TCTCTACTTA 

163851 ATGTAGAGTT CAATTTTTCT TTCCTAATAT TTAGAAGATA GTAAAGACCC 

163901 TCAGTGAGAC TTCCTATGAG CTGAATCAGA GGATAGAAAA TATAGTGGGA 

163951 ATAATAGAGA ATCGGTGCTC CCCAAAGTGC TAATTTTTCA GGAATCTTCC 

164001 GTGATATTGT TAGAGGTAGA AGTTCTGCAA AAATCACAAC TATAAAAATT 

164051 TGAGTGAAAG GAGCGTAATC TGGAGTGATT CCTAAAGCTC GATAGCAATT 

164101 TCTTGAGGAC TCAGACCCGA CTTGTAGAGC GATATTCACT CCTAACATCA 

164151 CCGTTCCAAA TAAACGATAG GGGCGGCGAA TCAGGAAATT AATGTAGCGA 

164201 GCTTTCTTAT GATCTTTAGT CAGATAGTAT TGCAATCGTA CACGGTTAAA 

164251 TGACACGCAG GCCATTTCCA TCATCGAATA GAATCCTTGT AAGACAATAC 

164301 AGATAATGTT GACTCCTATC CAAAAGAGAG CAGAATTAGT CATACAATTT 

1643 51 CCTTATATAC ACACGGCGAA TGCGATTCGG AGCAGCGTCT AATACCTGGA 
164401 AAAGCAAGTT -ATTCCAAGAG AGTTTCATTC CTGTTGTCGG AATCGTTCCG 

1644 51 ATTTGCTCTA TTAACCAGCC TCCTATAGTC GCAATATTAT TGTTCGTCGG 
164501 TAGGTTGATA TCGAAGATCT CACTAAACTC ACGGAGTTCT AAAGTTCCTG 
164 551 AGGCAATAAT AACATCAGCT CCTGAGGTGG TATAGAGTAT TTTATTATCT 
164601 CTCTGGTCTA CAATTTCTCC AGCAACAATT TCAAAGAGGT CTTCTTGAGT 
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164 651 GATCAATCCT TCAATAGATC CGTATTCATC AATGATCATC CCTAGGGTTT 

164701 CGTCTTCAGC TGCCATCTGA CATAAAGCCA TTTTTGCAGA GATGGTTTCT 

164751 GGCATATAAT ACGGTTTTTT CAGCAAGGGG AGGAGATCAT CCGAAGATTG 

164801 CAGTGGCTTG TCATGTAAAA GAAGAGAGCG CGCTGTGCAA ATGCCCAGAA 

164 851 GGTTTTGGAG GTTATCGTTA CATATAGGAA CTCGTGAGCA ATGCTGTTTA 

164901 GAAAATAAAA GATAGAGGTT CTCTAAAGGG GTTTGGATAT CATAAAATAA 

164951 AATATCCTGG CGTGGCTGCA TACGCTCTTT AACACTACAA TCACTAAGAG 

165001 AAAGATAACC ATAGAGTAAA CGGCTTTCTT CTTGATTGAC TACGCCGAAA 

165051 TCCTTACAAC TTTGCAATAC TTCCTTCAGC TCTTGGGGTT GGATGATATC 

165101 AATCTGTTGC TTCGATAAAA TCCATTGGAC CACATAATTA ATTCCTACGA 

165151 TACCCCAGTG GAGTAGGGGT TTGAAGATTT TAGTAACACA AAGAATAAGA 

165201 GGGGCTACGG AACTAGCAAT CTGTGTATTA AAAGGAAGAG CTACTGCTTT 

165251 AGGGAGAATC TCACCTAAGA TCAAAGTAAT TGCTAAAGGA AGACCTACAG 

165301 TAAACCACCA CGAAGCTGCA TCTCCAAATA GAATGGCAAA ACAGTTTTGA 

1653 51 ATAGCAATAT TCAGTCCGAT ATCACAAAAA ATTAAGGTGA TGAGCAGGTG 

165401 GTGGGGATGT AGAAGAAGGG TAGCTACTCG CTGCTGTTTC TTAGATTTAG 

165451 AGCGCTTATA GTGCGAGATC AAACTCGTAG GCAAAGAAAA CAAAGCAATT 

165501 TGAGATAACG AAATGAATCC CGAGCATAAA GTAAAACAGA TAATGAAGAA 

165551 CATTAACATG GTAGGAATCA TGGTCTCTTT TCAGTCCTTA TTTTCTGATT 

165601 GTTGCTTTGG GGAGACACAG AGTTTCTTAT AGCCTTTAGG AATGAGACCC 

165651 AGACGCTGGA TTAAAGCAGC TTCTATAGCA GCGGAGTCTT GCCAGTGTTG 

165701 CAGATGTAAT TGGAGTTGAC GCTGCTTTTC TTGAGCAGAA AGAATGTCTT 

165751 GGCATAAAGA AGAGACCTTG CTTTGTAAQC GTAGCTCTTC TGTACGTAAC 

165801 TCCTGGATAG CACGATCATA AACAAAGCCT CCAATTAAGA TGCTAAAGAT 

165851 CACCCACCAG GATTTGATCA TCACTTCTTC TAGTAATCTA AAACCCCAGT 

165901 TTTTTTTTCT TACTGAAACT TTAGACACAA GGTACGGTGA TGCCTTGTTG 

165951 CTTTTGATAC TTTCCTTTTC TGTCTGCATA AGAAACCTCG CAGGTCGTAC 
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166001 TAGACTCAAA GAATAAGACC 

166051 GCTGGCAATG GCGTAGTGTT 

166101 TTCAGGCTCA AAAGGTGTGA 

166151 TAGACTTTCC TATACACATT 

166201 TCAACGCTAC GAGCTAGAGC 

166251 ATCAGTAATA GAGATGAAGA 

166301 CAGAGTTATA GACATTGGTG 

166351 TCGTAACCAT AACTCGATAG 

166401 CTCATTTACG TTCACTTGGC 

166451 TTAGGGCCAT CTCTCGTATC 

166501 CCTTAACAGT TTGAAATTGC 

166551 CTGGTTTTTG GGGGGAACTT 

166601 CTTGAAAAGA CCAGTTGTAT 

166651 ATTTTATCAC GAAACCTAGA 

166701 GCAAACCTTA TTTTTAGGGA 

166751 ATAGTAGAAG AATCTTGGTT 

166801 TAAGAACTTC TTGGATTCAT 

166851 TAGAGAATTT TTACTACCAA 

166901 TTCGCCATCT TTAATTTATC 

166951 ACAGGAAGTT TAAATCATAA 

167001 GTACTTTGGC CCTGCTCTCC 

167051 TCTATATAGT ATCTCGTGTA 

167101 GATGAGAGTT TCAAAAAAAT 

167151 GAAGGTTCGT ACATGACGCA 

1672 01 ATTTGATGTT TCGTTAAGAC 

1672 51 ATCATTTAAA AGAACGCCTA 

167301 GGAGAAGTTC CAGGACATTG 



PCT/US99/26923 

TGGGCAATCC CTTCATTAGC GTAAATTTTC 
AGAAATTTCT ATAGTCACAT GCCCTTCCCA 
CATTTACGAT AATTCCACAG CGTGCATATG 
GTTAAGACAT TTCTAGGAAT TCGGAAATAC 
AAAAGAATTT GGAGGAACAA TACAGACGTC 
TATCCTCAGT AAAGCATTTT GGATCAACAA 
AACACTTTGA ATTCTCGAGA TAG GCGGAGG 
GCCGTAACTT ATAAGTTTTT CGCCTGTCTC 
CATTAACAAA GGGATGGATC ATATCGGCAT 
CACTTATCTT CTTTTATGCT CATTTAGAAA 
TTTCTTAATG ATATTCTGTT TTTCAATTTA 
TTCTAAGTAT AAGATAGACT TTGATTATCT 
AAACAAGAAA AGCCTATCCC AAAGGCTACA 
GGTAATGTTA GATAATCCTA AGGGAAAAAG 
GAACTTCAGG TAGGTCTGCT CTTTACTCTT 
CTCTTGAATG CATTCATGCG AGGACCTTGA 
AAAAAGATTA ACATCTCCTT ATTGATAAGC 
CTTCTCAGTG GAAAATGTTT TTAAAAATAG 
TGTTTTAAGA CAAAAGAAAT CTAGATCACC 
AATGAAAATG ATGGAGAGGT TCTAGTGGTC 
TTGATAGAAA GAAGAGGTCC ATAGTGTACT 
CTATGCCGAG TATAACCGAT CGGCGTTATC 
ATAAAATCCA CCTAAAGAAA AAGCGATAGA 
TCAAGTAGCT GTCTTGCATC AGGATAAAAA 
CTAAAGGGTT AGAAGAATTT TATGGACAGC 
GATCTATTTC TTTGCGCAGC ATTGCAACGA 
CTTGTTTTTT GGACCCCCAG GCTTAGGGAA 
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167351 


AACCTCACTT GCTCACATCG TTGCCTACAC CGTGGGGAAA 


GGGCTGGTCT 


167401 


TGGCATCAGG GCCTCAGTTA ATCAAACCCT CGGACCTGTT 


AGGACTTTTA 


167451 


ACTAGTTTGC AAGAAGGGGA CGTGTTTTTC 


ATCGATGAGA 


TCCATCGTAT 


167501 


GGGGAAAGTT GCTGAGGAAT ACCTGTATTC 


TGCAATGGAA 


GATTTCAAAG 


167551 


TCGATATTAC TATAGATTCA GGACCCGGAG 


CTCGCTCGGT 


CCGTGTCGAT 


167601 


CTTGCTCCTT TCACTTTAGT GGGGGCAACG 


ACTCGATCAG 


GAATGCTAAG 


167651 


CGAACCTTTA AGAGCACGCT TTGCTTTTAG 


TGCGAGACTT 


TCCTATTACT 


167701 


CGGATCAAGA TCTAAAAGAG ATTTTAGTCC 


GCTCCTCACA 


TTTACTCGGA 


167751 


ATCGAAGCTG ACAGCTCCGC ATTACTAGAA 


ATTGCTAAGA 


GATCCCGAGG 


167801 


GACGCCACGA CTGGCAAATC ATCTTCTACG 


TTGGGTCAGA 


GATTTTGCTC 


167851 


AGATCCGAGA AGGAAACTGT ATCAATGGGG 


ACGTAGCAGA 


AAAAGCTTTG 


167901 


GCTATGCTAT TAATAGATGA TTGGGGATTG AATGAAATTG 


ATATCAAACT 


167951 


TCTCACTACA ATCATCGACT ACTACCAAGG 


TGGTCCCGTT 


GGAATTAAAA 


168001 


CCTTATCGGT AGCTGTGGGA GAAGATATCA 


AAACTCTTGA 


AGATGTTTAT 


168051 


GAACCGTTTT TAATTTTAAA AGGTTTTATC 


AAAAAGACTC 


CCAGAGGCAG 


168101 


AATGGTAACA CAACTTGCTT ACGACCATTT AAAAAGACAT 


GCAAAGAACT 


168151 


TATTGAGTTT AGGAGAAGGA CAGTGAAACT ATTGAAAAAC 


GTACTTTTAG 


168201 


GTCTTTTCTT CAGTATGAGT ATCTCAGGAT 


TCTCAGAAGT 


AAAGGTATCC 


168251 


GATACTTTTG TGAAGCAGGA TACTGTCGTT 


GAACCTAAAA 


TTCGTGTCCT 


168301 


TTTATCTAAT GAAAGCACCA CAGCTCTCAT 


AGAAGCCAAA 


GGTCCTTATC 


168351 


GCATTTATGG AGATAATGTC TTATTAGACA 


CAGCGATTCA 


AGGCCAGCGT 


168401 


TGCGTGGTCC ACGCTCTATA CGAAGGGATC CGTTGGGGAG 


AATTTTATCC 


168451 


CGGACTCCAG TGTTTAAAGA TCGAGCCTGT 


AGATGACACT 




168501 


TTTTTAACGG GATTCAGTAT CAAGGTTCCC 


TATACGTTCA 


TCGTAAAGAC 


168551 


AACCATTGCA TCATGGTTTC TAACGAAGTT 


ACAATCGAAG 


ATTATCTGAA 


168601 


ATCTGTACTT TCTATAAAGT ACCTTGAAGA 


GCTAGATAAA 


GAAGCTCTAT 


168651 


CTGCTTGCAT CATTCTAGAA AGAACCGCTC 


TATACGAAAA 


GCTCCTTGCA 
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168701 AGAAATCCTC AAAACTTTTG GCATGTTAAA GCTGAAGAAG AAGGGTATGC 
168751 AGGATTTGGT GTGACCAAGC AGTTCTATGG TGTAGAAGAG GCTATAGACT 
168801 GGACAGCTCG TTTAGTTGTG GATAGCCCTC AAGGATTAAT TATAGATGCA 
168851 CAAGGGCTCT TGCAGTCCAA CGTAGATCGT CTTGCTATAG AAGGATTCAA 
168901 TGCACGTCAG ATTCTTGAGA AGTTCTACAA GGATGTGGAT TTTGTAGTTA 
168951 TAGAATCCTG GAATGAAGAA CTGGACGGAG AGATCAGGTA ACCTCTTTCG 
169001 CATGGCTGAT CGCAATTAGC GTGGTATGGG GCTGTAGCGA CACTGTCGGC 
169051 GTTGCTACAT TTTGAGGGAC AAACCCTTGC TGACTCTCGG CAACTATTTG 
169101 ATAAGGAAGA AAGTTGCTGG AGGCTTTAGG TAAGGTCGCA AGTTGGTCTT 
169151 GAGCTCCCAC GTGAAAAGCA ACATATACAT GCGCTTTTGG CGATTTTATT 
169201 TTAAATGCTA AGAAATTTCC AGGGCGCCAT GTCATGGGAT TTCCCATAGC 
169251 ATCTACCCAA CTGATTTCCT TATTGGAAAG AAAGCCTCGA TTAAAAAGTG 
169301 TTTTATATTT TTTTCGAAAC GCAATGAGAT CACAGAGAAA GTGCATCAGT 
169351 GTAGGCTTTG CGGTAAGCTG ATCCCAAAGG AAGTAATTCG CATTCGAATC 
169401 CAAAGCCCAA CGGTTGTTAT TGCCTTCCGC GGTATGGGCA TACTCATCTC 
169451 CTGATTGAAT CATCGGAATG CCTTGCGAGA CCATCAAAGT AAGGAAAAAA 
169501 TTTCGTAACT GTCTTTCACG AACTTCAAGA ATGCCAGGGT CTTCTGTTTT 
169551 CCCTTCCGTT CCGAAATTGT AGCTGTAGTT CGCATCTGTG CCGTCACGAT 
169601 TATCCTCTCC GTTAGCCTCA TTATGTTTGT GGTTATAAGT CACAGTGTCA 
169651 CATAACGTAA AACCATCATG GCAACTGACA TAGTTAATCG AATTTGTAGG 
169701 CGAGCCGTGA GGATAGATGT CTTGAGATCC TGAAATTCTA GAAGCAAAGG 
169751 TTCCTATGAG ATTTTGATCC CCATTAAGAA ATGCTTTCAC GTTATCACGA 
169801 TACGGGCCGT TCCATTCACT CCATCTTGGA GACAGTGTGG GGAAATAGCC 
169851 CACCTGATAC AAACCGCCAG CATCCCAAGG CTCAGCTATA ATCTTTGTGC 
169901 TCGCAAGTAA AGGATCAAAA GAAATCGCCT CTAAAACAGG AGCGAATTGT 
169951 AGGGGAGATC CCGAAGGACC ACGAGAAAAG ACAGAAGCAA GATCAAATCG 
170001 GAACCCATCG ACATGCATTT CTTCTACCCA ATAACGTAAG ATGTCGAGAA 
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170051 TCCATTGGGT CGTGGGGGCG CGGTTTGTAT TGAGAGTGTT TCCACAGCCT 
170101 GAATAATTTG TAAAGTGACC TTGTGCATCT AAAATATAAT AGCTCGGAGT 
170151 GTCTATCCAA GGCAAAGAGC AGGTCGTCCC TTGCAAGCCC GTATGATTAA 
170201 AAACAACATC AAGAATGACC TCAATACCTT CTTGATGCAA GGTCTTTACT 
1702 51 AAAGTTTTAA ACTCTCTACT TGGAGCGCAA GGATCAGAGG CATAAGCATA 
170301 ACGTCGGCAA GGAGAAAAGA AATTTAGGGG AGCATAACCC CAATAATTGC 
170351 ACAGATAAGG GAATTTCGAA TTTCTAAAAG GATGCGCAGT CTCATCGAAC 
170401 TCAAAGATAG GTAAGAGTTC AACAGCGTTG ATTCCCAGCT TATGCAGATG 
170451 GTCGATCTTT TCAATGATTC CTAGGAAGGT TCCCGGAGCA TGAACCCTAG 
170501 ATGAAGAAGA TTGCGTGAAG GAACGTACAT GCATCTCATA GATGATCATC 
170551 TCTTCTTTCG GCAAATGCAG AGGCTGATCA CCATCCCAAG GAAATGGTTC 
170601 TTCCTTTAAA TAACAAAATG CATAATCCCC CTGTTTCTTT CGCGAACCAA 
170651 AACTCTGTGG GGAATGAATA TTCTTCGCAT AGGGATCTGC AAGATATTCT 
170701 TTAAAAGAGT ATTGCATTCC ATGCTTTTTA GGCCCATGAA CACGAAATGC 
170751 ATAAGACGAT TGATCAGAAA TACCCTCGAT CTCTATATGC CAAATCGCAC 
170801 CCGTGCGGTG TGTATCGGGG TAAAGAGGGA CTTCTATGAC TTCTGAATTT 
170851 TCGTCTGTTA AAGCAAGGAT GACTTCGGTA GCTTGTGAAG CATATAAAGC 
17 0901 AAATCGATAG CGGTTTGGGG AAATTTTAGA AGCCCCAAGA GGTAAAGGAA 
170951 CTGAGGGATA AGAAGAAACT TTTTCCATCG TCGATCAGAT ATCATAACGG 
171001 CGCCAGAAAT ACAAGGGAGT TTTTGATTTT AAGCCTCTGA AATCGTTTAC 
171051 TAACAAATCA CTCTTATGGT AACTTGAAAC AACAACAATC TATTACAAGG 
171101 AGATTCCCTC ATGTCCAGGC AAAATGCTGA GGAAAATCTA AAAAATTTTG 
171151 CTAAAGAATT AAAACTGCCG GATGTAGCTT TTGATCAGAA TAACACGTGC 
171201 ATTTTATTTG TTGATGGTGA GTTCTCTCTT CACCTTACTT ATGAAGAACA 
171251 TTCTGATCGT CXTTATGTCT ACGCTCCTCT GTTAGATGGA CTCCCTGATA 
171301 ATACTCAGAG GAAATTGGCC TTATATGAGA AATTATTAGA AGGATCTATG 
1713 51 CTTGGCGGTC AGATGGCTGG AGGTGGAGTA GGTGTTGCTA CTAAGGAACA 
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171401 


GCTCATTCTC 


ATGCACTGCG TTTTGGATAT GAAATATGCT 


GAAACGAATT 


171451 


TACTAAAAGC 


ATTTGCTCAA TTGTTTATTG AAACTGTTGT 


TAAGTGGCGT 


171 501 

X 1 X mJ \J X 


ACCGTGTGTG 


CTGATATTTG TGCTGGTAGA GAACCTTCTG 


TAGATACAAT 


171551 


GCCACAAATG CCTCAAGGCG GTGGTGGAAT GCAACCTCCT CCTACAGGTA 


171 601 

X t X O V X 


TTCGTGCTTA 


ATTAAGCATA TAAGTACTAC ATCTTTTTTC TAAATTTAAA 


171651 


AGATTGATAA AGCTCTTCTT AGAGAAGAAA TGACTTTATT GACTTCTCGT 


171701 


TTTTGTCCTG 


AATCTGTTTT GAGAAGTTCT TAAGATGTAT 


TAATTAAAGA 


171751 


AGTATTAAAA 


ATAGAAATCT AAAGGCTATC TTATGATGTT 


TGGGCATTTT 


171 R01 

X f X 0\J X 


GCTGGTTACC 


TTGGAGCAGA TCCTGAAGAG CGAATGACTT 


CCAAAGGAAA 


171 851 

X 1 X O ~i X 


ACGTGTGATC 


ACTCTGAGAC TGGGAGTGAA GACTCGAGTT 


GGAATGAAAG 


1 7 1 Q01 


ATGAAACTGT 


TTGGTGCAAA TGCAATATTT GGCACAATCG 


CTATGATAAG 


1 71 951 


ATGCTTCCTT 


ACTTGAAGAA AGGCTCAGGA GTCATTGTTG 


CTGGCGATAT 


172 001 


CTCTGTAGAG 


AGTTACATGA GCAAAGATGG TTCACCGCAA 


TCTTCTTTAG 


J. 1 £t v J X 


TGATTAGTGT 


AGATTCTTTG AAATTCAGTC CTTTCGGTCG 


CAATGAAGGC 


172101 


AGCCGTTCTC CATCTTTAGA AGACAATCAT CAGCAAGTGG 


GATATGAATC 


177151 


TGTATCCGTA 


GGGTTTGAAG GTGAAGCACT GGACGCAGAA 


GCTATTAAAG 


172201 


ATAAAGATAT 


GTATGCTGGT TATGGTCAAG AACAGCAGTA TGTCTGTGAA 


172251 


GATGTTCCTT 


TTTAATTCCT AGTCATTAAA GGAGAGTTTG 


TGGTTTTATT 


172301 


TCATGCTCAA 


GCCTCTGGGC GTAATCGTGT TAAGGCAGAT 


GCTATAGTCC 


172351 

X 1 J w> J. 


TGCCCTTTTG 


GCATTTTAAG GATGCAAAAA ATGCAGCTTC 


TTTTGAAGCC 


172401 


GAGTTTGAAC 


CCTCGTATCT CCCCGCTTTA GAAAACTTTC 


AAGGAAAAAC 


1754 51 

X / A *» J X 


CGGGGAGATT 


GAACTCCTTT ATAGTAGTCC TAAAGCTAAG 


GAAAAACGCA 


172501 


TTGTCCTCTT 


•AGGCTTAGGG AAAAATGAAG AGCTCACCTC 


TGATGTTGTT 


172551 


TTCCAAACCT 


ATGCGACACT AACTCGTGTC TTACGTAAAG 


CAAAGTGTTC 


172601 


CACAGTCAAT 


ATCATCTTAC CTACAATTTC TGAATTGCGG 


CTTTCTGCCG 


172651 


AAGAATTCTT 


AGTGGGGTTG TCCTCAGGAA TTTTGTCATT 


AAACTATGAC 


172701 


TACCCACGTT ATAATAAGGT AGATCGTAAT CTTGAAACTC 


CTCTTTCTAA 
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1 *7 *n c i 
17^ /Dl 


"s A r , /T"T v r 
Au 1 l_/tl.oo J. 1 


ATCGGTATCG TTCCCAAAAT GGCGGATGCT 


ATCTTTAGGA 


i *7 o o n i 




CATTTTCGAA GGCGTATATC TCACTCGAGA 


TCTTGTGAAC 




AvjuAA 1 1 o 


ATGAAATTAC CCCTAAGAAA TTGGCAGAGG 


TTGCTCTGAA 


i / z y u i 


f rr , Tr*r*p a 2i l l. 

1 C 1 ^VjVjA/lHA 


GAGTTCCCTA GTATTGATAC TAAGGTCTTG 


GGAAAAGATG 


i / z y d i 




AGAGAAAATG GGACTCCTAT TGGCTGTTTC CAAGGGTTCT 


i n i nm 

1 / JUUl 


TGTGTGGATC 


CACACTTTAT CGTTGTCCGT TATCAAGGAC 


GTCCTAAGTC 


X / jUDl 


TAAAGATCAC 


ACCGTCTTGA TAGGGAAAGG GGTCACTTTT 


GACTCTGGAG 


1 / J1U1 


GTTTAGACCT 


CAAGCCTGGA AAATCCATGC TTACTATGAA 


AGAAGACATG 


nil 1 


GCAGGTGGGG 


CTACAGTCCT CGGGATTCTC TCGGCGTTAG 


CAGTTTTAGA 


1 »7 *j o An 


GCTTCCTATA AATGTCACGG GGATCATTCC TGCTACAGAG 


AATGCTATCG 


1 TiOCI 
1 / J Z3l 


ATGGCGCCTC 


CTATAAAATG GGAGATGTCT ATGTAGGAAT 


GTCGGGGCTT 


i tj*} m 
1 / jjUI 


TCTGTTGAGA 


TTTGTAGTAC CGATGCTGAG GGACGTCTTA 


TCCTCGCTGA 


i 711 CI 


TGCGATTACA 


TATGCTTTAA AATATTGTAA ACCGACACGT 


ATTATAGATT 


i *7 *a / a i 
1 / J4U1 


TTGCAACTCT 


AACAGGAGCT ATGGTAGTCT CTCTAGGAGA 


AGAGGTTGCA 




GGTTTCTTTT 


CCAATAACGA TGTTTTAGCT GAAGATCTTT 


TAGAGGCGTC 


1 / j jUI 


AGCCGAAACC 


TCCGAGCCGT TATGGAGACT TCCTCTAGTT AAGAAGTATG 


1 *7 1 C C 1 


ATAAAACATT 


GCATTCTGAT ATTGCTGATA TGAAAAATCT 


AGGCAGTAAC 


1 Ticni 

1 / jDvl 


CGTGCAGGGG 


CTATTACAGC AGCATTATTC TTGCAGAGAT 


TTTTGGAAGA 


1 / jO jl 


ATCTTCGGTA 


GCTTGGGCAC ATCTTGATAT TGCAGGTACT 


GCATATCATG 


1 / J / U I 


AAAAAGAAGA 


AGACCGTTAT CCAAAATATG CTTCAGGTTT 


TGGTGTTCGT 


1 *7 1 *7 ^ 1 


TCTATTCTTT 


ATTACTTAGA AAATAGTCTT TCTAAGTAGT 


TGCTTTCTAT 


i Tioni 

1 / JoUJ. 


TTATTTATGT 


TTTAGTAATG ACTTTTATTT TAGTTTTTTT 


AAAATAAAAG 


173851 


TCATTTTTTT 


TATTAAAGTT TTCAATCGTC CCTGCCGATA 


GATCAGGTAA 


173901 


GTAATTACCT 


GTCTAATTAG GGGAATAAAG ATGATTGGAG 


CGCAAAAAAA 


173951 


GCAAAGCGGT 


AAAAAGACAG CTTCAAGAGC TGTACGGAAG 


CCTGCTAAAA 


174001 


AAGTTGCGGC 


TAAACGTACG GTTAAAAAAG CTACTGTTCG 


CAAAACCGCT 


174051 


GTAAAAAAAC 


CTGCAGTTCG TAAGACGGCT GCTAAAAAGA 


. CAGTAGCAAA 
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174101 


GAAGACTACA GCTAAGAGAA CAGTTCGTAA GACTGTTGCT AAGAAGCCTG 


174151 


CAGTTAAGAA AGTTGCTGCT AAACGTGTAG TAAAAAAGAC 


AGTAGCAAAG 


174201 


AAGACTACAG 


CTAAGAGAGC 


GGTTCGCAAG 


ACTGTTGCTA 


AGAAGCCTGT 


174251 


AGCTAGAAAA 


ACTACAGTGG 


CTAAAGGTTC 


TCCTAAGAAA 


GCTGCAGCCT 


174301 


GTGCTTTAGC 


ATGCCACAAA AACCATAAGC ATACATCTAG 


TTGTAAACGT 


174351 


GTCTGTTCTT 


CAACAGCTAC 


GAGAAAGCAT 


GGCTCTAAAA 


GCCGTGTTCG 


174401 


TACAGCTCAT 


GGCTGGCGTC 


ACCAACTGAT 


CAAAATGATG 


TCTCGATAAT 


174451 


TTGTGATTTT 


CGCATTATTG 


CTCATGTTAA CGGGAAAGGG 


AAACATTGGG 


174501 


TTTCCTTCCC 


GTTTTTCTTT 


TTAAGGTTAA 


AAAGCTTTAT 


AGAGCGAGAT 


174551 


CTTCAGGCTT 


CATGCTGTAC 


AGTTGGTAGG 


AAAATACGTA 


TAGTAGGTTC 


174601 


AGGATACTAC 


TTTTTTGACT 


CTACCTATGC 


AAAAATCCTT 


AACGAGTTTT 


174651 


GATGACTTTT 


CCCAGGCGTA 


TGCAGAGAAA 


GTGCCCGCTA 


TAGCTCTTAT 


174701 


AGGGAGTGCT 


TTGGAAGACG 


ATAAAGATGC 


GCTGATTGAA 


TTATTAGTCT 


174751 


CTGAGAGCTT 


CAAAGAGCTC 


GGTGGTCAGG 


GACTCATGCC 


AGCAACCCTC 


174801 


ATGTCTTGGA 


CCGAGACGTT 


TGCACTCTTT 


CAAGAGCATG 


AAACTTTGGG 


174851 


GATTATTCAT 


GCAGAGAAAT 


TCCCTCTAGC 


AACTAAGGAA 


TTTCTAAGCC 


174901 


GCTATGCTCG 


GAATCCTCAA 


CCTCACCTTA 


CGATTTTGAT 


CTTGACCACA 


174951 


AAACAAGAAT 


GCTTTCGAGA 


ACTGTCAAAA 


GCCTTGCCAT 


CGGCTCTTTC 


175001 


TTTGAGTTTA 


TTTGGTGAGT 


GGCCCGCAGA 


TCGTCAGAAA 


AGGATCATAC 


175051 


GCCTCCTGTT 


GCAAAGAGCT 


GAGCGTGTGG 


GGATTTCTTG 


CTCTCAATCA 


175101 


TTGGCATCTT 


TGTTTTTGCG 


TGCACTTGCT 


TCAACCTCTC 


TTCCTGATAT 


175151 


TCTCAGTGAA 


TTCGATAAGC 


TACTGTGCTC 


TGTTGGCAAG 


AAAACGTCCT 


175201 


TGGATCACTC 


TGATATTAAA 


GAGCTCGTTG 


TCAAAAAAGA 


AAAGGCTTCC 


175251 


CTATGGAAAT 


TTCGAGACTC 


TCTATTGAAG 


AGGGATCCGG 


TAGAAGGTCA 


175301 


CCAGCAGTTG 


CATTTTCTAC 


TCGAGGATGG 


TGAAGATCCC 


TTGGGGATTA 


175351 


TTACTTTCCT 


TCGTACCCAA 


TGTCTCTATG GTTTACGTAG 


TATTGAAGAG 


175401 


GGATCGAAAG 


AAAATAAACA 


CCGAATGTTC 


GTCCTTTATG 


GAAAGGAGAG 
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175451 ACTACACCAA GCCTTAAATT CTTTATTTTA TGCAGAAACC TTAATTAAAA 

17 5501 ATAATGTCCA AGATCCTATA GTAGCTGTGG AAACTTTAGT GATTAGAATG 

175551 GTTAACCTGT GACTTTATAT CTTCTTCCCA ATACTCTCGG TACCCGTGCT 

17 5601 GTAGAGACTC TCCCCTCCGT TATAGGAGAA TTAGTTCATA GACTAGATGG 

175651 GCTGATTGTA GAAAGTGATC GTGGGGGTAG GGCATTTCTA AGTTTATGGA 

1757 01 AAATTCCCGA AGTTCATAAA TTTCCTCTTG CTATTCTTAG TAAACATGCG 

1757 51 CGCCTCCCTA AGGCTTGGGA TTTTTATCTA GAGCCTATCG TAAAACACGG 

175801 GGAGAATTGG GGACTGATCT CTGATGCGGG TCTTCCCTGT ATTGCAGATC 

175851 CTGGAGCGAG TTTAGTGCGT CGTGCACGTG CTTTGGGGAT TCCTGTGCAG 

17 5901 GCTTTTTCAG GTCCCTGTTC GATAACGTTA GCGCTCATGC TTTCAGGCTT 

17 5951 GCCTTCCCAG AGCTTTACGT TTTTGGGATA CCTCCCGCAA AGTCCTAAGG 

176001 AACGTGTAAA GTCGATAAAA AAGGCAGCGA CCTCCAAAGA GGTATCTACT 

17 6051 TCAGTATGTA TAGAAACTTC TTATCGTAAC GTCTATACTT TTGAGTCTCT 

176101 TCTAGATACT TTACCTTCCT ATGCGGAGCT TTGTGTTGCC TCTGACCTTT 

17 6151 CGGGCCCGAG TGAACTTGTT CTCACACGCC AAGTGCAATC ATGGAGAACT 

17 6201 ACTGAGGACT TAGGTTCTGT GAAGCAAAGT ATAACCAAAG TCCCAACAAT 

176251 ATTTCTTTTC CACATCCCGA ATTAATTTTT TCTTAGCTTG TTTTATCCCC 

17 6301 GGTATCATTG TCAGGAGTTT CTTAGCATGT AAGCTCTTCG GGGATATTTT 

17 6351 TCTTGTGATC ATAAGCAAAA ATTAAAGGTT GCATGATTCT AAATAGCGCA 

17 6401 TAATCATAGG TTTGCGTCTT TTTAAAATAA TCGTAATCCT CTATTGCGTG 

176451 TTATCAATAC TTATTTTATA ATCTTTGCCA AATAAGTTTA GAGCGTTGAT 

176501 GACAGCACCC ACAGAATCTC GATCTTCACC ACCAACACTA CTCGAAGAAA 

17 6551 CGGAACCTCT ATCCCCAAAT CCTATTCCCG CCGATATCCA AATCCCGAGA 

176601 ATTACAATAT CTCCTCCTTC TCTAGACGTA TCAACAGTTG CATCTTCGGC 

176651 TGAAGATATC TCGGTCTTCA TTGCAGGAGG CCCTAGAAGT TCTTCATCTG 
176701 CTTCTGTGGC TTCGGATGTG TATGAGTTGG TTTGCCTTTG TGGCGGTGAT 
17 6751 GAGGATCCTG AGCCCCCAGA TTCTGAGGTA CGTACTCTCT ATGTAAATGG 
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176801 AAGTTGGCAA ACGCATCAAG AAGCTGTACA GGAATTGCTG TATATTAGTG 

17 6851 AAGTACGGGG AGAGGCCGTT CGTCTTCTCT ATAACGATGG AAGCGGCATG 

17 6901 TCTCCTTGGC CCATCAGTCC TTGTCGTACT CTTCCTACTC TCGATCATCC 

176951 TTTATGTCAG GCCCTTTTGA CGGTTTGGGA ACAGTTTTTC TCTGCTCCTG 

177 001 AAAATCAGAA TCGTGAGTTT CTAGTGATTT TCTATGGGGA TGCATCGCCT 

177051 TATATACAAC AGGCGTTAAC GCAATCTAGG CATAGTCCAC GTATTGTTGT 

177101 TGTAGGGATT TCCCCGACGG TCTTTATTCA AGGAGACTTT AGGGTCCATA 

177151 ATTACCGTGT TTCTGGAGAC TTCTTTAGTT CTCTGGATTG TCGGGGAACT 

177201 AGGGCGGAGA ACACCACGAT ACTGCCGTAT TCTTCGGGTC TTGAGGGTGT 

177251 TTTTCTGCCT TCTATCCGTT GTCCTTCTTT TACTTGGGCG GTGCGTTTTG 

177301 GAGAGCAATG CTTGGTTGCG AATAGGGGTG AGGATGTAGA AGATAGGGGA 

177351 GGTCTTTCTC AAGATGCCGA AAGATCACAG TTACCACACA GTGAAAGAGA 

1774 01 TCTAGCTGTT GTCATTGATT CTACGGATCC TAGTTCTATG AGTAGGCTTG 

1774 51 TAGAATGGTT GAATCAAGGA TCGCCTTCAT CAGATATGGA AATCAATCCC 

177 501 TATCCCCAAC GGTGTCCTGA TGTAGCTCTT TCTGCGCTTT ATGCAATTTC 

177551 TAGAGTTTCA GGACTTGCGC AGGAATGGAT CCTAGCCTCT GTTCATGAGG 

177 601 GCTTAGACTT GCAGATCTGT TACTCTTTAA TTTTGATGCA CACGACGTTT 

177651 GCGGTCCGGT ATTTTTTCTT ACTCTTTACA AATTATCCTC AGTCTAGAGA 

177701 GAGATTCCGT ACTGCACGAA TCGTAGCACA ATCTCTATAT TTACCAAGCA 

177751 TCCTTGTTCT TGTTTTTGAT TGTGGCAACG TCCTGCGTAA ACTATGGATG 

177801 CCTCAGGAAA TCTTACGAGC AATTTTTATT TCTGCGTCTA CAATTTCAGG 

177851 GAGTATTGTC TTTGTAGAGT GCACTCGCTG GATGGGGCGA GGTCTTAGAC 

177 901 ATCGTGTACA ACAATTTGTG CAGCAACGAG TTATAGGAAG TGGCCTGCCT 

177951 GTAGGAACAG TACGAGCTTC TTATCGCGAT CGTGCAGGCT TTATCATAGG 

178001 CTTTTTACAA ACTGTACATG GAGGACTTTA TTTGCCGGTA TCCATTATGG 

178051 TGCTTAACCA GATTGCAATA CAAGTTCCAC GTATCTTAGT ACGTCCAAAT 

178101 AACACTGCTG TTTATGATCT ACATAATAAA AGTGCTGAAG AAAATTGGAG 
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1 *7 Q 1 Q 1 
1 / OlDl 




PTATTAGPTG TTGGCCAAAC ATTAAACTTC 


ATCTTATGTG 


1 f oZ U 1 


till H-ll 


GTTCGTAAAT CTATGGTTCT TTGTGAAGTC 


CGTATTACGC 


1 / OZ 51 


V»A 1 It 1 Awwt 


GTrr/rrGTCG CTAATGTATC TTTGGAGACA 


TCTTGGTATT 


J. / O Jul 


TTTAACATAA 


GAGCACGGCA AAAAAAAGAA AGACCGAGAA 


AGAAGCTTTA 


1 7D1 C1 
1 7 b JDl 


CCTTTAAGAG 


TTCTCTGATG TTTCCCGGAG CATCAGGAGT 


CCGAAGGGCA 


178401 


GAGTCTAGGT 


TTTAGCCTTC CGCAGGGATT AGAAGGAATA 


TCCCATAATC 


17 84 51 


TCTTCCGCTA 


TTGTATCGTG GAAGAGACGG CCCTGTCTAT 


TTAGGGCAAG 


178501 


ACATTGTCCA TGCACACTGA ATAGGTTTTG TAATTTTACA 


TCTTGCGTAA 


T "7 O C C n 

17855 1 


GCATGGAGAT 


AAGTGTGGAG GGGAACTCCG CGAGGTCTGC 


TCCTTCAAGG 


1 Term 
178601 


AGTCGGAGTC 


GCAGGGCTAA GGCTTCTTTG ATTCGTTCTT 


TTTTTGGGAG 


178651 


AATTTCTGAG 


GTCTCTTGGG TAGGGAGATT CTTACGTACA 


GCACGTAGAT 


178701 


AGTGAGAAAT 


ATGACTATAA TTTTTTGACC GCTCTCCGTG 


AAGGTATTGC 


178751 


GAAGCTGAAA 


CTCCTAAGCC TAAGAAAGGG CGATCTGTCC 


AGTAATAGAG 


178801 


GTTGTGCTTT 


GCGGGGTAAT CTGGCTTGGC ATATGAAGCA 


AGTTCATAGC 


1 / 88 5 1 


GTTGGAACCC 


TTGGGAGAGT AGGAGATTTT CAGCAAGGAG 


GCTCATCTCA 


1 7 by o i 


GCTAGAATTT 


CTTCTTGGGC AATTGTGGGG ACTAGAATTT 


TGCGGTGTTT 


178951 


ATAGAAGGAG 


GTGTGGGGAT CTATAGTGAG GTTGTATAGA 


GAAATGTGAG 


179001 


TGATAGGGAG 


AGTCAGAGCT TGATGTAGGT CGCTTAGGAA TATCTCCAAA 


179051 


GACTGTGTGG GCAGTCCGTA GATTAGGTCT ATAGAAAGAT 


TAGAGAATCC 


179101 


GTGATTCTGG 


CATTCTTGCA GTGCTGTGAT TGCCGCAGAT 


GAAGAATGCG 


179151 


TTCTTCCGAG GAGCTGTAGG ATAGAGTCGT CGAAGGTTTG 


TACGCCAACG 


179201 


CTAATTCTAT 


TTATTGGAGT CTCTTGTAGT TGACGTAGAT 


AGCTTACGGT 


179251 


GAGATTTTCG GGGTTGGCCT CTAAAGTAAT TTCCCGGGCA TGGGGGGCTA 


179301 


GCTCTTTGAG 


GATGCGCTTA AGATCAAGAG GAGAAACTAA 


TGAAGGTGTT 


179351 


CCCCCTCCAA 


AAAACACAGT CTCTATGAAA TGCGTCTCTT 


GGATGGGGGC 


179401 


TAGCTTTCTT 


AGCCCCTCTT GAATTACAGC ATTACAATAG 


AGCGATACAG 


179451 


ATTCACTTTT 


GTAGGGGATT GTATAAAAAC TGCAATAGCG ACATTTTTTT 
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179501 GTGCAGAAGG GAATATGAAT 

179551 TAGCATCGGG ATCAATGATA 

179601 AAAGGGTCTT CATCGTCTTC 

179651 TTCACGTCCT TCATCGGTGA 

1797 01 TATCAATTTC TGTTTCTGTG 

179751 GGACCATCAG GGTAGACAGT 

179801 CTCTTCAATC TCTCCGTTTT 

179851 CGTAAATTTT ATGTTCTAAG 

179901 TCTTTTTTTG GCACTAAACC 

179951 TTCTGATTCT AATTTTTTAA 

180001 CCTGTATTCG AGAGCAGGTG 

180051 AAAAAAAACA AAGATTTTTT 

180101 GCAGACATTT AGATCGTATT 

180151 GTTTGTGGGG CAAGTATATT 

180201 ATCAGAGATT TATGAATCAC 

180251 TTTGAAGGGT ATCAGCTCGG 

180301 TAAGATTTCT GGGAATGAAA 

180351 AGTTTCTATG TACGATTTAT 

180401 TCAACGCTTG CCCCAACTAC 

180451 TAAGATTGAT CTGGATGAGC 

180501 CTCAGGTTTC GGTACGAGAG 

180551 GGAAGTCTTA CTTTAGAAAC 

180601 TGTTTGGAAT CTTATGGAGA 

180651 TCCTTCGCTC CTATAAAGAC 

180701 TTACAGATAA AATTTACAGG 

180751 GACCTTGGTC CCCATGTTGG 

180801 GAATTTCTAA CTACGTTTTA 



PCT/US99/26923 

ATAAAGAGCT AGGGGAGCCT TACCATTCAT 
CCACCGCGTC TCCAGCGATT GCGATCGCTG 
GTGGGAATAG TCGACTTCTA CTGCCCCATC 
GTTCCAGTTC TACGGTTTCG CCTGGGTCGA 
GGTTCCACAA ATTCAATATC AGACATACTA 
CTTTGCGGGA CTGCGTCGTG GTGTGACGTA 
CTTTGAGGAG TTGTAGACGT TCTTTTTCTT 
ATTCGGATTT CTTCTTGGTG CCTGCTAATT 
CAACTGCATC CATTGCGTAA GGTCATGAAG 
GACGTTCGCT TTTCATCCAA GGGTTCCCAT 
CATAGCATTT TTTTGTTTTT TCAACTGCAG 
TTTTTTTGAA AAATAAAAAT TTGCTATGAA 
TATTGAGTTT AATTATTTTA TGGATTCCGA 
CTTCGGATAT GGATTGGATC GAGTCTATGT 
GAGACTTTGG ATCCTTCTTG GAAGTATTTT 
TCAAGCAGCA TCTCCATCAG AAGCTAGTAC 
CTATTGCTAT GCTTCAAGAA CAAAAATCTC 
CGTTATTATG GATATTTGCA AAGTCAAATT 
AGATTCTCGA TTCATTCAGG AAAAGATCGC 
AGGTGCCTTC TGCGGGTCTA CTTCCTAAAG 
CTGATCGAAG CTTTAAAAAA ATGCTATTGC 
CCTAACATGT ACTCCTGAGT TGCAGGAGTT 
AGCGACAAGT GGAGCGCTTT GCAGAGCAGC 
TTATGTAAAG CAACGTTTTT TGAAGAGTTC 
TCAGAAACGT TTTTCTTTAG AGGGCGGAGA 
AGCATCTTGT TCATTATGGA TCGGCATTAG 
GGAATGGCCC ATCGAGGTCG TTTGAATGTA 
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XOuOJl 


TTAACGAATG 


TTTTGGGAAA 


GCCTTACCGT 


TATGTCTTTA 


TGGAGTTTGA 


1 OU^Ul 


AGAPGATCCT 


GCAGCACGTG 


GTTTAGAGAG 


TGTTGGGGAT 


GTAAAGTACC 


±0\J?D± 


ATAAAPPPTA 


TGTGCTAAAG 


TCCCATCAGA 


AAGATAGGGA 


AACTACCTTT 


Id XU VI 


P TP & TP TTP P 
V? i uM lull o*- 


CAAAPGCTAG 


TCATCTCGAA 


TCTGTAGATC 


CTATTGTCGA 


i di n ci 




nrTpppTTGC 


AAPACCAAGG 


TCACGCAGGT 


AAAGAGCAAA 


1 Q 1 1 ft 1 


pp APPTTAPP 


AATTTTAGTT 


CATGGAGATG 


CAGCATTTTC 


TGGTCAGGGA 


1 OI 1 CI 


PTPPTTTITP. 
W 1 V?<J 1 1 1 A 1 Vj 


AAACTCTCCA 


GCTGAGTCGT 


GTTCCAGGGT 


ATTCTACTGA 


1 o 1Z U 1 




CACATTGTTG 


TGAATAATTA 


CATAGGGTTT 


ACCGCAGTGC 




p a pp p p ^ p tp 


AAGGTCCACC 


CCTTATTGTA 


CGGATATTGC 


TAAAATGCTA 




p.rp, ATTPPTP 


TATTTCGAGT GAATAGCGAG 


GACGTCGTTG 


CCTGTATAGA 




APPTATAPAG 
nuv. 1 nl nOn« 


TACGCTCTGC 


AAGTTCGTGA 


GAGATTTAGT 


TGTGATGTGA 




TPATAGATPT 


CTGCTGTTAT 


CGCAAGTATG 


GACATAATGA 


AAGTGACGAT 


loll Ji 


PPPTPAPTAA 


CAGCTCCCTT 


ACTCTATGAT 


CAGATTAAGA 


GAAAGAAGAG 


i pi cm 


TATTPPPPAP 
inl 1 Lulurto 


CTGTTTAGGC 


AATATCTGTT 


GGAAGGGCAG 


TTTGCAGATA 


1 Q 1 

IOXjjJ. 


TTTPTPA APA 


AACTTTGGCA 


TCTATTGAAA 


AAGAGATTCA 


AGAGAGTCTG 


loioui 


A ATPPTP APT 


TTCAAGTATT 


GAAAGGGACG 


GATCCAGAAC 


CCTTTCCTAA 


1C3 1 DDI 


A A AAPA ATPT 


CATCACTGCG 


ATCGCTTAAA 


TAACGGCGAG 


CTTATTTTGC 


10 1701 

lOl /ul 


ATPATTPTPA 


TGTTTCTTTG 


GATCGCGAGA 


CTCTTTTTCA 


TATGAGCTCG 


101 / DI 


PP TPTTTP TP 
1L. 1 1 Iwlo 


GTTTCCCTGA 


CAATTTTCAT 


C CCC ATCCTA 


AAATTAAGAC 


loioui 


TPTTT"? API 1 
!L 1 1 1 i nVjAn 


AAAAGAATGA 


AAATGGCAGA 


AGGTGGGGTT 


GGTTATGATT 


lOlOJl 


r t r+r*rT' atppp 


CGAAGAATTA 


GCCTTTGCTT 


CGCTATTAAT 


CGAAGGGTAC 




A A PPTP A P A P 


TCTCAGGTCA 


AGATTCTATT 


CGCGGGACAT 


TCAGCCAACG 


181951 


ACATTTGGTA 


TGGAGTGATA 


CTGTGACTGG 


AGATACCTAC 


TCTCCATTGT 


182001 


ACCATCTTTC 


TGCAGAGCAG 


GGCTCTGTAG 


AAATGTATAA 


TTCTCCTCTT 


182051 


TCCGAATATG 


CAATTTTAGG 


GTTTGAGTAT 


GGCTATGCTC 


AACAGGCATT 


182101 


AAAGACTTTA 


GTGTTATGGG 


AAGCGCAGTT 


TGGGGATTTT 


GCTAATGGTG 


182151 


CACAAATCAT 


TTTCGATCAG 


TATATCTCTT 


CGGGAATTCA 


GAAGTGGGAT 
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182201 TTACACTCTG ACATTGTTCT 

182251 ACCCGAGCAT TCTTCATCTC 

182301 ACTGGAATTT TCAAGTGGTC 

182351 ATTCTCAGAG AGCATGCTAA 

182401 TACTCCTAAG TTGCTGCTGA 

1824 51 AGTTCACAGA ACCTGGGGGA 

182501 AATTATGATG CTTCTATTTT 

182551 TTATGCAGAA ATGCTTCCTC 

182601 GTATAGAGAG CTTGTATCCT 

182651 GATAAGTATT CTCATTTGAA 

182701 GAATATGGGG GCCTATGACT 

1827 51 CTGAGAAACT GCTATATATA 

182801 GGATCAGCGA AGCTCAGTCG 

182851 CTTTTCTTTA AGGTAAATTA 

182901 CAGAGTCGAT TAGCGAGGTG 

182951 GCTCTGATTC AAGAAAACCA 

183001 AAATCAGCTC ATTTATGCCC 

183051 CAGAAGGCGA TGTTGTTCCT 

183101 GCAGGTGAAG GGGAAGAGCT 

183151 AGCTGAGATC ATTTGCTTTC 

183201 AGAATAAAAC GTTTATTCCT 

183251 GGTCTTTCTG CAGGAGATCG 

1833 01 TCGTAAGACA ATTTCGCGGC 

183351 TGCTCACGAC ATTCAATGAG 

183401 AAGGAAAAAC AAGAAGAGTT 

183451 TATGTCTTTC TTTGTGAAAG 

183501 GAGTGAACGC CTATATTGAT 



PCT/US99/26923 

GCTTCTTCCC CATGGGTATG AGGGCCAAGG 
GTATAGAACG TTATTTGCAA TTAGCCGCGA 
TTGCCTTCCA CTCCTGTGCA ATATTTTCGG 
GAGAGATCTT TCTTTGCCTT TGGTGATCTT 
GATATCCACA ATGTGTAAGT AGTATCGAGG 
TTCCGTGCTA TTCTCGAAGA TGCCGATCCT 
GGTATTGTGT TCGGGAAAGA TCTATTATGA 
AAGATCGGCG TAAGGACTTT TCTTGCTTGC 
TTAGCTCTTG AGGATTTAGT GAGCCTTATC 
ACATTTTGTT TGGCTACAAG AAGAATCCAA 
ATATGTTTAT GGCGTTGCAA GACATTCTTC 
GGACGTCCTC GGAGTAGTTC CACAGCTTCT 
TCAAGAGCTG GTCACGTGTA TGGAAACCCT 
TGACTACAGA AGTACGCATT CCTAATATTG 
ACCGTAGCTT CCTTGTTAGT TACAGAGGGT 
GGGCTTACTA GAAATTGAAA GTGATAAGGT 
CAGTATCGGG AAGAATTTTC TGGGAGGTTT 
GTAGGGGGGG TAGTGGGAAA AATAGAGCCC 
TGGAGATTCT CAGTCTAAAG AGACTATAGA 
CTCAGTCTGG GGTGCGTCAG TCTCCTCCAG 
CTTCGTGATC AGATGGACCA AGGATCCCAA 
AGGAGAAACT CGAGAACGCA TGACCTCGAT 
GTCTTTTGTC TGCTTTACAT GAGTCTGCGA 
GTCTATATGA CACCTCTTTT TCATTTGCGA 
TCTATCTCGA TATGGGGTGA AGTTAGGATT 
CTGTCTTAGA GGCTTTGAAG GCATATCCAC 
GGCGAGGAGA TTGTTTACCG TCACTATTAT 
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183551 


GACATTTCTA 


TTGCTGTAGG 


TATCGATCGA 


GG AC TTGTGG 


TTCCTGTGAT 


183601 


ACGCGATTGC 


GATAAACTTT 


CTAACGGGGA 


GATTGAGCAG 


AAACTCGCAG 


183651 


ATCTTGCCCT 


TCGGGCTCGT 


GAAGGCCTAC 


TTGCAATAGC 


GGAGCTTGAG 


183701 


GGAGGAGGTT 


TCACAATTAC 


CAATGGAGGC 


GTATATGGAT 


CGCTACTTTC 


183751 


GACTCCCATT 


ATCAATCCCC 


CGCAAGTGGG 


GATTTTGGGG 


ATGCATAAGA 


183801 


TAGAAAAGCG 


CCCCGTTGTT 


CTTGATAATG 


AAATTGTAAT 


TGCAGATATG 


183851 


ATGTATGTCG 


CTTTAAGCTA 


TGATCATCGT 


CTTATTGATG 


GGAAAGAGGC 


183901 


TGTTGGGTTT 


TTAGTCAAAG 


TGAAAGAAGG 


CCTAGAGAAT 


CCTGCCTCAT 


183951 


TACTCGACTT 


GTAATTTCTC 


TGATTCTCAT 


AAAGGCTCTT 


TTAGAGCCTT 


184001 


TTCAGATTTT 


TTAACCTCTT 


TTCTTATCAT 


GAAAAGGATT 


GCACTCATCT 


184051 


TGAGTAAGGA 


ACAATGTAAA 


TTGAGTATCC 


GCACTTACAT 


TGGCTTGTAC 


184101 


CGCAGCCTCA 


TTCAAGGCGC 


CTAACAACTT 


TCCGGGTAAG 


TACACATTGT 


184151 


CTCTTTTAAA 


GTGAGGTTTC 


CCGGTATGAT 


CAAGAGGAGT 


GAAAGCTTCT 


184201 


TCTGCGTGAA 


TATGTACTGA 


TGTGAAGCCA 


TTTTCTAAGA 


GATCTACAAA 


184251 


CAGATTTTTC 


AAGAACTCCT 


TTGAGTCTTT 


TTCAGAAAGC 


GAAGAGCTCA 


184301 


TCGGATCTTT 


TCTTTCTAAG ATATCTTGTA 


CATCCTTGTC 


GTAATAACCG 


184351 


CAAGGAGTCC 


ATGCTAAAAC TAAAGTTGAA 


CTGACACCAG 


GAGTTGGAGA 


184401 


ATTGAAAATT 


GCCCATTGAC 


CTGCTTTTTT 


TCTAGAGTTC 


ATTTTAGTTT 


184451 


TCATAGCTTC 


CCACTGTTCC 


TCTCCTAAGA 


GATCTTGCAG 


TTGCTTTAGG 


184501 


GGAGCCGAGA 


CTTCCTTATC 


CCCTTTCCAA 


TGCGAAGTTT 


CTGGTGAAAG 


184551 


GAGGAATAGC 


GAAGGTACGC 


GAGCCCCGTG 


ATCTCTAATT 


TCTGTGAGAT 


184601 


TAGGAGCTTT AGCTGTGACA . ACCTTAAATT AAGGACCCTT CTTTAGATGG 


184651 


TGAATTTCAC 


AATTCGTAGC 


TTCTGTAATT 


TTGATTTCTT 


TAGTGAGTTT 


184701 


TTCTTTGGGA 


GGAAGAGCGT 


CGTAAGGTAA 


GGAACAATGA 


TGTTAATAGG 


184751 


GCTAGTTTAG 


TTAAATAAGA 


ATAGGGGATC 


TCAGGTTTAG 


GAGTGATGGG 


184801 


TTGTTGCTCG 


ATTACAATGA 


TGGTTAATGG 


AAGAAGAGCG 


TTTCTAGAAA 


184851 


GTGCATCTGC 


TTCCTCGTCT 


TCTTTAGGCA 


ACAAGCTCAA 


GATGAGAAGC 
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184901 ACGAGCCCGA TGGCTAATCC 

184951 CGCCTGTATT AGAGGGAAGC 

185001 CAGCTGCTAA CATTAGTGCA 

185051 ATTTGAAGAG CAAGACTTTT 

185101 TTTGTTAGCT TCAATCTCAG 

185151 CGTGAACTTT ATGATTAGTT 

185201 GCTTAATTTT TTCCTAAAAA 

185251 AGATTACTGA ATTCTGAGTG 

185301 TAAGGTCCTT AGGGGAAAGG 

185351 CTGGGTAGGA GAAGGACGGC 

185401 GCCTGTGAGA TTGGGATCGG 

1854 51 AAGCGTCGTG TTCCGTGTAG 

185501 TTAGAGATTC CTAAATTCGT 

185551 AAAATGGTTT TTGGGATTCA 

185601 TAGCGTAATC GGGATAGATA 

185651 GCTACGAAGA GATCTTGTGG 

185701 TACGGTGACA GCATAGATAT 

185751 CATTTGCGAT TGCGTGGTGT 

185801 TCGGAATGGC GGATATGGAG 

185851 GTCTGCAGGT TGGTAGGTGG 

185901 TGCCGTGGCG TTGATGAAGG 

185951 GCAATCTCAG GATTCTTGGC 

186001 TTTTGAAAAG ACTCCGTGGC 

186051 TCCAATAGTT CAGAGGGTGA 

186101 TAGTTTCTTC AGGGTCTTTC 

186151 AATTCTTCTT CAGCATCTTC 

186201 TTTTACATAA AGATCGATCA 



PCT/US99/26923 

CACGAGTGTG ATAATACCAG CAATGCTATA 
CTAAAGCTAA GAAGGTAACG AACGTGATTC 
GCGAAAAGTA TGTAAAGAAC AGCACGCGCA 
ACTTCTGAGT GTGCCTTCTG TAGCTTGGGT 
CGAGTTGCGC TTTTTTTGGC TTCACTGGTT 
GATGCGCCTA ATTTCATAGA AAGTCTTTAA 
TTGTAGTTTA CATGTTGTTT TTCATCAAGA 
GGTTGGAACG AAGCATGTTC TTCTAGAAGC 
AAAAGAAGCT TGGCTGGATC TTTTTAATCT 
GGTTACATTA TTTCTATTTT TTGAATGTTG 
GATGGTGAGC AAGGTACCTT GAAGAAAAGA 
GTACAGAGGT CTGAGATAAA AATGCGGTCT 
AAGTTGCTTG CGAGCAATCG CACGCAGGTC 
TAAAGGGAAG AAAGCTACGA GGAAATAACG 
GCATAATCTG GACCGATGGA AGGGCCGATA 
TTTTGTATGA AATAATTTTT TCATAGTACC 
TGCCAAGCAA TCCTCGCCAT CCGCTGTGTA 
TCTCGATCAT AAAAGATAGC TGCTTGGCAA 
AGAGAGGAGC GGAGACTGCG TGCACAGTCC 
GGGATGTAGG TGTAACACAA CGTACGGAAG 
TCGCAATACT TCGGAGATTG GAGAGCTGAA 
TGCGAAGACC GTGCCCTCGG CATCCTTTTG 
GTATAGGCAA ACCATCGAAT TCTTCGAAAG 
GTGTGGAAGG ATAGAGTCAT ACTGTCAACT 
CATACCCCAT GTTCTTGTAA AAGTCGAATT 
CATGGGTATG TGAGCTTTTA CACAAGTATG 
TCCCTGTTTT GGAACCTACA AATCCAAAAT 
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186251 CTGCATCTGC CATTTCTCCA GGGCCATTCA CAATACAACC CATGATAGCG 

186301 ATCTTAAGTC CTGGTAGGTG CTGCGTTCTC TTGCGGATAC GTGTGGTGAC 

186351 TTCTTCAAGA TCAAAGAGGG TCCGACCACA CATAGGACAG GAGATGTACT 

186401 CTGTTTTTAC AAGGCGCACC CCTGCATTTT GTAGAGTGCC AAAGGCAATT 

186451 TTTAGCACGT CCTGTAGGGG AAGGTTCGGT AAGTCAAGAA CCACAGCTTC 

186501 TCCAAGGCCA TCAAGAAGCA GAGCTCCAAA CTCTGTTGCT ATGGAAATAG 

186551 CAGCTTCTTC TTTATTGTCA AAGTCCCTTG AAAATACTAG CTTGGTCGGT 

186601 TTTCCTTGGT GTCCTTGTTT TTCAAAGAAA TCTCGGGAGG TATGAATGAA 

186651 GGGGTCTGAA GCATGAAAAT GCACAAATGG AGCTTGATGA ACAGCAGGGC 

186701 TATCCCAAAT CTCCTCATTG TGTTCATATA GGCAAGGCAC TTGATGGTGG 

186751 TGGAAAACTA AAAAGTGTTC TCGAAGTACA TCTGTAATAG GAGCATCTTT 

186801 TAACTCAGGG GGAACGACGA CCCCTTCAGG AGTTGTGAAT GCTTTTTCTT 

186851 TTGTTACGGG ATTTACCCCC AAGTGTTCTA AGAGTTCTTC AGGAGTAAAG 

186901 TCGGTAAGAT GGTGAGGATA GAGTTTTAAA AAGACTCCGT AGACGTCTCC 

186951 CCAAAGTGTT GTTTTCGCAG GCTTCTCTGC AGCAGAAACA AAGTTTTCGG 

187001 AGTGTTGTAG GGAAAAGGGA TTTTTCTTTT CTGGAAGGTC TAAGTAGATT 

187051 TTCGTATGGC GTAGCAAGCT ATCACAGACA GGAATTTCTG TAGTGGGACA 

187101 CCCTGTGAGA GAGCAGCGTA TGGTATCCCC GAGTCCTTCG GCAAGAAGAG 

187151 TTCCGATTCC TACTGCGGAT TTTATGATCC CGTCCACGCC CATTCCAGCT 

187201 TCAGTAACTC CAAGGTGAAG GGGATAGAGC CAGCCTCTAG CATCTAAGTC 

187251 TTTAGCAAGT TGGCGGTATG CAGTTACCAT GATCTTCGGA TTGCTAGATT 

1873 01 TCATTGAGAA GACAACATCT CTATAATTCA GCTTTTCACA TACAGCGATA 

1873 51 TATTCAATTG CTGAGGCTAC GATTCCTTCG ATAGTGTCGC CATATTTTTG 

1874 01 CATGATTCTT TCGGAAAGTG ACCCGTGGTT CACTCCAATG CGCATAGCCT 
187451 TGCCTAGTCG CTTACATTTC TCTACTAAAG GAGCAAACTT TTCTTCAAGA 
187501 CGCAGGAGAC TTTGGGCATA GCTTGCCTCT GTATAGATCT TCGTCCCCTT 
187551 GAACATGTTC CTCTTATCTA TGTAGTTGCC TGGATTGATG CGAACCTTGT 
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187 601 CAGCAAAATC AGCAACTAAC ATAGCTGCTT GAGGGAAGAA GTGGATATCT 
187651 GCAACCAAAG GGATATTTAA CCCTAGAGCA ATCAGACGTT CTTTAATTTT 
187701 TTCACAGGCT TGTGCTTCCT TGATTCCCTG TACAGTCACT CTGACAATAT 
187751 CACAATTATG TTCCGCTAGA GCGTAGATTT GCTCTACTGT ACTGTCAATG 
187801 TCTGTGGTTA ATGTCGTTGT CATTGATTGG GTTTTTATTG AGTGGTCACT 
187851 GCCTATGTAT AAGTTGCCTA TTCTTACTGT ATGGGTTTTG CGTCGCGAGG 
187901 AATTGATGGC AGGGGTAATG AGTGTCATAA AAATCTCAAA AATTTCAGAG 
187951 TTTTATCAAT TATAAAAGCC GAGAGAATTT TGTTGAAGCT GATAAAGCTT 
188001 CTCTAAGGAT GCCTACATTT ATCAATTTTA CAGAACTTCA CCTCAATAAG 
188051 AGGAATTTTC AGAGAGAAAC GACGACCTTG GATGGCCGAG TGTTTCAGGC 
188101 CATCAAGAGA GTATATGGAG AGATTATTGT GAATTAAAAT AATTATTTCA 
188151 TACTTATCAA GGGGATAAAG ATTTTGCTTT AGAGGACGAT TCGTAAGGAA 
188201 CTCGAATCTG TTTTTGTGTT CGTAGAACGA ATCCCGAAGT TTGACAAAGA 
188251 GAAGGAGTTC CCAAGAGGAG TGCTTGGGGC TCGTACTTTG GGAACGCTTT 
188301 TTTCTAATTT GAAATCGGTG TATTTTGTGA TGGATGCGTG ATTTTATACA 
188351 TCGAGATTAA CATGATAGCC AGAAGGATTA GGGAGAGCAA GAATAAGGGC 
188401 CCTATAACTG TAAAGTAGGC TCCGACAGAT GTTGCTACAA AGATCATGGC 
1884 51 TATAGCAACG ATTCCAGAAA CAAAGGTATT GAGGAGGAGT AAAGCAGTTG 
188501 AAGCCGCAAA GGCAATTTTT ATGCATCGTC CTGATCCCGA TAACCTTTCT 
188551 AGGAATTCTC CAAGCTTTGT TTGTTCAACT GGAGATGCAC TTGATGTTCC 
188601 TGTGACTACT GGAGAAGACA TGTGATTTCC TATATTTCTA TGACAAGCCC 
188651 TATAATTTTA TAGTTTTCTA TTTTAAATGC AATGTAATTA GTGTTTTACA 
188701 AATTAATCTC 'CTACTAAAAT AGGAGGAGAC ACTCTCATAT TTTCTATGAC 
1887 51 CTAGGGAATA AAATATTTTA TTTATAATGA GTGAAAAACA CCGCTGTTTG 
188801 TCTAAGAAAT ACTTTTAGAA GAGAGTTTAC GTTTTATCTG CTGTTCAGCA 
188851 GTTGTTCCTT CGGTTTTGTT TACAAGGGCA AGCTTTATAA CAAGAAGAAG 
188901 CACACAGAGT ACTGTTGTAA GAATAAGGCC TCCTAGGATA ATCAAGCAGA 
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188951 TGGGAGGAGT CCCAGGAATA AAGAAAGCTG CAATTATTCC TAAAACTTCT 

189001 ATTACTAGAA TGGCAACAAG GATTTTAACT AAAAGTTTAA TCGTCTTCGT 

189051 CGTTTTATCT TGATGCTTTG CTTTTAGAGC TTTGGAATTT GGTTCCATGA 

189101 GGCCTAGATT TCCAGAGGAA GGGCGGTGGC TTGTGGGTAG GGGGGCTGAG 

189151 GACACGGGCA TCGTCTTTTA GTTAGAATTT TCTTTATTTC GCAAAAGCTT 

189201 TGTCTTATGG AGAGCAGAGG CGTTTCTATT TGCTTGAGAA AGTATAGCGT 

189251 GGTTTAGAAA GTTAGAAAAG ATTTTTTAAA AAATCTTCTA TGTTTTGGAT 

189301 GTTTTAAAGA GAGAAGTCTT ACCATTTTTT AAGAAAGGAT TTCCTTTTCA 

1893 51 CAAAAAGGCA TCTACTCTCT TTTGCTTAAC CAGGGGGAGA GATTAGGGTG 

1894 01 TGATGAGGGG ATGTCTAGCA' AGAGTTGTCA AAATGATAAC CCACGGTTGA 
189451 TTTTGATTTT CTGCTTCTGA TCCAAATGTC TGCAGAGCAT CTATCAAGGC 
189501 TAACTTCACA CCATCAATCC ATTGCAGACG TAGACTAGTA GTCTCTTCGT 
189551 CTTTTGGAGA GCCAGGGAAA TGGGAGGAAA GCAAGGGGAG CTGAACTACG 
189601 GTTGTTTTAC GGCGCTTGGC CTCATTCAAA CAGTTAAGGT AGGCTTGCCT 
189651 ACAAAAGGTA AACGCATCAT TAGGATTGTA GTTATAGTCA GAAGCTTTCG 
1897 01 GCCCTAGAAG TTGTCCTAAG AATTGCGGTA GTCCTGCTTT ACCTGCATTC 
189751 GTGGTTCCGT TTAGATTTTC CCATTTTGCT GAGCGGCATT CTCCTAATTG 
189801 TAGGGGCTGT TTTGAACGGA GAGGATCCGG AGATTTTTTC GAATTTTCCC 
189851 AACAGCGTAC ACTAGTTGCT TTCGCAAGTG CGAGACTTGT TCCTTTTACG 
189901 TTGTTTGCCA TGTTTGGGGT GGCTGCATTC ACAATCATGA CGATGCCTTG 
189951 AGTTTTATAG CGAGGCTTTT CAATGGGCCC TAACGTAGAG ATCAAAGTCA 
190001 GGTTTGAGTT TTTCAAATGC CAAGCATAAC CGGTTTGATT GTCAGCAGCG 
190051 ATGAAAGGCA TATCTGTGTT AGCCTGTAGA TCAGGTATAC GGTCCCAATT 
190101 TTCTTGTAAC AGTGTTTGAT GGGTTCTAGG GGAAGGAAGC GGGGGCTCTT 
190151 CTAGGACAGC TTCCGTAGGC ACTGGGGTGA GTGCTAGGGG CGGGACAGGA 
190201 GGAAGGTCTG TATCTTTTAT GATGGGCGTC GGCTGATGCG TTACGCTTAT 
190251 AGGACTTTTA GGTTCTCTTA AGAGAAAATA GAGAAGAATA CTAAAAGCCA 
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190301 . AGACTGCTAG AGCGGTAAGA ATAAATAATA GAGGAATGCC TAAAATTATA 

190351 GCAGCGGTTC CAGAACTTAC AGCAATTCCT ATAATTAGAA TAAAAGCGAC 

1904 01 AAGGTCCAGA CCTTTGAGCT TATGAGATGC GAGTTGCGTG GGCATGCTTT 

1904 51 CGCTGCTGAT CAGCATGGTT AGATTTACGT TAGTCAAATT GGGTTCTGTG 

190501 GTGGACATAA AACTTTTTAA ATAAAAAACA AAAAGTTTAA AAAAGATTCT 

190551 TTTTTATGTG AAGTTATTTA TATTTTAAAT AGAAGTTGTT TATTTAAAAT 

190601 AATAAATAGA CAACATTTTC ACTTTAAAAA GTATTGGATG CGGCTTAGAG 

190651 CCAAGAATCT AGGGGGTGCT TGTGATCAGG ATAAATTGGC TGTGTTTTTA 

190701 GAAAGCTGTT CGATTGGAAG CACGGACGCT TTCCGACATA GACTAGGGAG 

190751 TCGTTAGATC GAAACGGGGT GGAATGATGG GAGGCTGGTC CTTGTCTGTA 

190801 AGGATGATAA TTTTTTGCTC TTCCTGGTTG TCTTGTTCCC ATCCAAAATC 

190851 TTGAAGAGAT GTGATGAGTG CCAATTTTAT AGCCTCGATC CATTCTGCTC 

190901 TTGTTTTCCC GAGGTTTAGA AGTCTTGATG GAGCAAATAG ATTACATCCA 

190951 ATGAGGGGGA GTTGAATCAC ATCAACGCCT ATGATTTCAG CTTCTTGGAA 

191001 CAGGTTATGA AACGCTTTCT TGCTTACTTC AAATGCTTGC TTAGGATCGT 

191051 TGTTACACTT AGCAGCTTCG GGACCAAGCA GTTGTGCTAA GAAGTGTGCT 

191101 TTGCCTGGGA CATGGTCGTT TGAGGTGTGA TCACTATTTT CCCATTTTGC 

191151 TGAGCGGCAT TCTCCTGGCT GTAGTTGGGA TCCAGAACGA GAGTGCGCTC 

191201 TAGGGAGCCT AGATGCGTTC CAACACTGTA GACTTGTAGC CAGGGATAGA 

191251 GCTTTATTCG TTCCCCCTCC TTCTCGGGAG ATGTTCTCGT TTGCTGCGTT 

191301 AACAATCATC ACCCTGCCTT GAGTTTTGAT TCTTGGAACT GCAATATCTC 

191351 CTGAGGTAGA TATAAAGATA AGCTTCGAGT CTTTAAGCTT CCAAATAAAA 

1914 01 CAGGGCTGCT GAGGTGTTTC TGTAGTAAAC GATGGGTCTA CAGCGGCTAA 

191451 GCTCGGAAGA AGATCCCAGT TTTGTTCAAG AAGAGCCTCA TAGCCAGGAG 

191501 TGACCGTGAT AGCGGTAAGT TCTTCTGGGG GTGTGGTTAG ATCAGCAAGA 

191551 GAGACTTCGG GGAGTGATTC GGGATCGGGA AGTGGATCGA TCTGCAAAGG 

191601 GAGGTCCTGT TGCTCAGGAG GCTGGGGAGA GGGAGACAGA GAACTTTGCT 
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191651 CAGATTCGGG TTCGATTTGA GGAAGAATCT CGTATAATTT AGGTTTCTTT 

191701 AAAAGGGAGT AAAGTGAGAC CGCAGCAATT GCTAATGTTA TGACAACTAC 

191751 GGCAGAAAAA ATAGGCATAG CCAAAATGCC AGCAGCAATG CTAAAGCCCA 

191801 . AAGCGCACGC TAGGGCTAGT GTGAGGGTGA CAATTTTCAG TATTTTCTCG 

191851 ATTCTATCTG TACGGTTGAG AGGGAGTCTA ATCGGATAGG AATGTTTCGC 

191901 AGGAGTTCTG TAGAGACTGG CGTCTGTATA AGAGGGTAGG GGATTAGAAT 

191951 CTGTCATAAT ATTTTATTAA TAGATCAATT TTCAACTATT ATATTACTAA 

192001 TTTGTATTTT TATTAGATTT TTTTATAAAA CAAAATTAGT TTATTAATGC 

192051 ATCCTATTGA AATAGATCTT TTTAGTTAAA AGGGACCATA AGTAGCTGTT 

192101 TGTGGTCTGT AAGGATTATA GTCATGGGAG TTGAGGGGTG CTGCGCAGCA 

192151 AAGGAGCGAA GAGCTTCCAT AAGACCTTTC TTTATATCAT TTACCCACTA 

192201 CGTACGGATG TGGTAAAGCT TATATGCACT GCTATTAGGC TTTGTTTGGT 

192251 TTACGGGTTC TAGTTCCAGC TTTCCTCCAG GTGAGTATAT AGAGGAAGAG 

1923 01 ATCAGAGGCA CTTGGACCAC AGTGGCTTGG TTATTGAGAG CTTCATCAAA 
192351 ACAGTTCAAA TAGGCTTTCT TAATAACATT GCTTAATTTC TCAGGATGTG 

1924 01 CTTTCAATTC TCCTTCATAT TTAGGACCAA GAAGTTGTGC TAAGAAATGT 
192451 GCTTCTCCTG GGTTCGTATC ATTTATTCGT CCAGTCTCTA TTGATCCAGG 
192501 GTGCTGAGCG GCATTCACCC ACAGATAATC CTTTGCCAGT GTTTATTTTT 
192551 CCCCCAGATG TTCTCGTATT GTTCCAACAA GTAGGGTGTG TGGCTGCTGA 
192601 GAGAGCAGCA TTGGTTCCGG CTCCACCAGA TTGCATGTTC GAATTCGCTG 
192651 CGTTAACAAT CATGACTCTT CCTGAGGTTT TCAGGCGTGG TTTAGCGATG 
192701 TCTCCTGTAG TGGCTACTAA GGTAATTGGG GCTCCTTGAT GTTCCCAGAC 
192751 ATAGTATCTT 'TGATTAGGAT CTTGGAGAGT CCAGGATATA TTGATTTCTG 
192801 AGAGAGTATT GACTAGGGTC CATTCATTTC TCAGCAGCTT TTGGTAATCC 
192851 AGAGGGACTG AGGGTTGCAC TTGAGCTGCA ATATCCTTGG GGACATGTTT 
192901 TTGGGGCCCA AGGAGCTCTT CTGTCGGTTT GGATGGCTCT CTTCTTCTTA 
192951 AAAGAAGACA AGAGAGTACG ACTGCTGCAA GGAGAGCAGC ACCAGTGGCT 
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193001 ATAGCCATGA GAGGCATACC AAGAACTCCA GCAACAACTG CAGTCCCTAC 

193051 AGCTATGGCT AAAGCAAGAA TAAGAGCTGT GAGTTTTATT ATTTTAGCAA 

193101 TCACTTCTTT CTTAGTAAGC ACGGGTGATT CTGAGATCAA TCTCTCATCT 

193151 ACAGACAAAT GTGATTGATT CTCGTGTACA TCTGGGATTG AAGTGGTTTT 

193201 AGTCATAAAA TATCTATTTA AAGGAAAAGA TTATATCTTA ATTTGTTGTT 

193251 TTTTACAAAA AATCTTTTTC TTTGATAATA AAGATTTTTA TAAATTCTTT 

193301 TTAATTTCAT TAATTTTAAT GAGTTAAGGC GTAGCAGGCG CACCTTATTG 

193351 TTTAGTTCAT AAGAGAACTC TAAGTGGGTG CCACACAGAG TTTTTTCTCC 

193401 ACTTGGTTCA AAAGGGTGGC GTTTGTGGTT TTATGGGGAA CGATTCTACC 

193 451 CCAGCCATGG GATTCAAAGA CTTCTTAGAT CGACAATACT ACTTGAGATT 

193501 CTAATGGGGG GTGAAGACAT GTGTATTTCA TGGGCATGGG TAACGGAGGA 

193551 CATCAGGTTT ACTTTCGAAG GCCCATATAT CATCTTCAGA GGTCAATAAA 

193601 AGGGGGGGGT AGGGAGAACG ATGTTTTACA ACAGTGTAGA AATTAACTAT 

193 651 TTGTGAAATA TTTTCATTTA ATAACAATTT AATGAGTTTT AATTTGAAGA 

1937 01 CACAGGCAGT GGCTTATCTT TTGTCTATAA GTCGGTACAG TCAGACTTAT 

193751 AAGAATTGCC GCTTTTTGGG TCTTCATAAT CTCTATCTTA AACAAAAAGG 

193801 AGGTCAATCA CAAACTGGGT GAGTTGTAGA GGTCTCAATA AGAAAGTTTG 

193851 TAAGGTTAGG AGAGCTATAA TTGCGCCTGG GATTCTGCAT AGACGATACC 

193901 TGAAAAGATA GTGCCTAGGG TCGGTGTTTG CTGATGAGAA TGCCAAGAAG 

193951 CATAGTCATC AACCGTCTTA ACATATTTAT AAGTTTTCTT AGCAGCGAAT 

194001 GGCACAGCTG CTAGAAAGCA AAACAAGATG ATAACAGCCA CCAATAGAAT 

194051 TGTTCCTGAG ACAGCAAGGA CTGCAATGGC AGCTGGATGT CCAGTTTCTA 

194101 GAGTGAATTG CAGAGCAGGG CGGATTGCGA ATAAACTAAG AACTATGAGG 
194151 CAGCATCCGA TCACTAGGAG AACTTCCAAT ATGACAGAAG TGATTGTCCG 
1942 01 TGGAGAAGAC TCAGCTTGCT GCTTCCCCTT GAGTTCTATG AGTTTCTGAT 
194251 TTTGTGTGTT TGCGCTATTG ATTAAGGGAG CGGCTGGACT TTCAAAAAGA 
194 301 ACGTCCGTAG CAGATACTGG AAGATATCCC ATTTATTTTA ATATCCTTTA 
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194351 ATTAAAATTT GCGCAGATAT TATACCTTAA TTTAATTTTT ATTAGAGAAT 

194401 AAAATTCTAT ATTTTCTTTT GTGTAATATT TTTTTATAAA AACATGAGTT 

194451 TTTAATTTAA CTTAGAAGAG AAGTGTTGCA ATATCTTTTA TTAAAACAAG 

194 501 AGTACCTAAG AATTCGTGCG TAAAATATTA CGTATTTGAT TTGTATAAGT 

194551 CGTGAGAGTG TTTTTTCTAT TATTGAGGGG CTATTTTGAT TTTTCTTGAA 

194 601 TTGCGTCATG GAAAGATCTT GGAGGCTTTA GGAGCCTTTT TGTCCCTCAT 

194 651 CATCTCGAGA TAACCAAACT GAAAAGATGC GTACCAAGAG TTTGTTTTTT 

194701 CGTTCGCGAA TCTACTGTAC CCTTGATTTT AGGATAGCCG TTTTGAGTGA 

1947 51 GATAGAGATC CTAAGAGAGA TCCTCAATAC TTTCTTGAGG TAGTTGGGCT 

194 801 AAGGATCTTT CAACACAAGC AATAGTTAGC GGATATTCTG GGTGCAGCTC 

194851 GGCCATGTAT TTTACTGCTG TAATCATAGC CAAAGATTGT TGCATTCTTG 

194901 ATTGCGGGGG CAATTCTCTA GAATAGAGAT GCTCCCTCAA GCTGTATAAA 

194951 GGGATTTGGA TAATTTGGGT GTTCGTATTG ATCGCAGTCT GGATATAAGT 

195001 GACATAGAGT GCATAGTAGT ATTGGTAGAT GGCTTCAGGA GTTGTTGGTA 

195051 TTTGTGGGAG GAAGACGCGG CCTCGGAAAT CGGGTTGCCA GTATAAACGT 

195101 TCTCCTGAAT CTCCCGCAGG GAGTTCTGTA TTTATTTCTA CCCTGGTTTC 

195151 CTGAATTGCC TCATTGAGCA AGGATTTTGG GGAGGGAGTG GTATCTACGT 

195201 AGGAGGATAT AGGCGCATTT TGCGAGGTTT CTTCAATTAA CAGAGTCTTT 

195251 CCTAAATTTT CGGAAGCTGA AATTAAGAGA TGTTCTGGAT TGCCTCCCGG 

195301 TAGGAGGGAT ACTAGGGAAA ACAGTGAATG TTTAGCTTTC CATATCTCGT 

195351 AAGGGTTATC TGTGAGTAAT GCAGAGGGTG CGGGGAGATT CTGTAGGTCC 

195401 TTGGGTTGGT GTTCCCAGTT TTCTTTAGAA CGATTGATTG CGATAGAGAT 

195451 AGAAGAATAC TGTAGGTTGG TGGAGACCCT TTTTGGTTTT ATTTCTTTGT 

195501 GTTTGCCACA AGCACGCGGT TTTAAGAGGG TGGTTGCTCG TTTATAAAGG 

195551 ATAAAGTTGC TAAGAACAAC AGCAATAAAT GAAATTCCTG TGAGGAAATA 

195601 GATAACAGGA ATATTTAAAA GCAAACCAAT AAGTAGGGTC CCTATAGTAA 

195651 CTAAGACAGC AAAGATTGCA GTAATCAATA GAATTCGAGA CTGAATCTTA 
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195701 TCAAAGTTGC TGCAGGATTT TTTTTTAGTG TCTACTAGGG GATGATCTAC 
195751 AGTTAAAGAT ATGGGTGAGA TTGTAGCCAT AATTAAGGAA TTCACCTGAG 
195801 TTCCTTTAAA TTATATCTCT TTTATTGATA TTTAAATAAT TAAAAACATT 
195851 TCTGCATTTA AAAATATTTT TTAATTTGTC ATCTTTCTAA GTAAATATAA 
195901 AAAATAATAA GTTTATTTAG TTAAATTTTT TTTGAAAACA AGAAAGATAG 
195951 GGGAAAGGCT CGCGAGTAAA TCAAGCCTTT CCTGAATGCA AAGAGTCGCA 
196001 TAACTACTTA GAAAAGAGTT CGTCAGAATA GAAGTTGTAT TAATATAAAA 
196051 TTTTTATTTT CTAAGATTAG AAAGTAACTT TGACACAGCC ACCTTTGATG 
196101 CGGCACTGAC AAGCAAGACG TTCGTTAGAG TCTTCGGGTT CTCCTAGAAA 
196151 ATCGTATTCT GGTTCCGTAA ACTCAGAAAG ATTCTCACGT CCTTCTAAGA 
196201 CCTCTATCAC ACAAGTTCCA CAGACACCTT CTGTACAAGC AAAGGGAATG 
196251 CCCATGGATT CACAAGGCTC TGCGATCTCA CTATTGTCTT CTAACTCGAA 
1963 01 CTCTTGTTGT TCATCATCAG AGGTAATGAC TAGCTTGGCC ATGGAGTTTT 
196351 CCTTCTACGT ATTGTCGGAT AGATTAAAAT GAAGTTCGGA GTAGAGGGAT 
196401 TCGAACCCCC GACCTATTGC TCCCAAAGCA ACCGCGCTAA CCAAGCTGCG 
196451 CTATACTCCG TAAAATAAAA TTTTCGAGTA AAAGAGATCA ACGTTAGCAT 
196501 AGAAGGAAAT AGTCGACAAG ATCAAAGATG CTTCATAACG TCTCTTTAGA 
196551 GTTTTACTTG CAGATATATG GAAGAGGGAA GTATGAGAAA AGGCAGGGAT 
196601 TCTCTAGGAG GCTGTTTTTG TGTCTGGGAA GAAAGATGGT GTAAGGGGAA 
196651 TGATCTTTGT CCCTCTTAGC ATCCTAGTAC TAATCTTTTT ACCTCTTCCT 
196701 CAGATCCTTC TTGATTTTGG ATTGTGTATT AGTTTTGCAT TGTCTTTACT 
196751 AACGGTCTGT TGGGTCTTTA CCTTAAATTC AAGCAATTCA GCGAAGCTTT 
196801 TTCCTCCATT TTTCTTATAT CTTTGCCTAT TGCGGTTGGG ATTGAATCTT 
196851 GCATCAACAC GATGGATTGT CTCTTCAGGA ACCGCCTCTT CTCTGATTGT 
196901 TTCTTTAGGC AGTTTCTTCT CTTTAGGAAG TCTATGGGCA GC AACGTTTG 
196951 CGTGCCTCCT TCTTTTCTTT GTGAACTTTT TGATGGTTTC AAAGGGTTCG 
197001 GAAAGAATCG CAGAGGTCCG TTCGCGGTTT TTCTTAGAGG CTCTTCCAGC 
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197051 AAAACAGATG GCTTTAGATT CTGATCTTGT TTCTGGAAGA GCTTCTTATA 

197101 AGGCTGTCAA AAAACAAAAA AATGCCCTTA TAGAAGAAGG GGATTTCTTC 

197151 TCTGCCATGG AGGGGGTCTT TCGTTTTGTT AAAGGGGATG CAATTATTAG 

197201 TTGTATCCTT TTACTCGTGA ACGTAGTTTC TGTAACTTGT CTTTATTATA 

197251 CTTCGGGTTA TGCTCTTGAG CAGATGTGGT TTACAGTTTT AGGAGATGCT 

197301 TTAGTGAGTC AAGTACCTGC TTTACTTACT TCGTGTGCTG CAGCCACTCT 

197351 TATTAGTAAA ATCGATAAGG AAGAGAGCCT TTTAAATTAC CTGTTCGAAT 

197401 ACTACAAACA GTTGCGTCAG CATTTCAGGG TGGTGTCGTT ATTGATCTTT 

1974 51 TCTTTGTGCT GCATTCCCAG TTCTCCAAAA TTCCCTATCG TTTTGCTCGC 

197 501 GAGTCTTTTA TGGTTGGCGT ATCGAAAAGA AGAGCCTGCA TCAGAAGATT 

197551 CTTGTATAGA ACGTGCGTTC TCTTATGTTG AGGGGGCCTG CCCTAAGGAA 

197 601 CAAGAATCAC AGTTCTATCA AGTATATCGT GCAGCATCCG AAGAAGTATT 

197651 TGAAGATTTA GGAGTTAGAT TGCCTGTGCT TACTTCTCTA CGTATTGAAG 

197701 AGCGTCCTTG GCTCCGAGTA TTTGGCCAGA ATGTATACTT AGATGAAATG 

197751 ACTCCAGAGG CTGTGCTTCC TTTCCTTAGA AACATCGCTC ATGAGGCTCT 

197801 CAATGCCGAG GTAGTTCAAA AGTACCTTGA GGAATCAGAG AGAGTGTTTG 

197 851 GCATCGCTGT TGAAGACATC GTTCCTAAGA AAATCTCTTT AAGCTCTCTT 

197901 GTAGTTCTTT CTCGCCTCCT TGTTAGAGAA AGGGTATCGC TTAAGCTTTT 

197951 CCCAAAGATT CTAGAGGCCG TTGCGGTATA CCAAAATTCT GGAGACAGCT 

198001 TGGAGATCCT TGCGGAAAAA GTGCGAAAGT CTCTCGGATA TTGGATTGGG 

198051 AGAAGTCTCT GGGATCAGAA ACAAACCCTT GAGGTAATTA CCATAGATTT 

198101 TCATGTTGAA GAATTGATAA ACAGCTCATA CTCAAAGTCT AATCCTGTAA 

198151 TGCAAGAGAA TGTGATCCGT CGAGTAGACA GTCTTTTAGA ACGGTCGGTA 

198201 TTTAAAGATT TTCGAGCCAT AGTTACGAGC TGTGAAACAC GATTTGAGAT 

198251 GAAAAAAATG CTCGACCCAC ATTTCCCTGA TCTTTTGGTT TTATCTCATG 

198301 ATGAGCTTCC TAAAGAAATC CCTATTTCCT TCTTAGGGAT CGTTTCAGAT 

198351 GAGGTTTTAG TTCCTTAATT TAATTTAATC TTCTGTAAGC ACTATTACTT 
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198401 TCGTATTTTT TTTGTTGGTG TAAATATTGT TTAAAAATTT TTTTTGAATT 

198451 AATCTAATAA ATAACTAGAT AAAAAAAAAT TTGTGAAAAC ACAGCAAACT 

198501 CAAAACATCA TAGAGGTTTG GAACTTCTAC TGGGAGACTC AGGAAATAGA 

198551 GTATCGCGAT AGCTTAATTG AGTTCTATTT GCCTTTAGTA AAAAGTGTGG 

198601 TTCATCGTTT GATTTCAGGG ATGCCTTCCC ATGTAAAGAC CGAGGATTTG • 

198651 TATGCTTCGG GTGTTGAAGG TCTCGTCCGT GCGGTGGAAC GTTATAATCC 

198701 TGAGAGAAGT CGTCGTTTTG AAGGTTATGC GGTATTTCTG ATTAAGGCTG 

198751 CCATTATTGA TGATCTGCGT AAGCAAGACT GGGTTCCTCG TAGTGTCCAT 

198801 CAAAAAGCGA ATAAATTGTC AGGAGCTATG GATTCTCTTC GCCAGTCTTT 

198851 AGGCAAGGAA CCCACGGATC TTGAACTGTG TGAGTATCTC AATATTTCGC 

198901 AACAAGAGCT TTCGGGATGG TTTGTATCTG CCCGTCCTGC ATTAATCGTG 

198951 TCTCTGAATG AAGAGTGGCC TTCACAAAGT GATGAAGGAG CCGGAATGGC 

199001 TCTTGAAGAG AGAATCCCCG ATGAACGTGC CGAGACAGGG TACGATGTTG 

199051 TAGATAAACA AGAATTTTCT TTATGTTTAG CCAATGCGAT TCAGGAACTT 

199101 GAGGAAAAGG AACGCAAGGT CATGGCCCTG TACTACTATG AAGAACTTGT 

199151 CCTTAAGGAA ATCGGTAAGG TCCTTGGGGT AAGTGAGTCT CGCGTCTCTC 

199201 AAATTCACTC TAAAGCATTG CTTAAGCTTC GCGCAGCACT CTCTGCATTT 

199251 CGATAAATAC AGTTCTCAGG TTTTAAGAGC AGTCCTAGAG CTAGGAGAAG 

199301 CCCTCCTACG ACACAGAGTA ATCCGTAAAG AATTTGTTTA GTCAACGCCA 

199351 TAACAATGCA GCCAGCAATT GTGAAGATTG CTCCTACAAT CAAGAGAAGA 

199401 ATGCGGGGCA GCCATTCTTT AATACGTTCG GCTAGGGTAG AGGTAGGAAG 

199451 GGGTGCCTTG CTAGTTGTGG CATTTAGACT AGTTGCGGCG TTTAGCATCG 

199501 ACGGAACCCC AATTGGAGAT GTAGACATAG CGACCTAATC TTTAGAGAAG 

199551 ATAGAAGTTA TGGTATCTCA GCAAAGGAAT TATTCTCAGC GTACAATCTT 

199601 TTCTTTATTG TTAGATAGGT GGTGCCTTGC ATATAAGCTT TAGGATCGAT 

199651 AAGATAGAGC TTGGCAATTC TTAATTATGT ACGATCACTC ATGCAATCCT 

199701 GGTTACAATC TTTACAAGAG CGAAATATTT TAGAGAATTT TACCGCAGGT 
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199751 TTGGAATCCG TAGAGGGACC TATCGCCGCT TATTTAGGAT TTGATCCTAC 

199801 CGCACCTGCT CTACATATTG GTCATTGGAT TGGGATTTGT TTCTTGAAGA 

199851 GACTCGCTGC TCTGGGGATT ACCCCCATAG CTTTAGTCGG GGGAGCCACA 

199901 GGTATGGTTG GAGATCCCTC AGGGAAACAG AGCGAGAGAT CGTTACTTCA 

199951 GACAAGTGAA GTTTTTGATA ACAGTCAAAA GATCACGGCG TGTCTCCAGC 

200001 GCTATCTTCC CGGGGTGACT CTTGTAAATA ATGCAGACTG GTTGCAGGAG 

2 00051 ATCTCCCTGA TTGATTTCTT AGGGGATATA GGAAAACACT TTCGTTTAGG 

200101 CCAAATGCTA GTGAAAGATA CAATAAAGCA GCGGGTGCAT TCTGATGAAG 

200151 GAATTAGCTA TACCGAGTTT AGCTATTTAA TCCTGCAATC CTATGATTTT 

200201 TATCACTTAT TTAAAAATTA TGGCACGATC TTGCAGTGCG GTGGTAGCGA 

200251 TCAGTGGGGG AATATTACTT CAGGAATCGA TTTTATTCGC CGTAAAGGGT 

200301 TGGGTCAGGC CTACGGCCTT ACCTATCCTT TATTAACGAA TGCTCAGGGG 

200351 AAAAAAATAG GGAAAACAGA GTCGGGAACT GTATGGCTCG ATTCAGATTT 

200401 AACCTCTCCT TTTGAGCTGT ACCAATACTT ACTCCGTTTG CCCGATGATA 

200451 CCATCCCTAA AATTGCTCGT ACGTTAACTT TATTGAGCAA TGAAGAAATT 

200501 CAAGATATTG ATAGGCGTGT ACAGACGGAT CCAGTTGCAG TGAAGGAATT 

200551 TGTAGCCCAA GATATCTTAA GTGCTATTCA TGGAGATCTA GGGCTTGAAG 

200601 AGGCTCTTTC TGTAACTCGT AGCATGCATC CAGGGAATCT TTCATCCTTA 

200651 TCGGAAAAAG ATTTTCATGA ATTGTTTGCA GGAGGGATGG GGGCCTCATT 

200701 GGATAAATCC GAGGTGTTAG GGAAACGTTG GTTAGACCTA TTTCTTGTTT 

200751 TGGGACTATG TAAATCTAAA GGGGAAATTC GAAGGCTAAT TGAACAAAAA 

200801 GGGGTATATA TTAATAATGT GCCCATCGCT AATGAGCATA GTGTTTGTGA 

200851 AGAACAAGAC "ATCTGTTATG GTCACTATGT GTTGTTGGCT CAAGGTAAAA 

200901 AACGAAAGCT TGTTCTATAT TTAAATTAGT TTAAGGAGGC TAGGTAGCTT 

200951 TGCAAACGAA TATTGGTCTT ATTGGCTTAG CTGTCATGGG GAAAAATCTT 

201001 GTCTTAAACA TGATAGATCA TGGTTTfTCT GTCTCTGTCT ATAATCGGAC 

201051 CCCAGAGAAA ACGCGGGACT TCTTGAAAGA ATACCCTAAC CACCGAGAGC 

149 



WO0027994 ffile:/A\dcwas03\firmdata\lp\Fole yPat \PatentDocuments\WO0027 994.cpcl 



Page 277 of 330 



WO 00/27994 PCI7US99/26923 

2 01101 TTGTAGGGTT TGAATCTTTA GAAGATTTTG TGAATTCATT GGAGAGACCA 

2 01151 CGAAAGATCA TGTTGATGAT TCAAGCAGGG AAACCTGTGG ATCAGAGCAT 

201201 TCATGCGTTA CTGCCTTTTC TAGAACCCGG CGATGTGATT ATCGATGGGG 

201251 GGAATAGCTA TTTTAAAGAT TCCGAACGAC GATGTAAAGA GTTGCAAGAA 

201301 AAGGGGATTC TCTTCTTAGG CGTGGGGATT TCTGGAGGAG AAGAAGGTGC 

201351 ACGTCACGGC CCATCAATTA TGCCTGGAGG AAATCCTGAG GCGTGGCCAT 

2 014 01 TAGTGGCTCC TATTTTTCAA TCAATAGCAG CAAAAGTACA GGGCCGTCCC 

201451 TGCTGTTCTT GGGTAGGAAC TGGCGGTGCA GGCCACTATG TAAAGGCTGT 

201501 TCACAATGGT ATAGAATACG GCGATATCCA GTTGATATGC GAAGCTTACG 

201551 GTATCTTAAG AGATTTCCTA AAGCTCTCCG CAACTGCCGT TGCTACAATT 

201601 TTGAAAGAGT GGAATACTCT AGAGTTGGAA AGCTATCTAA TTCGTATTGC 

201651 TTCTGAAGTC CTAGCATTGA AAGATCCGGA AGGAATCCCT GTTATTGATA 

201701 CGATTTTAGA TGTCGTGGGC CAAAAAGGTA CAGGAAAGTG GACCGCAATC 

201751 GATGCTTTAA ATTCTGGAGT TCCCCTTTCC TTAATCATAG GAGCTGTTCT 

201801 TGCTCGTTTC CTTTCTTCTT GGAAAGAGAT ACGCGAGCAA GCTGCCCGTA 

201851 ATTATCCAGG AACCCCCTTA ATATTTGAAA TGCCCCATGA TCCCTCGGTA 

201901 TTCATACAAG ATGTCTTTCA TGCTTTATAC GCTTCCAAGA TCATCAGCTA 

201951 TGCTCAGGGA TTCATGCTTT TAGGAGAAGC TTCAAAAGAA TATAATTGGG 

202001 GATTAGACCT AGGAGAAATT GCTTTGATGT GGCGCGGGGG ATGCATTATT 

202051 CAAAGTGCAT TTTTAGATGT TATACATAAA GGATTTGCTG CCAACCCAGA 

2 02101 GAATACCTCG CTCATCTTCC AAGAATATTT CCGTGGAGCA TTACGCCATG 

202151 CGGAGATGGG ATGGCGTAGA ACAGTAGTGA CTGCAATTGG TGCAGGGCTA 

202201 CCTATTCCCT GTTTAGCAGC AGCAATCAGG TTTTATGATG GCTATCGTAC 

202251 AGCAAGCTCT TCAATGTCGT TAGCTCAAGG ACTGCGAGAT TATTTTGGAG 

202301 CTCATACCTA CGAGCGTAAC GATCGCCCTC GAGGAGAGTT CTATCATACC 

202351 GATTGGGTGC ACACGAAAAC TACAGAAAGA GTGAAGTAAA AATAAAAAAT 

202401 CTCGAGAAGA CCCTAGGTAG CTCGAGATTT AGTTCACCAC TACCGAATTT 
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2024 51 TCAATTGTAA GCAGTGTGCT GATTTTAAGC GTCAATGTTA ATCTAATTTT 

202501 AGAACTTCAA TGAAAGCTGT ATTGGGAATG GAAACTTTTC CAAATTCCTT 

202551 CATACGTTTT TTTCCTTTCT TTTGCTTTTC CCACAGCTTG CGTTTCCTAG 

202601 TAATATCTCC GCCATAACAC TTTGCGGTCA CGTTCTTAGA AAGCGCACGA 

202651 ATCGTTTCTC TGGCAATGAC TTTTTTGTTA ATGGCAGCTT GGATGGGAAT 

202701 CTTGAAGAGT TGTTGTGGAA TCACGTCCAC AAGCTTTTCG CAGATACTTC 

202751 TTCCACGAGA TTCTGCTTTA TCTCTATGGA CTAAACAAGA AAAAGCATCT 

202801 ATGGGCTCCT CGTTAATAAG AACCTCTAAT TTGATGATCG ATCCCTTACG 

202851 GTAATCCCCA AGACGGTAGT CAAAGGATCC ATAACCTTTA GTTACTGACT 

202901 TCAGCTTGTC ATTGAAATCC GAGACAATCT CATTTAAAGG GAGTTCGTAA 

202951 GCAAGAACTA GACGGTGCTG ATCTAGCATT TCTGTTTTTA CGCAGATCCC 

203 001 ACGTTTATCT AAACAGAGGT TCATAATGTT GCTCAGATAT TCTTGAGGGG 

203051 TGATAATATT CACATGAACC CAAGGCTCTT CCACATGCTC GATGATCGCA 

203101 GGATCCGGAT ATCCTGAGGG GTTATCAATA TCTAGAACTT TCCCGTTTTT 

203151 TAAGACGACT TTATAGATGA CACTTGGAGC CGTTGCAATA ATATCTAAGT 

2 03201 CAAATTCTCG AATGATTCTT TCAAAGATAA TCTCAAGATG AAGAAGTCCT 

2032 51 AAGAAGCCAC AACGAAAACC AAAGCCTAAA GAGTGACTGC TTTCTTGTTC 

203 301 TATAGTTAAA GCAGAATCAT TGAGCTGTAG TCTTCCTAAA GCATCTTTCA 

203 3 51 AAGTATCAAA ATCAGAAGAA TCTATAGGAT AAATTCCAGC AAAAACTACC 

203401 GGATTGATCT CTTTGAAGCC TTCCAAAGGA GTTTTTGCAG GATGTTTTGT 

203451 TTTCGTGACT GTATCGCCGA TCTTCACATC CTTCACTTTT TTGAGATTGG 

203501 CAATAAAAAA ACCCACCTGA CCAGGGCGTA AGGAACCTTC TATAAATGTT 

203 551 GCTTTAGGGA GAAAGGCCCC TATACCTAAG ACTTCAAACG AGGAGCCTTT 

203 601 AGCCGCCATA AAAGTAATGC GGTCTCCTTT TTTTAATTCC CCGCTAATAA 

203 651 TGCGTACGTA GACCATAATG CCAACGTAAG GGTCATAATG AGAATCAAAG 

203701 ACTAAAGCTT TAAGCTCTGT TTCTGCAGGT GCTTTTGGAG GAGGAACAAG 

2037 51 ATCGATAATT GCTTTCAGGA TTGCAGGGAT CCCCTGACCT GTTTTTGCAG 
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203801 AACAGGCAAT AATGTTCGTA GTGTCTAGGC CTATATAATC TTCAATCTGT 

203851 TGAGCAATTC TCACGGGATC AGCGGCAGGT AGATCAATCT TGTTTAATAC 

203901 AGGAATGATC TCTAAATCTC TTTCAAGGGC CAGGTAGACA TTAGCAAGAC 

203951 TTTGTGCCTG CACCCCCTGG GCGGCATCTA CAATAAGTAA GGCGCCCTCA 

204 001 CATGCAGATA GAGATCGAGA GACTTCATAC GAAAAGTCCA CGTGACCAGG 

204051 GGTATCAATC AGGTTCAGTT GATACACCTC TCCTTCATAT AGATACGTCA 

204101 TGGTGACAGG ATGAGCTTTA ATTGTAATGC CACGCTCTCT TTCAAGATCC 

204151 ATGGAATCTA AGAGCTGCTC ACGCATCTCC CGTTCTTCTA CTGTGCTCGT 

204201 ACTTTCTAAA AGGCGATCAG CAATTGTAGA CTTCCCGTGA TCAATATGCG 

204251 CTATGATTGA AAAATTGCGA ATGTTCTCTA TCTTATATTC TTTCAAAATG 

204301 TACTGTAGTG TTATCTAGGT TTATTCCTGG TTTCATAAAG CTGCATTGAG 

2 043 51 AAGGCTCCTT CAGACAAAGA CCAGCTTCTC AAATACTTTT GCGGACTGCA 

2 04401 TCTCTTTATT AGCAAAATGA TCATGCGGTC ATCTTTAGCT TTGGATAGAT 

2 04451 GAAAGTATAA TAGATGAGAT GTAGAAACCA CAAGGGTTTA AAGTCGAATC 

204501 TGATTTGAGG TAGAGTTCCA GAGTGGCTAG CAGTAGAAAA AACTAAGTGA 

204 551 GAAGAAGTGC TCTCCGTTAG TATGAAACTG ATTCCCACTC AGGATTCTAf 

204601 AGAAAGGGAA ACCGATTCTA AAAGAGATAA AAAAATATTT ACCATTTACA 

204651 TATGTTCATC TAAAGTCCTT GCGGGTCATT TTTTCAGTCA TTTAGACAAG 

204701 CATAATAAAA TTCATGAAAG CATTGGGGTT TGAGATAGTT AAATCGACTC 

204751 GATCCAAGAG TATAAGAGAG GAATGAGTGT TCTTATGTCC GAGATAGTGC 

204801 TCTTCAATGT AGTGAAGAAC TGGAAAGTCA GGAGATCTAT TTGAGTAGGA 

204851 GTTCAGGTCT TCGCAATCGA TTTTTTCAAG CTCTGGAGTC TATGTGAAAG 

204901 AAATTTATAG AAGTAAAATA ATTCCATGAA TTCTAAAATG ATTAGGAAAA 

204951 TAATATAACA CGCTGATACT CAAGTCACAA TGGAGCTTCT ATGGTTAATA 

205001 TACAGCCTGT GTATAGGAAT ACCCAAGTCA ACTATAGTCA GGCTACCCAA 

205051 TTTTCGGTGT GCCAGCCAGC GCTTAGCCTG ATTATCGTTT CTGTTGTTGC 

205101 TGCTGTACTC GCTATTGTAG CTTTGGTATG CAGTCAATCT CTTTTATCCA 
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205151 TAGAGTTAGG AACTGCTCTT GTTCTAGTTT CTCTTATTCT TTTTGCTTCT 

205201 GCTATGTTTA TGATTTATAA GATGAGACAA GAACCTAAGG AGTTGCTGAT 

205251 CCCTAAGAAA ATCATGGAAC TCATCCAAGA ACATTATCCA AGTATTGTTG 

205301 TTGATTTTAT TAGAGATCAG GAGGTTTCCA TTTATGAGAT ACATCACTTG 

205351 ATCTCTATTC TTAATAAGAC GAATGTTTTC GACAAAGCAC CAGTATATTT 

205401 ACAAGAAAAA CTCTTACAGT TTGGCATTGA GAAGTTCAAA GATGTACATC 

2054 51 CAAGTAAGCT CCCTAATTTT GAAGAAATTC TTCTACAGCA TTGCCCATTG 

205501 CATTGGTTGG GACGTCTGGT ATATCCCATG GTATCGGATG TCACTCCAGG 

205551 AACCTATGGA TACTATTGGT GTGGTCCTTT AGGACTGTAC GAGAACGCTC 

205601 CCTCTCTTTT TGAACGTCGA TCTCTTCTAT TGTTAAAGAA AATTAGCTTT 

2 05651 GGAGAGTTTG CTCTTTTAGA AGATGGTCTC AAGAAAAACA CGTGGAGTTC 

205701 TTCGGAACTC GTTCAAATCA GACAAAACCT TTTTACAAGA TATTATGCTG 

205751 ATAAAGAAGA GGTAGATGAA GCAGAGTTAA ACGCTGATTA CGAACAGTTT 

205801 GATTCCCTCC TTCACCTTAT TTTTTCTCAC AAGCTCTCTT GAAAGCAAGT 

205851 GCAATTATTT CAATATATGA ATGAGTCCGG ATGGGATTGG CTTTGTGATT 

205901 TTGATTCTCA AGGCGAGGGA TTCCAGTTAT CACGTCTGGT TGGGCTGTTA 

205951 CATTCGTCCT GGGCATTATA CGAAGCAAAA GAGCAATTTT ACCTTCCTGA 

206001 GGTTTCTCTA TTGACCTGGG AAGAACTGAT AGAAATGCAG TTATTAAGCA 

206051 AACCAACAAA ACACGGGGTT GCAAAAGATC TTTGTAATGT ATTTGAAAAA 

206101 CACTTTCAAA GGTTTAGACA GTACCTAGGT TCCTTAGATC TAAATCAAAG 

206151 GTTCGAAAAT ACCTTCTTGA ATTATCCTAA ATACCATTTA GATAGGGAGT 

206201 GAGAAAAAAA TCCTAGGTCA GCTTGGCAGA AATTTTTGAA ATCCTTAAGT 

206251 GTTCGATCTG CATTTTTTTC GGGGATTGTA AAAAGTTTGC CTAATCTGAA 

206301 GGGTAAGAGA GAGCTATTTT TCATGGGATT TTTTATTACA GGAAATTCTT 

206351 GAATTAGAGA TTTTTATTTA CACATAATCT ATACTGCCTT CAATAGATCT 

206401 ATATCTAAGG AGTTGGCTAT GAGCATGACG ATCGTTCCAC ATGCTTTATT 

206451 TAAAAATCAT TGCGAGTGTC ATTCTACCTT TCCTTTGAGT TCAAGGACTA 
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206501 


TTGTAAGAAT 


AGCCATTGCC 


AGCCTCTTTT 


GTATAGGTGC 


ATTAGCAGCT 


206551 


TTAGGCTGTT 


TGGCTCCTCC 


CGTTTCTTAT 


ATTGTTGGGA 


GTGTTTTAGC 


206601 


TTTTATTGCC 


TTTGTCATTC 


TTTCTTTAGT 


AATTTTAGCT 


TTGATTTTTG 


206651 


GAGAGAAGAA GCTTCCACCA ACACCAAGAA TCATTCCTGA TAGATTTACT 


206701 


CACGTGATAG ATGAAGCTTA TGGCCTTTCA ATCTCTGCAT 


TTGTAAGAGA 


206751 


ACAGCAGGTA ACATTAGCCG AGTTTAGACA ATTTTCTACT GCCCTGTTGT 


206801 


GTAACATATC 


TCCTGAAGAG 


AAAATCAAAC 


AATTGCCTTC 


TGAATTGCGA 


206851 


AGTAAAGTAG 


AGAGTTTTGG 


TATTAGCAGG 


CTCGCAGGTG 


ATTTAGAAAA 


206901 


GAATAATTGG 


CCAATATTTG 


AAGATCTTTT 


AAGCCAAACC 


TGCCCGTTAT 


206951 


ATTGGCTTCA 


GAAATTTATA 


TCAGCAGGAG 


ATCCACAAGT 


TTGTAGAGAC 


207001 


CTAGGTGTCC 


CTAGAGAATG 


TTATGGGTAC 


TATTGGCTAG 


GGCCTTTGGG 


207051 


ATACAGTACA 


GCTAAGGCTA 


CAATTTTTTG 


TAAAGAGACG 


CATCATATTC 


207101 


TTCAACAATT 


AACGAAAGAG 


GACGTTCTTT 


TATTAAAAAA 


CAAGGCTCTT 


207151 


CAAGAGAAAT 


GGGATACTGA 


TGAAGTCAAA 


GCAATTGTAG 


AGCGTATCTA 


207201 


CACTACCTAT 


ACGGCACGAG 


GAACTCTAAA 


GACCGAAGCA 


GGGGGACTTA 


207251 


CAAAAGAGAC 

******** VJ** VJ*» V- 


AATCAGTAAG 


GAATTGCTAT 


TGTTGAGCTT 


GCATGGCTAT 


207301 


TCTTTTGATC 


AGCTACAGCT 


GATCACTCAA 


CTTCCTAGAG 


ATGCTTGGGA 


207351 


TTGGCTGTGT 


TTTGTAGATA 


ACAGTACCGC 


ATACAACCTT 


CAGCTTTGTG 


207401 


CTCTTGTAGG 


AGCTTTGTCA 


TCCCAAAATC 


TTCTTGACGA 


ATCTTCTATC 


207451 


GATTTTGATG 


TAAACCTAGG 


CCTGTATGTG 


ATTCAGGATC 


TAAAAGAAGC 


207501 


TGTTCAAGCA 


TTTTCTGCTT 


CTGATGAGCC 


AAAGAAAGAA 


CTAGGTAAAT 


207551 


TCTTGTTAAG 


GCATTTGAGT 


TCAGTTTCTA 


AGCGATTAGA 


GAGTGTATTA 


207601 


AGACAGGGTC 


TTCACAGAAT 


AGCTCTAGAG 


CATGGAAATG 


CCAGAGCTAG 


207651 


GGTTTATGAC 


GTCAATTTTG 


TAACAGGAGC 


TAGAATTCAT 


AGGAAGACGA 


207701 


GTATCTTCTT 


TAAAGACTAA 


ACCAGGTAAC 


TAGCTTTTTA 


GTCTCGAAAG 


207751 


GGGCTACGAG 


AGGGAGTAGA 


ATTTTTCTTA 


TTCCAAAATA 


GAAATCTCTA 


207801 


ACCTATCAGC 


AGGGAGGACT 


CGTCTTCTTC 


AGGTTGGGAG 


AAGACGCTTC 
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207851 


TCTTGAGTGG 


ATGGACTGTT 


ATAGGTGGTA AGCTTCTAAA 


GGTGoAAl I A 


"■s r\ ^i f\ r\ n 

207901 


TGAGTTGCTA 


AATTCTCAAG 


GAACATCGCT 


ATATTGGTTG 


Av- 1 GAt-GAUG 


207951 


GATCGTTTTA 


AACTGATCTA 


AACTAAGGAG 


TGTCAATGAA 


GAAAGAAA 1 o 


208001 


CTTCGCTTTC 


TTTATGAATT 


AGGCCTTGTG 


TGTAGAGAGT 




208051 


GAGGCAAATT 


CCATGTCTTG 


GGGTCGTCCT 


GTATGGTCTA 


AAG I GAAGUA 


208101 


ACAGAGGTTT 


TCCCATAAAT 


CAGCCGGGAC 


TGTTTTTATT 


TV TV /"*" TV p^TTT 

AAGGALA1 C 1 


208151 


GCTTCCACGA 


GTACCTATGC 


TTAAAAATAC 


ACAAGAGCAG 


OTT ATA/"»T\ A T 
GT i ATAGAA r 


208201 


TCATCCTCTT 


GAAGAGAGGG 


TTGAGATAGA 


TGTTTATAGG 


ATTG AT AG G L 


208251 


CGCATGATAT 


TTAGCAAAGA 


GCTGCTGTTT 


CATAGTATTT 


7v TV (T^r^ TV OT> A 

AGATCACTAT 


208301 


GATTCAGGTT 


CCACTCGCCA 


TTTAATGCTG 


CATACTTAAG 


ATGTTGAAAA 


208351 


ACATCACGAG 


TTAAGACTCT 


AGCTAAAACT 


AATGTGTGTA 


GATTTAGAAT 


208401 


. AGAGGAATTT 


GTTGGTCTAA 


AATCTAAAGG 


AATCAACCAA 


TAGGTAGGAA 


208451 


GCATCTCTGG 


TTTTATATAA 


TCACTAAGGT 


CTTCTTCGAG 


GCTGCCTCCA 


208501 


ATTTTCTTAA 


CTCGATGGAG 


CTCTACAACC 


TTGCTGCCTG 


CAGAAATAAA 


208551 


CTCTCCTAAC 


CAGAAAAATG 


GAAATACCTT 


TTGTAAGATT 


TTTGGTAAAG 


208601 


AGGGGAAGAG 


GGGACGGGAA 


GAAGCAGCGC 


TTTCAAGTCT 


TTwr>rr> iv > r> jv f» jv jv 

TTTAAGAGAA 


208651 


GCAGTACGGA 


CTTCATTTTT 


TAAGCGTGCG 


ATACCCTCGA 


ACGTATG I A 1 


208701 


TTTCTGTTGT 


AGTTCTTCAG 


ATACGTTGTA 


ATTTGTAGAT 


GATCCAAC i 1 


208751 


CAGAGTGCAA 


TTGATTTAGA 


AGATCAATAA 


AACTTATGAG 


7V mrtmrnrp TV T\ TV 

ATCTTTAAGA 


208801 


TTTGGTTTAG 


CTTCTGAAAC 


AAAATCAGAG 


ACAAATTTAG 


G ATAGTGG G G 


208851 


CCTGATTCTA 


TTTACGAGTT 


CTTGGGGGAA 


TACTTGTTCT 


H n I V /"< TV fT*/** TV TV TV 

TTTGATGAAA 


208901 


TCATAGGCGT 


GATTTTTTTT 


ATTCCTAAAA 


TCACACCAAT 


CAAGGCTATT 


208951 


AATAATCCTA 'ATCCTAACAA 


TGCGCCACTT AGAATATAGG 


AAACAGGAGC 


209001 


TGCTACACAC 


AAGAAAGCTA 


TCAAAGCTCC 


GCAGAGTAAG 


ATGGCACTGA 


209051 


TAACAATATG 


AATAGTGGTT 


GAATTCTTTA 


ATTCAAAATA 


ATAATTACAA 


209101 


GAGCGATTAT 


TTTGAATAAC 


TGGCGAGGTT 


ATATTACTCA 


TACATTTCCT 


209151 


AGGCTCATCA 


ATTTGCGTAT 


TATATCGTAA 


ATTTAACAAA 


AATCCTATTA 
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209201 ATAGAAATGT TTTTATTTTT AAAATTTTTA TTGAAATTGT TTTGTATTAT 

209251 TGATGAAAAG TTTTCATTGA AGATAGCTTC TAGAAGAAGA ATAACGAGAG 

209301 AGCTTGTTAT GCTTTTTTTA AAAAAAGGGC GAGCCCCTTT TTGTTCTTTT 

209351 CCCGATTGGG AATAACGTAT TCTTTTTATA AAAATTAAAG ATTTTTTTAT 

209401 ATTTCTAAAT CTCAATTATA AAAAGTCTTT CAAACGCAAT TTGCTTGTAG 

2 09451 GTATTGCAAA TGATCAGACT CCCCTTATAG GGAAGATCGT ATAGGGAGAC 

209501 TGCTTTCTAT TTTTTTCTCT AGTTTGTAGG TTTGGGCGTG GAAAGCATAC 

209551 TGAGGTAAGC TTGATGTCCA TTGTGAACTC GTATTACGAA CAAATTGCCA 

209601 TCGTTTTAAC AGATTTTGAT GGTGCGTTGT ATTTACGCAT ATCTTCTGAA 

209651 CAAGAGCACT CGCTGGGTGC ATAGGACTTT CTTTTACTTT CTCTAGTAAA 

2 09701 ACCTTCAATT CTTTCCACGT CATGAAGTTC ACTGTAGGTT CATAGTTAGA 

2097 51 GGATACTGGA TCGAACATAT TTGTTTCAGT ATTCAAAAAG CCTCCAAATG 

209801 TTGCCATGGA ACAGTGGCCT CCTGCTTTAT CAAACTGACA CAACATTTTC 

209851 CAATTATCAG GATTTATAAG TTGAATCATC TGAGCCTGTT CCCAAGTGAT 

209901 ACCATGAGAA AAGAAAAGAA ATAAGAATTG TGAGATTCCT TGAACATCCT 

209951 TCCGGAAAAT CATATTGTGG GGGAGTTCCT TGAATATTTC TTCGCAGGTT 

210001 TTTTTCACAG AAGGAGAATC CCATTGATTC TTAGACGCTT TACTATATAG 

210051 GAACTTATAC TGTGATTCTG AGATTAATGT TAGTAGAGGG CGTGTATAAG 

210101 AGTGGAAAAT AGTTGTATAT CCTTTATGAA ACGCTAAGGG CCCAAGTAAA 

210151 CCATAAACTT TTTGTGTTTT ATTTAATCCG ATTTCCCCAG CAACAGATTC 

210201 AGTTTTGTCT ATAAAATGGG AGAGCCAGTA TAACGGGCAG TTTTGAAGAA 

210251 GAATCTCTTC GAACTCTGGA AACAGGGTTA AATCTATAGA TTTTAGAATA 

210301 TCGATCCCGA AAGCCTCTGC TTTTTTATGT AAATTCGGAG GCAGGTCTGT 

210351 ACCGCTTTTC CAGCAATTAA TAAATATTTT TAATTCGTTA ACAGTCAGGG 

210401 AGTGTGTTTT CACGAAATAA AAGACTTCTT TAGGATAGCG ATTGTAAATA 

210451 ATCTTCTGAA GTTCGTTGGG GATGGGCAGA ACCTTTGATT TAGCTAGCAG 

210501 TGCTACGACT AGCGTTATAA TCAAGATTAC GATAGCGGCT AAAGCTAAAG 
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210551 


TTCCTCCAAT 


AGCATAGCTA ATCGGCGCAG CAAGGAAAAC AAAAGAAAGT 


210601 


GCGCTAACAA 


GAGCTAGAAC AAGCCCAAGA ATGAGTCGGG 


CAATTGTCCT 


210651 


AATTTTTAAA 


GAACAAGATC TATGACATTC GCAGTCATTT 


TTTAAAAATA 


210701 


TATGGGGAAC 


TAGTGTTAAA GGACTACACT TCATAGTACA ACCAGAATGT 


210751 


CTCACGGTTA 


TTATAACCTT TTGATTGAAA AAAGCGTACG 


AATTCGTGAA 


210801 


ATGGGAGTTA 


ATAAAAAAAA TCCCTACCCA CTAAAAAGCA 


GGTAGGGATC 


210851 


AACAAGAGTA 


AGAGAAGCAA CTCTATGAAG AAGCAGGAGC 


TGAATCTTCT 


210901 


TGAGCCACTT 


CTTGTTCTTT AAGAGCAGAC TGCGCTAAGA 


ATAGTTTGTT 


210951 


TAACTTAGTT 


GCAGAAACCA ACCAAATAGC AATGATGAAA 


AGAAGAATCA 


211001 


CTGCAAGATA 


AGGGGTCATA GCTCCAATAC TTCCACAGAT 


AACGAGCAAA 


211051 


CCTTGTTGGA 


TTAAAGCTCC TCCTGATTTT CCGAAGCGGG 


CAGCAACTAC 


211101 


ATCAATAGCA 


GCCTTACCTT TGACTTTTTG CTCTTGGTCA 


AGAGGGATAT 


211151 


AGGCCATTTC 


TTTAGTTGAG TCAAAGAGAG CGTATTTTGT 


GGATTTCGAA 


211201 


AGAATATTCT 


GTATAGCTCC GACAACCACA GCTAGCATGA 


GAGGAGTTGT 


211251 


ACCGAACATA 


GCGACCAGCC CAGAAGCTTG GTTTCTAAAG 


ATAACAAGAG 


211301 


CGAAGAAAAC GATACCTGTT AGGAGAACCA TGACAGGAGT 


GACTAGGGCT 


211351 


CCAGTTAACC 


ATCCAAATTT ACGAATGACG TTACCACCAA 


CAAATAGCAT 


211401 


GATAAGTACG 


GATACTACGC CAGTCCAGAA GGAGAAGTTC 


CCCATGAACT 


211451 


CACTATAGTC 


ATTCATATTA GGATATTGCA GTTTCAGCTG ACTTTTCCAA 


211501 


GTCACTTCGA 


TTAAGTTAAT GCAAATACCA TAGGCAATAA 


CCAAGAGAGC 


211551 


TAATAAAAGA 


ATATAAGGAG ATCTAGCAAG ATAGAGGAAG 


CTATCTTTCA 


211601 


TATTCATTTT 


AGGTTTAGCA CCTTTTTTCC CCTTTTGCAT 


TTCTTCTGGA 


211651 


TTATAGAAGC 


GAGGATCGGT CAATACGTTC TTATTGATCC 


ACCAGTAACT 


211701 


GGCCATAAGA 


ACAAGTCCAG ATACAATAGT CATAGCCATC 


AAAAGACGTA 


211751 


AAGAAATTCC 


CCAAGGATCT ACACCTTCAG AAACGGAAGC 


TCTCAACTTT 


211801 


GAAGCCCAAA CAATTGCACG ACCAGAAGCT AGTAAAGAAA 


TATTAGCTCC 


211851 


GATACCGAAA 


AGAGCGTAGA AACGCTTTGC TTCGTGGATT 


TTTGTAATTT 
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211901 CATTAGCAAA TCCCCAGAAC ATTAGAGATA GCATGACGCT TCCCCATAGT 

211951 TCAGCAAGTA CATAAAATGC AGCAAATGTC CAGTTTCTTA AGATGGCAAC 

212001 GAGTCCTAGC AATCCTGGAG GTAGGATGGC CTGTAAACGG TCAGCAAATT 

212051 CTGTAGGATG' TAAAACATCG CGTAGCGGAT AAATTACAGT CGGGAACAGG 

212101 GCAAAGAAAA TTAAAAAGGG CGTTCCCACT GCATAAAATA AGGCCTGCTT 

212151 ACTTAAAATA TTACTTAGCT TTGCATAAAT AAGCATAAAG ATAATAGCAC 

212201 AGGGGACAAC AAGCCAAAAC TTGATGAAAG GTATTGCCTC TGCACCAGAA 

212251 CCAGGAGCTC CCACAATAAG AGTGTCTTTT GTATCGCGTA ACACCGTATA 

212301 GTTAAATGTA ATACAGAAGA ACATTAGGAA CATTGGCAGA ACTTTCTTTA 

212351 GCTCGTGAGT ATGTATCGGC CACAAGAAAG AGCGCAATTT TCCAAAAGGT 

212401 TTTTCTTCGG TTTTTGTCAT ATTTACCCTC TGAAATACTT TTATTTTCTA 

212451 TTCTGATAGT TTTTTATTTC GTTGTGGACT AACTTGAATT GTTTCCATTT 

212501 CGAAAGACCT TATAATATAC CATCTTTTCT CTCTATTGAC AAGAGGAAAG 

212551 GGATACTCCT AAGGGATAAG AGTTCTTAGG CGACTTTATT GCTAAGTCGC 

212601 CTGTAAAATT TATTTTTTAA TTAAACGAAA GGAGAGGTAA CTTCAAGGTT 

212651 GGGAGAGCTT ACAGCCTTCA CAGCTTCGAC GTAGGCATTC CATAGCTCTA 

212701 CGCTCACGAT ACCGTTGCGA ATCGAACGAA TTAACTCCCT TTTTAAAGAA 

212751 GTTAACGGTT TTTTAGGGCT AAAGCGAGCG AGGAGGTCTT TGTCTCCAAC 

212 801 TTGTCTTGCC AGTTCTAAAA CGCGTTCAAT ATCTGTTTTA GTAAAGTTAA 

212851 TAAAGTCACG ACGTTTTTTA GAGCCTCTAG GAGCTCTAGG CTTAGGATTA 

212901 ATTGTGCGTG GGGTTTCCGA TTCTAAAATA AATGTTTTTA ACATTTTTAA 

212951 AAGTTGTTCG GGAGCAGCGC TTTTCATCTT TTTTAGAGTG AAATGATGCA 

213001 TATAACCGCC GCTAGGGCCT GGAAGATAGC GACATAAATC GTTTTCTTTG 

213051 TTTCCGCAGA CTCTTTTAAT AGCTTTTCCG ATCAGTTTTT CTATTTCTTC 

213101 TTTTATATGT GATTGTTCTA CGGCCATGAA ACCCACCTTT TATAATTGAT 

213151 GCTTACATAT TTTACAAGAA AGAATGTGGC ATACTTTCAA TTTAATGTTT 

213201 AGAGAAATAA AATTAAAGCG CTGTTTTAAT CAATCAGCGA ACATAAATTA 
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213251 TATTGCAAGT TCTCCTTTTT GCAATATTTT TCCTCTCAGG ATCTTTTTGT 
213301 TTCTTAACAA AATTCAAAAC ATAAAAATCG TAAAGTAAAG AAGATTTTTT 
213351 CTATAACATT TTGTTTTCTT ATAAAACAAA CGGGTTTTCT AATTTTTTTA 
213401 AAACTATCCT GAACAAAGAT ATTTCTTTCA TTTCTATTAG TTGTATTTCT 
213451 TGCTGGATAA AATATATTTT GAATAGAACT TGTTTTTCTG GTACTTTAAG 
213501 TGGTTTTTAA TGGATCTTAA AAAATGCTTC TAGAGAGATG GATGCGAAAA 
213551 TGGGATATAT ATTTAAAGTG ATGCGTTGGA TTTTCTGTTT CGTGGCATGT 
213601 GGTATAACTT TTGGATGTAC CAATTCTGGG TTTCAGAATG CAAATTCACG 
213651 TCCTTGTATA CTATCCATGA ATCGCATGAT TCATGATTGT GTTGAAAGAG 
213701 TCGTGGGGAA TAGGCTTGCT ACCGCTGTTT TGATCAAAGG ATCCTTAGAC 
213751 CCTCATGCGT ATGAGATGGT TAAAGGGGAT AAGGACAAGA TTGCTGGAAG 
213801 TGCCGTAATT TTTTGTAACG GCCTGGGTCT TGAGCATACA TTAAGTTTGC 
213851 GGAAGCATTT AGAAAATAAT CCCAATAGTG TCAAGTTAGG GGAGCGGTTG 
213901 ATAGCGCGTG GGGCCTTTGT TCCTCTAGAA GAAGACGGTA TTTGCGATCC 
213951 TCATATCTGG ATGGATCTTT CTATTTGGAA GGAAGCTGTC ATAGAAATTA 
214001 CAGAAGTTCT CATTGAAAAG TTCCCTGAAT GGTCTGCTGA ATTTAAAGCA 
214051 AATAGTGAGG AACTTGTTTG TGAAATGTCT ATTTTAGATT CTTGGGCGAA 
214101 ACAATGCTTG AGCACAATTC CTGAAAATTT ACGGTATCTT GTCTCAGGTC 
214151 ATAATGCGTT CAGTTACTTT ACACGTCGCT ATTTAGCTAC TCCTGAAGAA 
214201 GTGGCTTCCG GAGCATGGAG GTCTCGTTGT ATTTCTCCTG AGGGTCTATC 
214251 TCCAGAAGCT CAAATCAGTG TTCGTGATAT TATGGCGGTT GTAGATTATA 
214301 TTAATGAGCA TGATGTCAGT GTGGTTTTCC CTGAGGATAC TCTGAACCAA 
214351 GATGCGTTGA AAAAAATTGT TTCTTCTCTG AAGAAAAGTC ATTTAGTTCG 
214401 TCTAGCTCAA AAACCATTGT ATAGTGATAA TGTGGACGAC AATTATTTTA 
214451 GCACCTTTAA ACATAATGTC TGCCTTATCA CAGAAGAATT AGGAGGGGTG 
214501 GCTCTTGAAT GTCAAAGATG AGACTTTTTG GTCTGTACAC AACCTTTGTG 
214551 TAAACTATGA GCATGCAGCC GTTCTTTATC ACATATCCTT TTCCTTGGGA 
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214601 AAGGGGTCAT TAACTGCTAT TTTAGGTCCT AATGGAGCTG GTAAAAGCAC 

214651 TCTCTTAAAG GCTTCCTTAG GCCTGATCAA ACCCTCTTCG GGGACTGTTT 

214701 ATTTTTTTAA TCAAAAATTT AAGAAGGTGC GTCAGCGCAT AGCCTATATG 

214751 CCTCAGAGAG CTAGCGTGGA TTGGGATTTT CCAATGACTG TCTTAGATTT 

214801 AGCCCTTATG GGGTGTTACA GCTATAAAGG AATGTGGGGG AGAATTTCTT 

214851 CGGATGATCG AAGGGAGGCC TTTCATATTT TAGAAAGAGT TGGTTTGGAA 

214901 TCCGTAGCAG ATAGACAAAT AGGACAGCTC TCAGGAGGAC AGCAACAAAG 

214951 AGCATTTTTA GCACGTGCTT TGATGCAAAA AGCAGATCTA TATCTTATGG 

215001 ATGAGTTGTT TTCAGCGATT GATATGGCTT CGTTTAAAAC ATCTGTAGGG 

215051 GTTTTGCAAG AGCTGCGAGA TCAGGGAAAG ACTATCGTCG TTGTTCATCA 

215101 TGACTTGAGT CATGTGCGTC AACTATTTGA TCATGTGGTT TTATTGAATA 

215151 AGCGTTTGAT TTGTTGTGGC CCTACTGATG AATGTCTGAA TGGAGACACT 

215201 ATTTTCCAAA CGTATGGTTG TGAAATTGAA CTTTTGGAAC AAACCCTGAA 

215251 GCTCTCTCGA GGAAAACAAT TTGGATCGTG CTGATTATGC TCAGTTGTGT 

215301 TTTTTCTGAT ACGATTTTCT TATCTAGTTT TTTAGCTGTC ACTTTGATTT 

215351 GTATGACCAC AGCTTTGTGG GGGACAATTC TCTTGATTAG CAAGCAGCCT 

215401 CTTTTAAGCG AAAGTTTATC TCACGCGTCG TATCCAGGAC TTCTAGTTGG 

215451 AGCTTTGATG GCGCAATATG TTTTCTCATT GCAAGCTTCT ATTTTTTGGA 

215501 TTGTGTTGTT TGGGTGTGCT GCTTCGGTAT TTGGTTATGG GATCATTGTT 

215551 TTCTTAGGGA AAGTATGTAA ATTACATAAA GACTCCGCCC TTTGTTTTGT 

215601 TCTTGTGGTA TTCTTTGCTA TCGGAGTGAT TTTAGCCAGT TATGTCAAGG 

215651 AAAGTAGCCC TACGCTATAC AATCGCATTA ACGCCTATCT ATATGGGCAA 

215701 GCAGCCACTT TAGGTTTTCT TGAAGCTACG TTGGCTGCGA TCGTCTTTTG 

215751 TGCTTCGTTA TTTGCTTTAT GGTGGTGGTA TCGACAAATT GTTGTGACTA 

215801 CTTTTGATAA AGATTTTGCT GTTACTTGTG GCTTAAAGAC TGTTCTTTAT 

215851 GAAGCACTCA GTCTAATTTT TATATCGTTG GTGATCGTAA GTGGAGTTCG 

215901 AAGCGTAGGG ATTGTTTTAA TTTCTGCTAT GTTTGTGGCT CCTTCTTTAG 
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oi com 


bl bL 1 ubl LA 


GPTTTPCGAT CGTCTAAGTA 


CAATTCTTAT 


CCTTTCTGCA 


o i cnfti 
Z lb u u 1 


1 1 L I 1 1 VjVjA\j 


PGATTAGCGG AGCTTTAGGA 


AGCTATATCT 


CTGTAGCATT 




P IV r* TV T 1 /** Tr^T 


PPTATTATAG GGPAAPAGGP 


GGTGCCTGTA 


ACCTTGCCTA 


oi ci n 1 


UvjouAUU ill 


rGTTGTCATT TGTGCTGGAT 


TATTGGCCGG 


TCTATGTTTG 


")1 £1 CI 

Z 1 bl D 1 


LI i I 1 1 


r&7ia.7iTrTGG GTGGGTPATT 


CGTTTTGTCC 


GTAGGAAGCA 


01 com 
Z IbZUi 


LI 1 1 1 v-L> 111 




TTTAAAGGTG 


TTTTGGCATA 


oi com 


Tl 1 L 1 LA1AA 


TPGTTTHG&G A APATTAGTG 


TTCGAGATTT 


TGTCTGTAGT 


Z lb J Ul 


TATAAbTA 1 L 


AGGAGTATTT TGGGCCTAAG 


CCTTTCCCTA 


GATGGAGAGT 


Oi £T Ci 
ZlD J Dl 


TCAGATTTTA 


GAATGGCGGG GTTATGTTAA 


AAAAGAACAA 


GATTATTATC 


O 1 £ A fi 1 

z lb4 Ul 


VjAC 1 C Av-AAA 


AAAAGGAAGA AGTGAGGCCT 


TAAGATTAGT 


TCGTGCTCAC 


O 1 C /I C 1 
Z 1 D 4 D 1 


AbA T I ATbbb 


AATCGTATCT TGTGAATTCT 


TTAGATTTTA 


GCAAGGAAAG 


O 1 CCfll 

Z 1 b d U 1 


Tbl 1 CAKjALj 


TTGGCTGAGG AAATAGAGCA TGTTCTTACT GAAGAATTGG 


O 1 CC.C1 
Z IDDj 1 


ATv-Al ALL 1 1 


GACAGAGATT CTCAATGATC 


CTTGTTATGA 


TCCTCATCGA 


O 1 C C C\ 1 

Z lob Ul 


C TV TV A rprp » rp/-. /-* 

CAAA1 lAlv_v~ 


CAAATAAAAA AAAGGAAGTC 


TAATGGCTTT 


GGGACCTTCT 


O 1 

z lbODl 


CCTTATTATG 


GAGTATCTTT TTTCCAATTT 


TTTTCAGTAT 


TTTTTTCGAG 


o 1 com 
zl b / Ul 


AC TGTTTTC T 


GGAAGTCTTT TCACGGGTTC TGTCTATATT GATGATATTC 


o i c*7 1; 1 
z 1 b / Dl 


A r» TV TT IV *r* i\ r* T» 
AbA 1 1 A 1 Ao 1 


ATTCCTTGCT ATTTCCTGTT 


CAGGTGCTTT 


TGCAGGAACT 


O 1 C 0 1 

z lboU 1 


iTTlTAblbl 


TGCGAAAGAT GGCTATGTAT 


GCGAATGCTG 


TCTCTCATAC 


O 1 CQCI 

Z 1 bo D 1 


TbTLC 11111 


GGTTTGGTCT GTGTTTGTTT 


GTTTACGCAT 


CAACTGACGA 


o i c q m 
zi by ui 


LL,\- 1L1L1 1 1 


GGGTACCTTG ACTCTTGCAG 


CAATGGCAAC 


AGCTATGCTG 


O 1 £ Q C 1 

zi by di 


ACAbb-bl F1L 


TTATTTACTT TATTCGTAAT 


ACTTTTAAAG 


TTTCAGAAGA 


o i o n rn 
Zl / UU1 


LjALjU AVjL AL. C 


GCTCTAGTCT TTTCTTTATT 


ATTCTCTCTG 


AGCCTTGTTT 


217051 


TGTTAGTCTT 


TATGACAAAG AATGCTCATA 


TAGGAACGGA 


• GCTTGTGTTA 


217101 


GGAAACGCAG 


ATTCTTTAAC GAAAGAGGAT 


ATTTTCCCTG 


TCACTATTGT 


217151 


GATTTTGGCT 


AATGCTGTAA TTACTATTTT 


TGCGTTCCGT 


AGCTTAGTTT 


217201 


GTTCTTCTTT 


CGATTCTGTA TTTGCCTCTT 


CTTTAGGAAT 


TCCTATTCGG 


217251 


TTGGTTGATT 


ATTTGATTAT TTTTCAACTT 


TCTGCATGTC 


TTGTAGGAGC 
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71 7** m 

Z X 1 3 U 1 


TTTTAAGGCT 


GTAGGTGTAT 


TAATGGCACT 


TGCTTTTCTG 


ATCATTCCAT 


*31 ~71 ^1 
Z X / J Jl 


PGPTTATTGP 


TAAGGTTATT 


GCAAAATCGA 


TAAGGAGTCT 


TATGGCTTGG 


ZX /4UX 




TTAGTATTGG 

1 1 T\\J 1 f\l 1 WSJ 


TACAGCATTT 


TTAGCTCCTG 


CATCTTCGAG 


Z X /flDl 




AGTGPTTATG 
no 1 u\. i i *» * vj 


ATTTGGGGT T 


ATCGACTTCG 


GGAATCTCTG 


01 7*;n 1 
Z X /dux 


t a ptpttp'ft 


PAPGATGATG 


TACATCGTGG 


TTAAATTTAT 


AAGCTATTTT 


Zl / DDI 




TTTPTA A AAA 
X i i v» i tww* 


TTTTGAAAAA 


ATAAGTGAGA 


AAAGTTCTCA 


7 1 7£fi1 
Z X / QUI 


ATATTAPPAP 
Mini 1 nyvno 


TPATTTAAAA 


GAATGAAATT 


TTAGAGTTTA 


GTCCACTTTA 


"5 1 7£ ^ 1 
Z 1 /DDI 


/ITT AP A A A T A 


PT A T P AAT AG 


AGAATTGACA 


ATTCCTCGAC 


TTGCGGAGTA 


7 1 7*7 n 1 
Z X / / U X 




TTTPAPCTAG 


TTAAAGGTAG 


CATGCTTGAA 


ACATTTAGCC 


77m 

Z X / / 3 1 


PTT PTTP fin T 


PAAPAGGTAG 


TATTGGCCGT 


CAAACATTAG 


AGATTGTGCG 


Z 1 / OU X 


n c p p*p a t p pt 


TPAGAATTTA 


AAATTATTTC 


TATGGCTTCT 


TATGGAAATA 


71 7ft^1 
Z X / O D 1 


A TP T & A PPTT 


ATTTTTTCAG 


CAACTAGAGG 


AGTTTGCTCC 


GTTAGCCGCA 


71 7 on 1 


PPPPTPT AT A 


ACGAAGAGGT 


TTATAACGAG 


GCCTGTCAGC 


GATTCCCCCA 


717 Qt;i 

Z X / 7 D X 


TATPPA ATTT 
Ini Vj^»ri/\ ill 


TTCCTAGGCC 


AGGAGGGTTT 


AACCCAACTT 


TGTATCATGG 


*5 1 onni 

zloUUl 


Al ALAo 1 CAV- 


TACTGTCGTT 


GCTGCTTCTT 


CAGGAATCGA 


GGCGCTACCC 


ZXoUDl 




AGTCGATGAA AAAAGGAAAA 


GCACTAGCTT 


TAGCAAACAA 


7 1 qi n 1 

Z X O X U X 


AP A A AT TTT A 
AoA/iM 1 1 1 1ft 


GTTTGTGCTG 


GCGAATTGGT 


TTCTAAGACT 


GCAAAGGAAA 


71 fi 1 ^ 1 
Z X O X D X 


ATPPTATA A A 
A 1 Vjo 1 A 1 /i/V* 


AGTTCTTCCT 


ATTGATAGCG 


AGCATAATGC 


TTTGTATCAA 


71 Qorn 
ZXoZUl 


TPTTTAfil AP 
1 o 1 1 I nu/inu 


GCAGGACGAT 


TGAGGGAATC 


AAGAAACTGA 


TTCTTACAGC 


7 1 qoc; 1 

Z X OZ D X 


TTPTPP APPP 
1 1 V. 1 obnuUU 


CCTCTGCTCA 


ACAAGTCTTT 


AGAAGAGCTT 


TCTTGTGTAA 


7i pini 


PA A A A PA A PA 


TGTTTTGAAC 


CATCCTATAT 


GGAATATGGG 


TTCAAAAGTG 


71 Q 7 s; 1 
ZXoDDl 


AC1 bUjoALl 


CATCCACATT 


GGTCAATAAG 


GGACTCGAAA 


TTATCGAGGC 


218401 
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CTCAGAGCTT 


AATCCATGGT 


ATGGTAGAGT 


TTTTAGATGG 


GAGTGTGATT 


218501 


TCTATCATGA 


ATCCGCCTGA 


TATGCTCTTC 


CCAATACAAT 


ACGCTTTAAC 


218551 


AGCTCCAGAG 


CGTTTTGCAT 


CTCCTAGGGA 


TGGTATGGAT 


TTTTCGAAGA 


218601 


AACAAACTTT 


AGAATTTTTT 


CCGGTAGATG 


AGGAGCGATT 


TCCTAGTATC 
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218651 CGTTTAGCAC AACAGGTATT 

218701 TAATGCAGCC AATGAAGTAT 

218751 CTTGGTGTGA CATTTTACGC 

218801 GTTTATGCCT GCCACTCTTT 

218851 TAGAGCTCTT GCTCAAGAAA 

218901 ATTTTATTCT AGCAGCCCTA 

218951 CTTGGTCATC TGGTAGTAGC 

219001 TAGCATAGGC TTTGGTCCTG 

219051 AATATCGCAT TGGATGCATT 

219101 ATGGAACGTA CCAAAGAAAA 

219151 TGATATTCCT CAGGGATTTT 

219201 TTCTTGTTGC TGGTCCTCTT 

219251 AGCATTCTTT ACATGAATGG 

219301 TAAAGTGGTA GGTTGGGTCC 

219351 CTGGAGACGA GATTCTTACG 

219401 GACATGCTAA CAACCTCTTT 

219451 ACGTCCTGGC TATTTGACAG 

219501 AGTTTGATCC CACAAAATTC 

219551 TTGTATAGCA ACCAGGTGCC 

219601 AGAGCTACGT CCGAATGATC 

219651 TCTCAATGGC TCAGATATCT 

219701 AAAGTAGCAC GGAATGACAA 

219751 GGCTTCCGTT TTACATTACA 

219801 CGCAGTATGA GGCTGGACTT 

219851 CCTTATGTAA TCAATAGTTA 

219901 AGATCCAGAG TCTCCTTTGC 

219951 ATCGCATTCT AGCTATTGAT 



PCT/US99/26923 

AGAGAAACAG GGGTCTTCTG GAAGCTTTTT 
TAGTGCGGAG GTTCCTTTGC GAAGAGATTT 
AAATTAACGA CTCTTATGGA ATGTCATAAG 
AGAAGATATT TTAGAAGTAG ATGGTGAGGC 
TATAATCGAG TAGGTATATG ACAATAATCT 
GCTTTAGGGA TTTTAGTGTT AATTCATGAA 
AAAAGCTGTA GGAATGGCTG TAGAGAGTTT 
CTTTATTTAA AAAGCGTATA GGCGGCATAG 
CCTTTTGGAG GCTATGTTCG TATCAGAGGT 
AGGGGAGAAG GGGAAGATAG ACTCTGTCTA 
TTAGTAAGTC TCCTTGGAAA CGCATTCTGG 
GCCAATATTT TATTAGCTGT CTTGGCTTTC 
GGGAAGAAGT AAAAATTATA GCGACTGTTC 
ATCCTGTTTT ACAGGCAGAA GGATTGCTCC 
TGTAATGGTA AGCCTTATGT GGGAGATAAG 
ATTAGAGGGG CATCTCAATC TAGAAATCAA 
TTCCTAGCAA AGAGTTCGCT ATTGATGTTG 
GGGGTTCCCT GTTCTGGAGC GAGTTATCTT 
CCTAACGAAG AACTCTCCTA TGGAGAATTC 
GTTTCGTTTG GATGGATGGC ACACTTCTTT 
CAGATACTCA ATGAGTCTTA TGCTTTTGTG 
AATCTTCTTT TCTCGTCAAC CTAGGGTATT 
CTCCCTACCT TCGTAATGAG CTTATAGATA 
AAAGGCAAGT GGTCTTCGTT ATATACATTG 
TGGATACATA GAAGGTGAAC TTACTGCTAT 
CACAACCTCA AGAGAGGCTA CAGCTTGGGG 
GGAACTCCTG TTTCTGGAAG TGTAGATATT 
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220001 TTACGTCTTG TTCAGAACCA TCGGGTCTCT ATTATTGTTC AGCAGATGAG 

220051 TCCGCAAGAA CTTGAAGAGG TGAATTCTCG AGATGCTGAT AAGCGGTTTA 

220101 TCGCCTCTTA TCATTCCGAA GATCTGTTAC AAATTTTGAA CCATTTAGGA 

220151 GAGTCTCACC CAGTAGAAGT CGCGGGTCCT TATCGTCTTC TTGACCCTGT 

220201 TCAGCCTCGT CCTTGGATTG ATGTTTATTC TTCGGAGAGT TTGGATAAAC 

220251 AGTTGGAAGT AGCTAAGAAG ATTAAGAACA AGGATAAACA AAGATACTAT 

220301 TTGGAGCGTC TTGATGCTGA GAAGCAAAAA CCATCTTTAG GGATTTCTTT 

220351 GAAAGATCTT AAGGTGAGGT ATAATCCTTC ACCTGTGGTT ATGTTATCAA 

220401 ATATTACTAA GGAAAGTTTG ATCACCTTGA AAGCTTTAGT TACTGGACAT 

220451 CTGAGTCCAC AATGGCTTTC AGGACCTGTG GGTATTGTGC AGGTTTTACA 

220501 TACAGGATGG TCGGTAGGGT TTTCTGAAGT GCTCTTTTGG ATCGGTCTAA 

220551 TTAGTATGAA TTTGGCTGTC TTGAATTTGC TTCCTATTCC TGTTTTGGAT 

220601 GGAGGTTATA TCCTTCTATG TTTGTGGGAG ATAGTCAAAA GAAGACGTTT 

220651 GAATATGAAG ATTGTGGAAA GGATTTTGGT TCCGTTCACT TTTTTATTGA 

220701 TCATCTTCTT TATTTTTCTA ACTTTTCAGG ATTTATTTCG TTTTTTTGGT 

220751 TAAGGCTCCA TTTTTTTGGA AATCCGTAAA GGTTGTAAGG ACGAGAACCA 

220801 AAGCAGTGTA TAACGTAGAT GTCTTGGTTA GGGTAAGGCT TAAGTCTTGA 

220851 GTTCTTTTGT TTGTCTTCCA TAACAGTTTG TTCTAAAGCT GCTTCGGGAA 

220901 CGTGCTCGAT TATCGGGGTT GTATTTCCTT CTTTTAAAAA CTCTTTCATA 

220951 AG AT AG AG CG ACTTCCGATT AGTAATGATG TAGGTATGAC TCGTGTGGAT 

221001 CACGTAAGCT TTATGTGGAG TATCTAGCTT TGTCTCTATA GTGCATACCT 

221051 TACCATCGTT TTCATAGGGC AGGCGGAAAG GAAGGAATTT GCTATGTCTG 

221101 TTCCCACTAA GAATGAGAAC GTCTAAAGAC GAAGGGAGTT TCCCGACATT 

221151 TAACATCTTT GTGGGGCAGT AGGTAAGAAG CTGTCGTCCT AATTTTCCTC 

221201 CAAATACGAA CTGTACGAAT TTCACACAGC GGTAGCGTCT AGCTAGTGTA 

221251 GACCCTGCGT TCGGAGGAGC CATGAGAATA GCTTTTCTTT TTTGGCTTCT 

221301 TCGGGGCAAT CAGGGTGAGC GAGTGCTACA CGAACAATGA CTCCTCCAAT 
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221351 AGAATGAGTT ACGAAGTTTA TAGGGACTCC AGGCTTAAGT TCAGCTATTT 

2214 01 TTTTCAGCAA GCGATTGAGA TGTTCAGCAT GCTTTTCTAG AGTAAACTTG 

221451 CGCGTCTCAT AATTCCAAAT AAAGACATCG TAATGTTCTT TTTCTAGAAC 

221501 GCGAGCAATA GGTTTTAAAG ATGTATAAGA TCTTAAAAAC GCATGCACGC 

221551 AGACCACGGA TTCTTTTTGT TTTGAGGTTT CCTTTAATCC CCCAATTCCA 

221601 GATGGAAGGG TTTGGATTAC CGAGGTTTCC GAGAATAAAG GATTACCCAC 

221651 AGCTAAAAAG AGTATAGTTA ATAAAAATTT CTTCATGTGT CACCGATATA 

2217 01 ATTATTTTAA ATAAGTTTTT TATTAAAAAA GAATATGTTT ATTAAAACTT 

221751 ATTTATAAAA ATAACTAATT CGCTTACTTT TAACGCAAAA GAAAAATGAT 

221801 GTTATTGTTT TAGATAATTT GCTCAGAAAC TTGAGCGTTC TCGATACTTA 

221851 AAACAAGGCT TGTTTTTGGA AGTTCCCCAT GCATATGGGT GGAAGTAATC 

221901 AGAGTCTGAC CCAGAGTTGG GGCAGGGTCA AGGAGTTGAC CGACACGTTC 

221951 ATTATCTAAT CCAGCATGGA TATCATCTAG ACAGACTAGA GGGGAGACGT 

222 001 GATGAGATTG CTTTAGATAT AGGCACTCAG CAAGCCTTAA GATTGCCAAA 

222051 AGACTGTGTT TCTGCCCTTC ACTAGAGAAT TGAGACACAG GCATTTGGTT 

222101 CATAGTGAGT AGAAAGTCTT CGCGATGAGG GCCAACGGAA GTGCTTCCTC 

222151 ATTCGAGATC TCTAGGAAGT GATATAGAGA GCTGTTTATG AAATTCTTCG 

222201 GCAACAGCAG TTTCAGAAAT ATCAGAATTT TTAATTAAGG AACTTTTAAA 

222251 TTTTAAGGCC AATTGTTCTT TTAGGTTGTT AGACCAAAGT TCTTTGGATA 

222301 AATCTGAAAG TTTCTGACTA CAGAGAAACC GTTGGATGGA TAGGTAGGTG 

222351 CCGTGTTTGA CCAACTGTTC ATCCCAGAGG CCACGGTTGA GGTTTGCTTG 

2224 01 CTTTTTAAGA GAGCATTTCT CTGCTGAAGA GCGCGATGAT AGTACGATAA 

. 2224 51 GCAGAGGGTA TAGTGGTTAT CGCATTGAGA TAAAAGCAGA TTTAGGAAAA 

222501 GGCGACGATC CGCAGGAGCT CCTGAAATTA GAAGGCGGTC TTTTGAAGAG 

222 551 AAAAGCACAA TAGGTACTTT CCCTATCAGC TGCGATAAGG TTTTTATAGG 

222601 AAGTTGGTTA TAGCAGATTT TTTTTCCTTG CTTGTCTGTA TAGATGGAGA 

222651 GAGCTTGGGG AAGGTGGTCT TTCTCAAACT GTGTTTCTAA GAAGAAATGG 
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222701 GAAGATCCGA AGGTGATGGT ATCTGTGAGA TGTTGCGTGC GAAAAGACCT 

222751 TCCCAAGGAC AAAACATAAA GCGCTTCTAG GAGGTTTGTT TTTCCTTGGG 

222801 CATAATTGAG TTTAGGAGCC AGTGAGATTT CTAAATCACT GTGGTTACGA 

222 851 AAATTTTTTA GCTTCAGAGA GCAGATTTTC ATAAACATCG GCAGGGTAGT 

222901 GACCTAAAGG ATTCTCCTTA GGGAGTTTAT TAATCATCAT GTAGCCTCAT 

222951 AGGCATGATG ACAAATAATC CTGAGGCAGA ATCGGTAATG ATTCCAGGAT 

223001 TATAGGAATC CGAGATCCCT AAGCTGACTA ATTCATCCTT ACTATGCTTC 

223051 AGGATATCTA AAAAGAAAAA GGGATTAAAG GCAATTTCTA GGAGTTCGCC 

223101 AGAATAATTT ACAGCCATGC TTACCTTTCC TTCACCCACC TTAGTACAGT 

223151 TGGCTGTTAG AGTGAGCTCT CCGGGTAAGA AAGAAAACTT CACGGAGTGA 

223201 GAGGACTCAT TTGTAAATAA AGCCACTTGT TTGAGCAGAG TAATTAGTTC 

223251 TTCGCGATGC AGATCGAGTT TTACGTTGCT TTCTGTAGAT ATGACGGGGG 

2233 01 AGAAATCTGG AAATTCTCCA GAAAGAAGTT TTGTGATCAG GAGAGTATTG 
223351 TCACATTCAA CCGCAATCTT ATCTTGATCC AAGAAGATCG CAGCTTCACC 
223401 TTCATCGGAG CACATCTTTA TAATTTCTTC TACTGCTTTG ATAGGAATAA 

2234 51 TATATTCCCC AGAAAAACTT TTATCTAAAG TAACTTCAGC ATCTATTTTT 
223501 GCTAAACGCT TTCCGTCAGT CCCTACGATG GTAGCCACGC CATTGGCGAT 
223551 AGCAAGCAGG ACTCCAGTAA GAACATAGCG GCTTTCTTCT CTAGATACAG 
223601 CGAATGAAGT TCTCTGTAGC ATGGTTTTTA GCTGCTCTGC AGGCAAGGAA 
223651 AAACGCAAAG CATTTTGTAT ATCAGGGAGC ATGGGGAAGT CTTCTTTTTC 
2237 01 CATGCTGAGT AGGCGAAAGC ATGAAGATCC CGAGGTGATT TGTGCCATTT 
223751 CCCCTGCTGA AGAGGAAATT TCTAAATTTG CCTCTGTTAA TTCTTTTACT 
223801 AATTGAAAAA ATCTCTTGGA GGGAATGGAA ATAGCGCCTT TCTCATAGAC 
223851 TTTAGCTTTG GTGACGCAAC GTGTGCTCAC TGTCAGATCC GTAGCAGTGA 
2239 01 AAACTAATTC ATCATTATAA GTTTCAATCA AAACATGGGT GAGTACTGGA 
223951 ATAGGTGTGT TTTGAGGGAC GACACTTTGA ATTTTTTTGA TAAGGTTTCC 
224 001 TAGCTCATTT CGGGATACAA CGAATTTCAT ATTTTCCTAT AACCTGAGTC 

166 



W O0027994 [ file:/A\dcwas03\firmdata\lp\Fo1eyPat\PatentDocuments\WOQ027994 .cpc] 



Page 29- of 330 



WO 00/27994 PCT/US99/26923 

224051 TAACTTTATA TGAAGGGCTA CGCCTCTCTA GGAGAGACGG TAGAGGTACG 

224101 ATCGGGATAT TACTACAAAG TGATTTTCTG GTCTTCTAGA TATTTTACTA 

224151 ATAAAAAAGG CTTTAGATTG AGGAAATCTT CCCTGGAAAT CAAGGAAAGA 

224201 GGATTCTAAT CATTGTCTTA AGAAGGAAAA ATTGCTTTTT GTTATACTGG 

224251 TTTTTGTCCC CAATTATGGG GGAGGATCTT ATGGCACAAA .AAGAAATTGT 

224301 TTCTAATCGC AAGGCTCTGC GTAACTATGA AGTTATAGAG ACTTTAGAAG 

224351 CAGGCATCGT TTTGACTGGG ACTGAGATTA AGTCGTTGCG CGATCATGGG 

224401 GG AAACCTCG GTGATGCTTA TGTCATTGTT TCTAAAGGTG AGGGGTGGTT 

224451 ATTAAACGCG AGTATTGCTC CCTATCGGTT TGGAAATATC TATAACCATG 

224 501 AGGAGCGTCG TAAACGTAAA CTCCTTCTTC AT AG AT AT G A ACTTCGTAAG 

224551 TTAGAGGGTA AGATTGCTCA AAAGGGCATG ACTTTGATTC CTCTGGGAAT 

224601 GTTTCTGAGT CGCGGCTATG TTAAGGTACG TTTGGGTTGT TGTCGTGGGA 

224651 AAAAAGCTTA TGATAAGCGT CGTACGATCA TAGAAAGAGA AAAGGAACGT 

224701 GAAGTTGCCG CTGCTATGAA GAGGCGCCAT CATTGATATA GGTTAGGATA 

2247 51 TGGTGTTCTT CAGCCCACTG TTTTGCTTCT ATTTTAGAAT CAAAAGTCAT 

224 801 GAGGACTGTG GCAATAGCGT CGGCGTATGC GGAGCTCGGA TGGACTACTG 

224851 AAACACTTTG GATAGGATAG GAGCTTAGCT CTAGGGGTTT CCCTGTACGA 

224901 GTATCAAGAA TATGGGTGTA AATTTTTCCT TCAACACACC ATTTTTGAAT 

224951 ATGATTTCCA CTTGTTGCAA TTGCCATATC ATCGATATCT AAGATCGTAC 

225001 CTGCTGCTTC AGAAAAAATA CGCCAAGGTC TTCCCGAGGG ATGATGCCCT 

225051 GACGTTTTGA TCTCTCCTCC CCACTCTACA TAGTTGTTCG GACAAAAGGT 

225101 ATTGCAAATT TCATTTAGAC AATCTACGGC ATAACCTTTG ACAACACCAC 

225151 AGAGGTCGAT -TTGAACATGA GGATTCTTTT TGATTAGAGT TTTTGTGTTT 

225201 GACTGAAACT CCAAGTGTTG CCAGCCCATG TCTTTATAAT GTTGTTCCCA 

225251 AACGTCTTTA GGGGGGAGGG TTTGACTTTT GAGATGTAGA AGCCATAGGG 

225301 TTTTTAAAGG TCCTACAGTA GGGTCAAAAC GTCCTTCTGA AAGTTTGTAA 

225351 AGTGTATCTA CCTGATCTAG AAACTCGGAA AGTTCTACAG ATAAAGTTAT 
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2254 01 GGGGACATCT GCTGGAGCTC GGTTGATTAT CGAGAGTTCA GAATAGGGAT 

225451 TCCAGTTGTT ATAAATCGAG TCGATCTTAT GAAAGCATCT ATCAATTTGT 

225501 TGGGATAAAG ATGCTTTTTC TTTTGCGGAT AAAGAGGTTC CCAGAACAAT 

225551 GCGATAGAAG ATTGTCATCT GCTCTCCTTC GATTGTTGTC GTTTTTTGAG 

225601 AGCATGAACA GAGTCCAAGA CATAAAAGAA CTAAGAAAAA TTTTGGTAAC 

225651 ATCGCCATGT ATTGCTCATG AGCATAGCGA CAGTCATGGG ACCAACGCCT 

225701 CCAGGAACTG GAGTGATTGC TGCGCATTTT GTCACAACGT TATTAAAATC 

225751 TACATCCCCA AGAAGAGTAT AGCCTTTCGC ATTGTCTGCA GGGACTCTTG 

225801 TTGTTCCTAC ATCTACGATC ACAGCATGTG GGGCTACCAT AGTTTCCTTT 

225851 ATAAAAAGCG GTGCTCCTAG AGCAGCAATA ATGATATCAG CTGTCTTTAA 

225901 GATTTCTGGG AGGTTTTCCG ACTGGCTATG AAGAACTGTG ACTGTACAGT 

225951 TAGTTTGAGG ATGCTTTTGC ATCATGAGGG CCGCTAAGGG TTTCCCCACG 

226001 ATGTTGCTTC TCCCTACAAT AGCGGCATGG CGGCCTCGAA GAGGAATTTC 

22 6051 ATAATAGTTC AGGAGTTCAA TAATTCCTGC AGGAGTGCAG GGTAGAAGTC 

226101 CATCAAAATT TCCAAGGAGC AACTTTCCCA TGTTCACAGG GTGAAGCCCG 

226151 TCCACATCTT TGTCTGGGGA GATCGCTTGG AGAATCACTT CGCTGTCCAA 

226201 GTGTTTGGGC AAGGGAAGTT GCACGAGGAT GCCGTGGATG CTAGGATCTT 

226251 GATTCAATCG TTCTATGAGC TTAAGGACTG AGGAGAGGGT AGAGTCAGAG 

226301 GGTAACTTGT GCGCTTTGGA GATAATTCCG ATTTCTGTAG CTTTTTTGAC 

226351 TTTCATGCCA ACGTACACCT CAGATGCGGG GTCATTGCCA ATCAGGACCA 

226401 CAGCAAGCCC CGGAGAGGTA GGACTTTGTG AGATTTCCTC TTTGAGTCTC 

2264 51 TGAAGGATTT TTTCAGCTGC AGGAATCCCT CTCAGTAACA TACCAATCTC 

226501 CTAAAAAAAG ATATCATTTT ATGGATTCCT GAGTTTATTG CGCAAGGAAG 

226551 TATATAGATT ACAGAAGTGA AGATGAGATT ATCTAAGAAA GAACATTTTA 

226601 GAGGTATCAT TAGCACATCT ATTGACCAAC GGACTTCATG TCCGAACAAG 

226651 GCGCACATAG GAATGGTTAG GAGTAAAAAG ACTCCATAGA AAATCATATT 

226701 CACAATGACC AAGGCGGCTT TGCTCTCTTT TGAAAAAATG AGGTGTGCCT 
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226751 TATGGGTGAT CAGAGCAGTT AGAGGGTAGA GCAGTAGAAA TACACCGCGG 

226801 CTTCCTAAGG CAAGATCACT TAAGACACCA CAGCCTAGAG CGAGTACCAG 

226851 TCCCTTATCT TTTGAGAAAC AATAGAAACA AAGAACAATA TAAGGGGAAA 

226901 AAAGGATCAG ATGTGCTTTA GGGAAGAGCG TTGGAAGGGT GAACATGGAT 

226951 AAAATACACA AAATAAAGCA TTTGCGCGGT GTTATGTGGA CAGACATGGT 

227001 TTACGCTTTG AATTATGATT ATCGATTTCA AAAGGTTTTG CAGACTTGTA 

227051 TATAGCAAAG CATAGGATTC TTAGCTTGAT TTTTTATGTT TATCTAAACT 

227101 GCTTTTAAAC AATTATTTAT ATAATATTTT ATTTTTAGAT TAAAATTAGT 

227151 TTATTCTTAA TAAGTTTTTT AATTAAAATT AGTTTATTAA GTTGATTTTT 

227201 TTGTTTTTGT ATTTTTTATG GGGAAGCCTA AGAAGAGCAG AACGGATAGG 

227251 GCTTTGGCTC AGGAGATTCA AAAGAAATCA ACGGAAGTGT TGAAGAAGCC 

227301 TGCGCGGATA AAAGCTAAAA ATCGTCGTAA ATTTCTTATT GCTAAGGAAC 

227351 AGAAAACTCT TAAACACCGT GCTCAAGAAT ACGATCAGTT AGTTCGCTCT 

227401 CTCTTAGATT CTCAGAAGAA GGACACCGAT AAAGTTTTGA TTTTCAATTA 

2274 51 TGAGAATGGG TTTGTTTTTA CTGACAAGGA CCATTTTAGT AAGTACTCTA 

227501 TCCGTCTTTA GGCGGGGTTA AATTAAAAAA AAATCGCAGA ACGGGACTAT 

227551 CCCGTGATGG AACGACAGGA AGCCTGGGTA ATTAATTTAG AGTTAGGAAC 

227 601 TGTCTCGGTA TTAAGATCTC CAAAATCTTA ATACCTAGAC AGTCGGATCC 

227651 GAGTGATCCC TCTAGCTCGA CAGCGATGCA CTTTGTTGCA TCTACTCGCT 

227701 GTTCTCTGTC CTCCCATAAG TTTTTTTACG CAAGGTGTTT CACCTTGCGT 

227751 TTTTTTTTGT TTTTTAGATT TTTAAAATTT GTTTGTAAGT TCTTTTTTTT 

227801 AAATTAATTA TTTTTTGTTT TAAAATTATA AGTAGTTATA AGTTTTATTT 

227851 TTTAAATTCA GAGAATCATT ATGGCAGTTT CAGGTGGCGG AGGGGTTCAG 

227901 CCTTCTTCGG ATCCAGGAAA GTGGAATCCT GCTCTGCAAG GAGAGCAGGC 

227951 AGAAGGCCCG TCTCCGCTAA AAGAATCTAT ATTTTCTGAA ACCAAGCAGG 

228001 CCTCCTCTGC TGCGAAGCAG GAAAGCTTAG TGCGTTCAGG ATCTACAGGA 

228051 ATGTATGCAA CAGAATCTCA GATAAATAAG GCTAAGTATC GTAAAGCTCA 

169 



WO0027994 [ file:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocumen t s\WO0027994.c pc1 



Page 297 of 330 



WO 00/27994 PCT/US99/26923 

228101 AGATCGATCA TCAACCTCTC CAAAATCCAA ATTGAAAGGT ACATTTTCTA 

228151 AAATGCGCGC TAGTGTGCAA GGATTCATGT CAGGATTCGG ATCTCGGGCT 

228201 TCGAGAGTGT CAGCAAAGCG TGCTTCCGAT AGTGGTGAGG GAACATCCTT 

228251 ATTGCCGACA GAGATGGATG TTGCTCTAAA GAAGGGAAAC CGTATTTCAC 

228301 CTGAAATGCA GGGATTTTTC TTAGATGCTT CGGGTATGGG AGGGAGTTCC 

2283 51 TCTGATATTT CTCAGCTTTC TTTAGAGGCT TTGAAATCTT CAGCATTTTC 

2284 01 AGGTGCCAGG AGTTTAAGTT TAAGCTCTTC AGAATCTAGT TCCGTGGCTT 
22 8451 CGTTTGGATC TTTCCAAAAG GCCATAGAGC CTATGAGTGA GGAGAAGGTA 
228501 AATGCTTGGA CAGTGGCTCG TTTAGGAGGG GAGATGGTCA GCTCTCTTCT 
228551 CGATCCCAAT GTTGAGACCT CATCATTAGT GCGCAGGGCA ATGGCAACAG 
228601 GCAACGAAGG CATGATAGAT CTTTCTGATT TAGGACAGGA AGAGGTCAGT 
228651 ACAGCCATGA CATCTCCCAG AGCAGTAGAA GGAAAAGTAA AGGTATCTTC 
228701 TTCTGATTCT CCAGAAGCGA ATCCAACAGG AATTCCAAAT TCTAATACTT 
228751 TAGAAAGGGC GGAAAAGGAA GCAGAGAAAC AAGAAAGTCG AGAGCAGTTG 
228801 AGTGAGGATC AGATGATGCT TGCACGTGCT ATGGCTGGGC TTCTTACAGG 
228851 GGCAGCGCCT CAAGAGGTAT TGAGTAATTC TGTTTGGTCT GGTCCTTCTA 
228901 CAGTATTTCC TCCTCCCAAG TTTTCAGGAA CTTTACCCAC CCAGAGATCG 
228951 GGAGATAAAT CAAAGCATAA ATCTCCAGGA ATAGAGAAGA GTACGAACCA 
229001 TACGAACTTT TCTCCTCTTC GGGAAGGTAC TGTGAAGAGT GCTGAGGTTA 
229051 AAAGTTTGCC TCATCCAGAA AGTATGTATC GTTTTCCTAA AGATAGCATC 
229101 GTTTCCAGGG AGGAACCTGA AGCCGTTGTT AAAGAATCTA CGGCATTCAA 
229151 AAATCCAGAG AATAGCAGTC AAAACTTTCT CCCTATTGCT GTGGAGAGTG 
229201 TTTTCCCTAA GGAAAGTGGT ACGGGAGGGG CTTTAGGAAG TGATGCTGTG 
229251 AGTTCCTCAT ATCATTTCCT TGCGCAACGT GGAGTGTCTT TACTCGCTCC 
229301 TCTACCTCGT GCTACTGATG ACTATAAAGA GAAGCTCGAA GCTCATAAAG 
229351 GTCCTGGAGG TCCTCCAGAT CCTTTGATTT ATCAGTATCG AAATGTTGCT 
2294 01 GTTGAGCCGC CAATTGTTCT CCGTTCTCCC CAGCCGTTTT CAGGATCTTC 
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229451 ACGTCTATCG GTTCAAGGAA AGCCTGAAGC TGCTTCAGTT CATGACGATG 
229501 GTGGGGGGGG AAATAGTGGT GGTTTTAGCG GAGATCAAAG AAGAGGATCT 
229551 TCGGGCCAGA AAGCTTCCCG TCAGGAAAAG AAGGGAAAAA AATTATCTAC 
229601 GGATATTTAG GGTTTTAAGT CGGTTTGATG TGATGCGTAT AATTCGTTTT 
229651 GATCCTTATG GTGCGCTATC TGCACAAAGC ATAGCTAAAG ATTCCCGTCA 
229701 AAACTCTCCT TTAGTAGAAA AAATTTCTGA GGAAATTGCT ACGAATGAAG 
229751 CGATTCGGCT TGCCTTGCTA GCTATTGGAG ATCGCGAACA AGAGGAGAAG 
229801 AAACAGAGGC ATCGTTATAA GCTACTCGGA CAAAAGCAAG CCAAGGTCTT 
229851 GCTTTCTCAG TTGCGTCATG TGCATTTAGA TTTTAAAAAA CTATATTGCG 
229901 ATAGTAAGAA AAAAGAAGAT CAGGAAAAAG ACGAAAAAAA CAAACAGAAG 
229951 CGATCTATTA AAGTTACAAA GAAAAAAAAG GGCATCTCTT TAGGGGCTGC 
230001 CGCTTCTCAG GCAATTGCAG CAGCAGCAGA AGCTTGGGTA ATTGCTAGAA 
230051 ATAAAGGAGT CTTAGAAACT GCCTCCACTC TTTTTTATCA AAAGGATGAA 
23 0101 GAGGCCTAGA CATACTTGAA TCACGAGCTA GGCCTTCTTC TGTGACTATT 
23 0151 TTAGATCATC ACGCTGCTTC TTGCTCTTCT ACTGGAAGAC TTTTTTCTAT 
230201 AATTTCTTTT TCTTCATCGT CTACTGTAGG ATGTTCTGAA TGCTTAGCTA 
230251 GATCTTTCCA AATCATTCGA AGTTTCTGAT TTTGTTGTTT GGTCAGGTTT 
230301 TCCAGTATGA TCAAGCTTTC ATCATTTAAG GAGAATCTTC CTTTAGACCA 
2303 51 ATTTATAGAT CCTGCAAGTA GAGTTTTATT ATCTATAACT GCAAACTTAT 
230401 GGTGAAGAGT ACAGGGTGCG GTATTTATAG AAACAAAGTC TTTATTGATA 
230451 TTTAATTGTC GTAATTGCTT AAAAGTAAGT TTGCTATGAC TTCTATCAAT 
230501 GATAATATCT ACATGGATTC CTCGTTGTTT TGCTTGATGT AAGGCTTGAA 
230551 TAATCTCCGA GTGGGTCAGA GCAAACATAG CAACTTGGAT GGTTTTCTGA 
230601 GCTGTCTGGA TTTTTTCGAG TACAGCTTGT ATTGCAATTT TACGATCTTG 
230651 AGGAAGAACA AAATACTTTC CTGTTTGATC CTTTATAGAA AAGTCTCCAG 
230701 AGGTATTTGT GATAATGAGA TCACAGAGCT CCGAGCTATG CATTCCTAGA 
230751 ATGAGATTAT TATCTAAACG TAGAGAAAGA TTGGTGTAGT TCGCAGATCC 
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230801 


TAGCCAAGCA 


TCTTU Li inl 


\- l A I uunnnv 


APPTTTTTGA 


TGCATCAGTT 


230851 


TACGCCCTGC 


TGCjAuVj 1 1GG 


TPP 2iP*T 2k Tx A p 


TTACATTGCT 


GGCTTGCTTT 


230901 


AAGATTTGGG 


GAATTTTAAA 


Hill uA i Ao 


TAPATCGTAA 


CTTTGTTTTT 


230951 


TGCTTGAGCT 


TGTCGAG1 I A 


AAV- 1\. lul lu 


PATCTTGGGT 


TCTGAGAGGT 


231001 


TATAAATACG 


TAGGAAGATC 


TC TTC ATC AG 


P PTPT T CT Tx*V 


AGCATCGCAT 


231051 


AGAATTTTAC 


GCATGTCCTC 


A ITGCATu GA 


TTTPAPTAGA 


TGATAGCTTC 


231101 


TTCAGACTTT 


AAAAAAGTCT 


TAAaAGTGTC 


a. p p zt pp a pp a 


GCTCTTGCAA 


231151 


AAATTCCTAC 


TAAAATCAAC 


GTGCTAATAA 


TZv Ap Ap Ap. A*p 


TTTTAATTTA 
t x i x ni\ i i i 


231201 


TCTTTTTGTC 


TTTTATTCAT 


TTTTTACTCT 


»T*A A A A t» A PpT 
X AAAA 1 AVpV- 1 


lllllOllll 


231251 


TATATATTTT 


TAAAAATAAA 


AATAACTAA 1 


rprp * a TT fiXtP 
1 1 AA 1 1 Annv. 


PTTTAATAPA 
ol 1 1 rvrv i nun 


231301 


ATAAGATTCT 


TATTAAATAT 


A A A A JlPTTTr 

AAAAAG 1 1 I G 


TTT ATT PPGG 


AAATATGCGT 


231351 


GCCATCGTTC 


GGAAACCTTT 


ibl 1 1 lul At* 


ATPPGTPGAC 


PTCTACTTCT 


231401 


TTAGGATAGG 


AAGGCTTCAT 




TxTh APPA APP 


GAAAGTTGTA 


231451 


ATTAGGACGG 


TGAGTAGCAA 


AA 1 GGG 1 G 1 G 


P 2VPPPPPTPA 
oAov.Vjv.Vj 1 o/\ 


AGATCATTTG 


231501 


CTGGGGCACA TCGTGTGAAG 


G 1 CC I CGAGA 


p a a a RTPTTT 
uAAAA 1 V- 1 1 1 


TTPAPTTTGA 

i i V— ill van 


231551 


ATGGTTTCTC 


TCAGATTATC 


GGCAAGGATA 


a TP a p a rcicc* 


ATPATTTTAP 
t\ 1 vjn 1111 /\o 


231601 


ATCTGGATGG 


TGAAGGAGAG 


* fp(T>rp * AT»A/^A 

ATTT AAT AC A 


TPPPTPPTPP 
1 tou 1 V.V- 1 V,o 


APPPATPTTT 

nVJVjOrtl will 


231651 


CCAACACTAG 


GCAACCACGA 


/"* A A A A tr^r* 7s. r* 

CAAAAGGGAG 


/^r* A TP r PP*T ,r PP 
LoA lultl lb 


AAPTPPATGG 


231701 


ATTTTTCCTC 


CCTGATATCC 


A I GGGGAAGG 


TPTPPPATPP 
1 L 1 Loun 1 oo 


PTTTTPPTAT 
\_ 1 1 1 lV>V.lnl 


231751 


TCCCATGAAG 


ATTCCCTTGG 


AGCCCTTATT 


T» A A PPTTP/^T 
1 AAuL 1 i oVj 1 


PPTPT ATAPT 
V.V. Ivilnl MO 1 


231801 


CTAACGTATC 


GTTTGCAGTT 


rnpm« A /** A A A A 

T CTG AO AAAA 


"PAATAAPJi TP 
1 AA 1 AAoA 1 V- 


l V.VJU 1 V. 1 VjVJV. 


231851 


TGTAGACGCT 


CTAAAATGGT 


TTCTAGAACC 


a r*r* r* a p a a p p 
ACG Cj AG AAL L 


TPTPP TxCl h PP 
lul v.onV*nV3V* 


231901 


CACCTCTTGG 


TCTGTGACCA 


TTAGGAATTT 


CGTTAGGGAA 


AGTTGGCCCT 


231951 


CTCCAAGAAT 


TCTAAGAGCT 


GTGGTTAGAG 


ATTCTCTCCA 


ATAGCGTTCT 


232001 


TTAACGACAG 


CCGCAGTCAG 


TGCATGAAAC 


CCTGATTCTC 


CGTAACTTTT 


232051 


AAGTCTACGC 


ACACCAGGCA 


TAACTAACGG 


AAATAAAGGG 


GAGAGGTATT 


232101 


CTTGGAGTTT 


GTTCCCTATA 


TAAAAATCTT 


CTTGGTAGGG 


TTTGCCGACT 



WO0027994 [file:/A\dcwas03\firmdata\lpVFoleyPat\PatentDocuments\WQ0027994.cp c1 



Page 300 of 330 



WO 00/27994 PCT/US99/26923 

232151 ACTGTAGCAG GATAGATTGC ATCTTTTCTG TGATAGATTT TATGACAGTG 

232201 GAATTCAGGG AAGTCATGTT GGAGACTGTA GTATCCGAAA TGATCGCCAA 

232251 AAGGACCTTC AGGACGACGT TTCCCGGCCG GAGATTCTCC GACCAGGATG 

232301 AATTCCGCAT CGTAGAGTAG AGGGTGGGGA TGGTCGTTTG TTTTTTTATA 

232351 AAGGAGCTTC GCTCCTTGGA GGAAGGTAGC AAAGAGAAGT TCCGAGACAT 

232401 TCTCAGGTAG GGGGGCAATC GCAGAAAGGG TTAAAAAGGG GTTTCCAGAC 

232451 AAAAATACCG AAACAGGAAG GTTTTGCTTT TTTTGCTCTG CTTCATACAG 

232 501 ATGCATCCCT CCGCCCTTCT GGATTTGAAA ATGGAGGCCC ATGGTGTTTT 

232 551 GATTGAACCG TTGCACGCGA TACATCCCAA GATTGGGTGT AGTAAGAGTC 

232601 GGCGATTCCG TATAGACAAG AGGAAGTGTG AGAAAGGCTC CACCATCTTC 

232651 AGGCCAGCTT GTGAGTAAGG GAAGGTGATC TAAGTTAACT GAGGACATAG 

232701 AAACAAAAGG AAAGCGACGG AATCGAGCTT TTTTGAGCCC TAAAGAGCTT 

232751 ATTCTTTTTA ATAGATCCCG AGATTTCCAT AGAGAAGAAA GCTTTGGTGT 

232801 AGAAGAAATA AGGTGGGCAA CTCGAGCGAT GAGGTTATCA GGAGCTTGAG 

2 32851 AAAAAAGTTG GTCTACACGA TGTTTTGTTC CAAAGAGATT GGTCAGGACT 

232901 GGGAATGACG ATCCGATGAC ATTATGAAAA AGAAGGGCAG GGCCTTGATC 

232951 TTCAATAACA CGACGATGAA TCTCAGCTAA CTCGAGGTTA GGACTTACGG 

233001 GAGCAAAAAC ATCAATAAGT TGTTTTTGTG AACGAAAAAG AGAAATATGA 

233051 CGCCTTAAGA AAGACATAAA TATTTCCCTA TACTTAAGTT AAATTAAAAA 

233101 TTTTTACTTT TAGCTCTTTC GAGAACTTTC TCTAATCCGA GCTTATCAAT 

233151 GTGACGAAGA GCGCTAGCAG AAATTTTAAG CTTAAGAAAA CGGTTTTCTT 

233201 CTGTAGACCA TAGACGCTTG GTCAACATAT TAGGGAAAAA TCTTCTTTTA 

233251 GTCTTCCCTG "TCACTTTCAA ACCAATTCCT TTTTTCTTTT TAGCAATACC 

233301 TCGAAGTGTA TAGCTATAAC CACGGCGAGG TCTCTTTCCT GTAAGTGGGC 

233351 ACTTTCTTGA CATATTCTTC CTATGAATTC TCTAACATCC GCTCAATAAA 

233401 GCAACTCTAT GAAAAGGAAT CTATACTAGA GCTTTTCGAA TATATGGGGA 

233451 AGAGGTTTTC TTAGAAAATG TGATTGTCTG TTGTTGATAA TGAAGGAATT 
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233501 CTTTTAAGAA AGACGGTCGT TCCAGGAAAT TATTATAAGG TTCTTACTAT 

233551 AAAAAGACTT AAATGTTTTA TTGCTATCCT TACAGTCCTG TAAGGATCTT 

233601 CTCAATGTAA CCATTAAATT TTTTATGAAT AGCGAGTTCT TCTAAGGAAG 

233651 GCCGAACTCG ATACGACCAA TTCTTTTTAG AAATTGTCCC AGGTGTATTA 

233701 ATGCGTTCTC TTTGTAGATT TTTTGATACT AAATCAGGGC AGAGGGCGAG 

233751 ATAATCGTTA AAGAGGTTGA TATGAAAGAT AGATGCTGAT TCATGAGAAA 

233801 GTTTTAAGAT GTCTATTTGA GTTTCTGTAG TCAGGGTTTT TTGAAAAGGA 

233851 AGATGTAGAA ATTTAGCAAA TTGCTTAGCT TCCTTAGGTG AATTGAGCCA 

233901 CCATTGGGCA AACGTATCAG AGTCGTGGGT AGAGAGAGTG GTCACAGAAA 

2 33951 GTGGATTATA ATCTTTTAGG GGAATGAAGG CACTGTCGCT TTCCCAGTTG 

234001 CGTTCCCATC GTGGAATCCG GGTTCCACAG ATTCCTAAGT GTGTTAATGT 

2 34 051 CGTTTTGACG TCTTGGGGTA TAATCCCTAA ATCTTCTCCG ATAGGTAACA 

234101 TAGAAGAGGC TCCGAGCATA GTAGAAAGGA TCTCCGTGCC CTGCTTTATA 

234151 TAGTCTTTAG GATTGTCTGG AATGAACCTT CCTCTTCCTG AAGAATCCCA 

234201 AATCCACAAA CGGAAAAATC CTATAATATG ATCTAAGCGA TAGACGGAAT 

234251 AGAAGTTTTG AGCATATCGC AGACGCTCTT TCCACCAAAT GTAGTCGTCT 

234301 TTGGCAAGTT GTGAAAAATT ATAAATAGGC AGATGCCAGT TTTGTCCTTC 

234351 AGAATTGTAG AGGTCAGGAG GAGCTCCTAC AGACCTTGAT GAAGAAAAGT 

2344 01 AGTCTCGGAA ATACCAAACA TCACAGCTAT CCTTGCTAAT AAGAATAGGG 

2344 51 AGGTCTCCTT TAAGCAGGAC GTGGTGTTGA TCTGCATAGG CTTTCACTTC 

234 501 GCAGAGCTGT TGGTAACAGA GAAACTGTAG ATAGGAAAAA AAGAGGACTT 

234551 CATCATGGAA TTTTTTAGTT AAGTCCGGAA AATTCTCCTG ATCTGTGAGC 

234601 GACTTCGGCC AGTTATTAAT AGGTTCTCCG TGCATATGAT GTTTGATTGC 

234 651 ACGAAAGGTC CCATAGGGAT AAAGCCAATA GCGCTCGCTT TCTAGAAACT 

234701 CAGAAAAATT TGAGTTTCCT TCGAGGGAAG ACTTGCAACA TTTTTGGTAG 

234751 TACTCTCTTA AGAATGCCCA TTTTTTTTCT TTAACTTGAG TATAGCTGAC 

234801 TGATGGAGTC GAGCATAACT CATGCATATC TTGAAGTTTC TTGGCAACTT 
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2348 51 CAGGGATGGT ATCGATATTT GGAAGAGAGG ATAGGGAAAG GAATAGGGGA 
234901 TTCAGGGCTA CGGAAGAGAT GCTGTTATAG GGACTCGTAT CTTCACCAGT 

2349 51 ATCATTTAAA GGGAGAAGCT GAATAACGCT GAAGCCCTGT TTTTGGCACC 
23 5001 AAGAGATCAG AGGAATGAGA TCTAAAAATT CACCGATTCC ACAGCTATTT 
235051 TTTGTGTGTA TTGAAAATAG TGGGAGATAA ATCCCGTGTT TAGGAGAGGT 
235101 TCCTATAAGT TTCCAAGCAT GTGCTGAGGG TGAGTGTTTT GTGTATTTTA 
235151 AAACATTCAC GCGACGTAGT AGATTCCCAA AACATGAAGG TCAGGAAGAT 
235201 TCCCTGTTCT TAGGGATTCC ATCCAAATCT TAGCATGCAA AGAAAAGAGC 
23 5251 TTGAGATACT CGAATAACTT TTCTCCATTT AGGTAGTTCA TACTCAGGAT 
235301 ATCTGAAAGA TAGAGCTGTT GAGTGACCTC ACCGTAGCCT AGAATTCCCT 
2353 51 TGATGCTGGA TTGGAACGAG CCATTTACAG AGAGAGCAGC TTTGAAAATA 
235401 CGCTCGCGAA ATACGTTTTC AGGAAGAGTA CCTAGTAGTG TCGATACTGC 
235451 AAGATCTCCG GAATTTCCAT CTTCTTCTAT TTGCACAGGG ACATGGGTAT 
235501 CGCTGAAACG GATTAAACAA GAGTTATTTT TATCTGGAGC AAGTGTCGTA 
235551 TTTAATAGGG GTGCTAAGGA TTCTAGTAAT TGCTCGTATT GGTTTTGCAT 
235601 GGCAATCCTT TTTTAATCAT GACCAAGGAT AGGGTTTAGG GAAGTCTGAT 
235651 GCTTTGGGAT AATCTTCATT GTTTATATTT ACAGCATCTA AAGCATTAGC 
235701 AATCATAGCT CCTAATTGCT GACGTTTGTC TGCTGAAGAG AAAAGGCGTG 
23 5751 ACGACGTTTG ACGTAAAGCA GAAAAGAATA AGTTCAAGAC ACCGGTCACA 
23 5801 GAATCAACAT CGTCTCCTAT GAGATTGCGG ACTTCTCGTT CTACTTTAGA 
23 5851 TGCTGTTGGG AACTTATCGT TAATGATTTT ATGGTAGGAC TCAGCTACCT 
23 5901 TCACAAAGTT TAGATCAGAA GGAGTTTGGA TTCCCTCAGC TTTTAAGCTA 
23 5951 TCGAGTAAAA TAGGAACGCG ACTTTCAAAG TAATCGTACG AGGTAAGAAC 
2 36001 TGCTTGCAGG TTACGAGTTT CTGTCATGAG AACTTGTAGT TGCGCACTGG 
236051 GTACGTAGGG ACCCTGCCTT TTTAATTCTG TTQCCATTCC TTTCATTAGA 
236101 AAGGAGCTGA CAATAGCCAT ATCTTGGTAG GTATAGCGGT CTTGAAGCAT 
236151 AGAAAGTAGC TGATCACAGG TATGTGTGTC TCCAGTCACT TCTAAGTACA 

175 



WO0027994 f file:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\WO0027994.cp cl 



Page 303 of 330 



WO 00/27994 PCT/US99/26923 



236201 


AAGAGCGAAG 


CCCTGAAGGA GAAACATTCA GTTGGTCTGC ATATTCTTGA 


236251 


GAGGCAAATA 


AGATGTTTTT 


CGCACCAATA 


GCAGTTCGTC 


CGAATTGCTC 


236301 


CGTATGAGTA 


TTCCTTGCTT 


GGATAAGCGC 


TTCTTTTAAT 


TTACCTTGGG 


236351 


AGGGTGGAGT 


CGTTTGAACC 


AGGTAGTCCA 


AAGCTGTGGA 


TTGCAGAGCT 


236401 


GGGTCTTTAA 


TTTTCTCTTG 


TACAAGAGCA AGAATGTCTT 


CTGGAGAAGC 


236451 


ATCGTCTCCT 


ATTGCATCAC GCAGGCCGCG 


AAGTTCTTGA 


CCAGAGATTT 


236501 


CAGAATTCCC 


AGAAGCATAC 


TTATCAGCAA 


GATCTGTGTC 


AGGCTTCTCT 


236551 


TCTGTAGATT 


CAGATTTTTT 


CTCAGCCTTT 


CCAGCTTCTC 


CTTTTTTCCG 


236601 


AGATTCTAGA 


GTTTGAAACT 


TCTCTTCCTT 


TTTTTTCGTG 


CGTGTTGCTG 


236651 


CTGCGGGATT 


TGTCAGGTCC 


TGAGATTGTT 


GAATCATGTT 


CATCTCAGAA 


236701 


CCTTCTTGGC 


TGGCTACAAC 


TTCTGCTGCA 


TCTGCTTTTG 


CAGCTGCAGC 


236751 


TTCTACAGCT 


GCAAGGTTGA 


CACCCTGAGT 


GCCTCCTAAA 


CCACCTGTGC 


236801 


CTCCTGATGC 


TGCCATATGC 


CTCCTATGAG 


CGACAACGTA 


TCAATTAGAA 


236851 


AATCTGAATT 


CTTCCTAAAG 


GCTGGATGCG 


GATTTCTGGT 


AGGATTTCTT 


236901 


GATAAGAAAT 


CACAGCAATG 


TCAGGGAATT 


CTGTTTCTAT 


TAATTTTCGT 


236951 


ACATATCTTC 


TTACATCAAT 


TGCTGTCAAT 


AATACTGGTG 


GTTGGCCTCC 


237001 


TGCAGGTGTT 


GGCGTGATCG 


TATTCCTCAT 


AGATTTTAAA 


ATTAGGTTCA 


237051 


CAGAATCAGG 


ATCTAGAGCA 


AGGTAAGAAC 


CTGCCGATGT 


CTGTTTAATT 


237101 


GCTCCACGAA 


TCATCTCTTC 


AATTTCTGGA 


TCTAAGAGAT 


AAACAGAAAT 


237151 


TGCTGATTGT 


CCTTGAGAGA 


ACTTGAAGCT 


GATATAAAGC 


TTTAAAGAAG 


237201 


ACCGTACATA 


TTCTGTAAGC 


AAAACTGTAT 


CTTTCTCAGT 


TTGCGCCCAC 


237251 


TCGCTCAGAG 


ATTCTAAGAT 


TGTACGTAGG 


TCTTTAATTG 


AGATTTGCTC 


237301 


TTGAACCAAT 


CTCTTAAAGA 


TTTCCGTAAG 


CTTTTGCAAT 


GGAATAAGCC 


237351 


TTGTGACTTC 


CTTCACTAAG 


TCCGGGAATG 


AACGTTCCAT 


AAATTCGATC 


237401 


ATAGAACGTA 


CCTCTTGAAT 


TCCCAAAAAC 


TCTTGAGAGC 


TTTTATGGAA 


237451 


AAAGTACGAA 


AGATGGAGAA 


TGATCACTTC 


GAGCGGCGTC 


CAATATTTAA 


237501 


TTGCTGCCTT 


CTCTAGAATA 


GCTTTTGCAT 


CTTCACTAAC 


CCAAGCTGAA 
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237551 GGAAGACCCG CAGCATTCTT ATAGGTAATG AAAGGTAGAT 

237601 GAGATTGTCC TCCACCTCAT TGGTTAACAC ATGGTGCGGA 

237651 CTCGCACATA AGGGACTTCA TTAAGCAGAA TCATATAATC 

237701 AAAGAAGGGG AATCTGTGCG AACATGAATG CCAGGGTATC 

237751 ATCCTGATAG AGAGCTTGCC GCATTTTAGG AATCATATCA 

237801 TTTGTCCTGA TTTTGTCTTG TGTTGGATAA GCTTAGAGAG 

237851 AGTTCTAGAA TTACGGGAAG AGTTAGAGAA TAGTCATCGG 

237901 AACAGTAGCA GCGCCATCAC CAGCAGCCCC TACGGTTGTT 

237951 AGCCACCACC TTTTTTTCCT GCCGCTGATT TCTTAGTCAG 

23 8001 CCTAAGGCAA CGAAAATTAA TGCTAAAATG GAGAAGGACC 

238051 GCCCTTGAAG AAACCAACCC CTAAAGTTGC AGCACCTGCA 

238101 CTCGTGGTTC TTTAACGAGC TGAGTAGAAA TCTCTTTACC 

238151 TTTTTGTCAC TCGATACACG AGTCGTGACA ATACCCGCTG 

238201 CAAAAGAGAA GGAATTTGAG AGACTAAACC ATCTCCAATG 

238251 TGTAGACGTG AGCTGCTTGA GCGAGGTCCA TGCCGTGCAT 

238301 ATCGTCAAAC CGCCAACAAT GTTAATCAAA GAGATAACGA 

238351 AACGTCTCCT TTGATGAACT TCATGGCACC GTCCATGGCT 

238401 CACTTTCCTT TTGGATTTGA GCCCTTTTAT CACGAGCTTG 

2384 51 ATCATACCAG CTCGTAAGTC CGCATCAATC GCCATCTGTT 

238501 CGCATCCAAT CGGAATCGGG CAGCAACTTC GGCAACACGC 

238551 TAGTTACTAC GATAAACTGA ATGATTGTAA TAATGAGGAA 

238601 CCGACCACAT AGTTCCCTCC AACCACGAAG TCTCCGAAGG 

238651 ATGACCCGCA TACGCTTTAA GGAGAATCTG TCGAGAAGAG 

238701 TCCCCAAGCG GAACATCGTA GTGATGAGGA GCAACGAGGG 

238751 AGCTGCAAAG CACTTGGAAT ATAAAGAGCC ACCATCAATA 

238801 GATCGATAAG TTGATGGTGA TCATCAAGTC AACGATAGGC 

238851 GAATAATGAT CATTAAGACA ACGCCCATCA TCCAAAGAGC 
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TATAACGGCT 
GGAATTTTTC 
GTATCCTTCT 
GGATTCCGAT 
TCAACAAAGC 
ATCTTTTCCA 
GATTATCCCC 
GAAGCTCCTG 
TAGGAGAATC 
ATAGAGGGAA 
AGGAGTAGTG 
CAAGTTCGTA 
TCAACGCAAT 
GAGAGAAGAG 
AGCCACCCCA 
TACCAGCGAT 
CCGTAGAGTT 
TGTGGCATCA 
TACCTGGCAT 
TCGGCACCCT 
GATAATGAAC 
CCTGAATGAC 
GAAATATTAA 
AAAAACAGAC 
AGAATACAGA 
GGAGGCAAAG 
AAGGATTAAG 
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238901 TCGCTGGACT TATTGATCAT GTTTAAGGCG GTATCGCCAC CAAGTGTTCT 

23 8951 GCTGACGAAA TTGAGTAGCT TATTCATTAT AAATGATCAG GTTGGTTAGT 

239001 ATTTTTATTA TTAGGATTTT GCGCATTCAG TGAAGTGATA TAGAGTAGAA 

239051 TTTCTCCAAT AGCTTCGTAA GTAGATTCTG GAATAAATTT TAATTCCTTC 

2 39101 CCTTCATCCA AAAGCTGATG TGCTAAAGGT ACGTTTCGCA TAATGGGAAT 

239151 TCCGTACTTT TCAGCTTCAT CAAGTATCCT TTTAGCTCGT AAGTTGATGC 

239201 CCATGGCAAT GATCCAAGGT GCTTTATATT TTTCAGGCAT GTAGCCAATA 

239251 GCAACAGCAA TATCTTTGGG ATTAGAGACT ACGGTGCTTG CATGTTTCAC 

239301 CTGTGATGAC GAGTCTTCAT AGGCAATTTC TTGAGCAATT TGTCGACGAC 

239351 GGCCTTTAAT CTCAGGATTT CCTTCCGTGT CTTTAAACTC CTGCTTAACC 

239401 TCAAACTTCT CCATCTTTAA TTCTTTAGCG AAATTGTGGC GCTGATAGAC 

239451 AAGGTCAAGA ATCGCAACAA TCAAAAAGAA AATTCCTATC GAGGTTACTG 

23 9501 CTTTATAAAA AATTTCTTTG AAGATTTGAG CAGTAATTAT AGGAGAGACT 
239551 CCTGCAGTTT CTATAATTAA AGAGACTTTG CTTTTTAACG TTATGTATAA 
239601 AATTAAGGCT GCTCCAAAAA TTTTTAAAAT CGATTTGATC AGCTCTATGA 
239651 GAGTCTTTAT TTTAAACTTT TGTTTGATGT TCTCAATAGG GTTGAACTTC 
2397 01 TTGATATCTG GTTTAAAAAC TTCGGTAGAA AATGTAGGAC CAACGATAAG 
239751 AAAACCTACA ATGACGCCAA CAACAGCAAC AGCTCCCAGT AAGGGAAGTG 
239801 ATGCTGTTAA AATAAGCATA AGACAGTTCT TAAGATAAAA TAAGGTAATT 
239851 ACAGGATCAT GGCGAGTGGG AGCTTGTGAG AGCATGGAAA CCAGAAAGCC 
239901 ACCTAAATGC TTGAAAAAAA AGGTCGATAG GGAGAAAGCC GTAAACATAG 
239951 AGACGATAAA GGTAACCGCA GAAGGAAAAT CCTGAGATTT TGCTACTTGA 

24 0001 CCTTTTTTCC GAGCATCTCT AAGTCGCTTC GGCGTGGCCT TTTCTGTTTT 
240051 TTCACCCATG CTGTTGCCAA GG TTCGATTA AGAGGACTAT ATTCGGAATT 
240101 ATCTGCAAAT TACCTTATGC TCTTGTTTTT CTCAAGAGTG AAGCCCAAAC 
24 0151 GTTTTGTTCT TAGTAAGATT ATAAATGAAT CTTAGAGATA AAGAAGAATA 
24 0201 GTAGAAAAAA AGAACAATAA CGAAAAGTTG TTGAATTTAG CAATACTTTA 
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240251 GGTCACTTCA AGATGGTTCT ATATGAAGAA AGTATTAGGT AGGGAAAACT 

240301 TCAATCGATT ATAGTGGTTT TTTATCATTT TCAAAGCACT CTTGATTTTG 

240351 AAGGTACTAT TTTTCTAGCT ATTGTGCTTT TTTGAAACAG AAGAGTTGTC 

2404 01 TTTCTAATAT TCGAAAAAGC ATGTCATTAT TTTTATTTTT AGATGTCTTA 

240451 TGAGTCATAC TGAATGTGGA ATTGTAGGGC TTCCTAATGT AGGAAAGTCT 

240501 GGCTTATTCA ATGCTTTAAC AGGAGCTCAA GTTGCCTCCT GTAACTATCC 

24 0551 GTTTTGTACT ATCGATCCTA ATGTGGGTAT TGTTCCTGTT ATCGATGAAA 

240601 GACTGGAAGC CTTAGCTAAA ATTAGCAATA GTCAGAAGAT CATCTATGCG 

24 0651 GATATGAAAT TTGTAGATAT TGCAGGTTTA GTTAAGGGAG CTTCCGATGG 

24 0701 CGCGGGTCTG GGAAATCGGT TTCTCTCTCA TATTCGAGAA ACTCATGCTA 

2407 51 TTGCTCATGT AGTGCGTTGT TTTGATGATC CAGACGTTAC ACACGTTTCA 

240801 GGAAAAGTCA ACCCTGTTGA GGATATTGAA GTTATCAACT TAGAGCTCAT 

240851 TTTTTCTGAC TTCTCCTCAG CAAAAAATAT CCATAGCAAA TTAGAAAAGC 

240901 TAGCCAAAGG AAAGCGTGAA GTAGGAGCTC TCTTGCCTCT ATTTGATACA 

240951 ATTATTGCTC ACTTAGAAAA GGGGCTGCCG CTACGTACTT TAGAATTAAC 

241001 TCCAGAACAA ATTGTGGCAT TAAAGCCCTA TGCGTTTTTG ACCATGAAGC 

241051 CTATGTTTTA TATAGCTAAT GTTGACGAGA GTTCTCTACC AGATATGGAT 

241101 AATGATTATG TTGCCGCTGT TCGGGAAGTT GCTGCAAAAG AAAATTCTAA 

241151 AGTGGTTCCT ATCTGTGTTC GTATAGAAGA AGAAATCGTT TCCTTACCTA 

241201 TTGAAGAGCG CTTAGAATTT CTTATGAGCT TAGGTCTTGA AAAATCAGGA 

241251 CTTCATAGAT TAGTGCGTGC TGCGTATGAC ACTTTAGGAC TGATTTCTTA 

241301 TTTTACTACA GGTCCTCAAG AATCTCGTGC ATGGACAGTG GTTCGAGGGT 

241351 CTTCTGCTTG 'GGAAGCTGCT GGAGAAATCC ATACGGATAT TCAAAAGGGC 

241401 TTTATTCGTG CTGAAGTGAT TACTTTTGAA GATATGATAG AGTGTCAAGG 

2414 51 TCGTGCAGCT GCTCGAGAAT TAGGGAAATT ACATATAGAA GGACGTGATT 

241501 ATATCGTCCA GGACGGTGAT ACTATGCTGT TCCTTCATAA TTAAAGGAAC 

241551 CCTTTGCAAA CCAATCTTGA GCATCCAAAA TGTCTTTTTC AATTGCTCGT 
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241601 ATTAGAGTTT CTTTTGATTG AAACTTTTTT TCTTCTCTAA GAAATTTTCT 

241651 CGGGATAATG CTCACTTCTT TGCCGTATAG ATTTTCCGCA AAGGAAAAGA 

241701 TATGCGCCTC TGCATATAAA GACTCTCTTC CAAAAGTAGG GGCAGTTCCT 

241751 AAATTCATAA CACCCTGACA GGTAGTGCTA TCATAACGTA TTTCACAAGC 

241801 ATAAACTCCT AGGGGAATTA AACTTTCTTC TCTAGGAAGA TTTATAGTGG 

241851 CGAATCCTAG AGAACCTCCT ATTCCGGAGC CCTCGGTTAT TTTTCCAGAA 

241901 ATGGCATAGG GATGACCCAA AAAACGATGA GCACATTCAA GATTCCCTGC 

241951 GGACAGAAAC TGGCGGATTG CTTTGCTGGA GACAACTATG TTATCCATAC 

242001 GGTAAGGAGG AATCTTGATG ACCTCTATAC CTAACGGCTT GCCTATAGTA 

242051 TCGAGAGCCT CGGTATTGCT TTGCTGTTCT TTCCCTATGC AAGAATCATA 

242101 ACCTAAGATG AGGCGTTTGC ATTTCAAGTT ACGATGTAAC AAAGTAAGAA 

242151 ATTCTTCTGC CGATTGATTC GCAAAATTTA AATCAAAAGT AAGGACACCT 

242201 AACCAGTCTA TGGGAAACGT TTGCAATAAT TGGAGGCGCT CTTCTTTTGT 

242251 ATTGATGAGT TTCGTGTGAT TTAAAGAAAG TACCGTTTGA GGATGAGAAT 

2423 01 CAAAGGTAAT AACTCCACTG GATCCAGAAT AGGAAGTAAG AATAGATAAA 

242351 AGATTGCTAT GCCCTAGATG ACATCCGTCG AAAAAACCTA CAGTTACAGA 

242401 ATCTACAGAA AACGAAGACG TTAAACTATA GGCTATTTCC ATGGGCATCT 

242451 CGTAGGTAGG GAGAAATATC GAAATCGGGG TGGTCTAATA GATTCCCATC 

242501 AATACATTCA TCTATAGAAA AACGGCCACT GCGTAAACGG CGTAGCTGCT 

242 551 CAAGATAAGC TCCACAGCCT AACATCGTGC CAAGCTCATG AGCAATGCTG 

242601 CGAATATAAG TTCCTTTGCT ACAAGAGACT ACAAAATGCA ATAAAGGGTA 

242651 CTCATATTTC GTAATCTGCA AGTGAACTTG AACTGTAGAA TGGTGACGTT 

242701 CTATAGATAA ACCTTTTCTA GCATATTCAT ACAGCTTTTT CCCTTGGACT 

242751 TTTTTAGCGG AAAACATGGG AGGCAGTTGC TGGATCTCTC CTTGGAAATA 

242801 CTCGGCAGCT GATAATACTT CTTCGAGACT AGGAATCTTC TTAGATCTTC 

242851 CTACAACTTT GCCGTCGCAA TCATAAGAAT CGGTAGTTGT CCCTAAATGG 

242901 GCAATTGCTT CGTATTCCTT GTCTTCAAAA AGTAAAATAT CAGAAAGTCT 
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242951 AGTAAATTTA CGGCCAATCA ACATGACCAT AACGCCAGTA GCGAAGGGAT 

243001 CTAAAGTTCC TGCATGACCA ATCTTTTTAA CGCCTATTAA CTTGGTTAGA 

243051 GCGCGGATAA GGCTAAACGA AGTTCTCCCT TGAGGCTTGT CTACAAGAAG 

243101 AATGCCCTCT TTTAATTCTA CTGCAAGATC CATAGTCATG TCTTTAATAG 

243151 TATTCAAGTT CCCAAAAAAT ATAGTTTATT AACTCTTTTC TTTCTCTTGA 

24 3201 ATCTGCCAAA GCAGGTTTTC TATATAATCT TGAGGTGAGA AAATATCATC 

243251 GAGATAAAAA TGAAGTTCTG GGAAATATTT AAGGACGACA TTTTTCGAAG 

243301 CTCTATGAGC GATAAAACCA GCAGAGACTT TTAAAGCTTC TAAAGCCTCT 

243351 TCCTTGGTAT TCTCATGAGG CATTACAGAT ACATAAACAC GTGCAGAGTG 

2434 01 CAAATCCTTA GATAGAGAAA CACGAGTTAC CGTGATCCAA AGATTAGAAA 

2434 51 TCTTGGGATG CTTAACATCT TTTAAAATTA CCTTTGCAAT GGCTTCTTGT 

243501 AATAAAGCAT TTACCCGTTT AATACGTCTA TTTTCTGTCA TACAGTACTT 

243551 CAAGTTATAG TTTTTGTGGA TGATAGATAA CTTCATAACA TTGTAGGACA 

243601 TCACCTATTT GAGCTTGCTG GTATCCTTCT AACAAAATTC CACACTCTAA 

24 3651 ACCTTTGCGA ACTTCTTTGA CATCTTCTTT AACACGTTTT AATGAAGATA 

243701 ACGTACCTTT CCAAAGGATC TCTTTATTAC GTAATACTCG GACTTTATGA 

243751 TTGCGAGTCA TAATTCCTTC AGTAACTATG CAACCGTAAA TAGATCCTAC 

24 3801 TTGTGAAGAC CTAAAGATTT CTTTAATCTC AGCAGAACCT TCATCTTTTT 

24 3851 CTTCAGCAAT AGGATCTAAT AGAGAAGTCA TAATTTCTTT AATTGCATCA 

243901 ATAGCATGAT AGATGACGGT AAATAGTTCA ACTCGGACTC CTAAGCTCTT 

243951 AATTAAAGGT TCCGCATGAC TTTCTATTCC TGTATGGAAA CCGATGAGAA 

244 001 CTGCTTTAGA GGCGGCAGCT AAACGAATGT CTGATTCTGA AATTTCTCCT 

244051 ACACTGTTTG TTAAAATTTC AACATCTACT TTTTCTGATT TAATCTTAGA 

244101 TATTGAACTG ACCAAAGCTT CTATGGAACC TTGAACATCA GCTTTAATCA 

244151 TAAGCTTAAG AGTCTTTTTA TTCTGTAACA TAGAATCAAA GTTAGGCCGC 

244201 TTCTTTTGCT GTAAAGCAAA ACGCTGTTGT CCTGCGGATC TAGCTTCAAT 

244251 AATGTCTCTA GCCGTTTTCT CGTTTTTCAC GACGAAGAAA GGATCGCCAG 
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244301 CTTTAGGAAT GTCCGATAGA CCTGTGATCA ACACAGGAAT AGATGGCCCA 

244 351 GCTTCTTTCA TCAATTCATT ATGTTCGTTA TGCATAGTTT TCACTTTGCC 

244401 ATAACAATCA TTGAAGACGA GAGCTTCGCC CAGTTTTAAG CTTCCATTTT 

2444 51 GAATCAAAAC AGTCGCAACA GGTCCGAGAC CCTTGTGCAG TTCTGATTCA 

244 501 ATAACAAGTC CTCGAGCACG TGCTGAAGGA TCGGCTTTTA GCTCCAAGAC 

244551 TTCAGCTTGT AAAGCTAACA TCTCTAAAAG TTCTGAAAGA CCTTCTCCTG 

244601 TTTTTGCGGA GGTATTTACT GTAACAGTCG AGCCTCCCCA AGCTTCTGGC 

244 651 AATAGATTGA TTTCAGAAAG TTGTCTATAG ATGGTTTCGG AATTAAAATT 

244701 AGGCTTATCA CACTTGTTGA TAGCTACAAC AATAGCGATA TCAGCAGCTT 

244751 TTGCATGTTC AATAGCCTCT AAAGTTTGTT CTTTAATTCC TTCGTCTCCA 

24 4801 GCGACTACAA GCACAACAAT ATCACAAACT TCAGCTCCAC GGGCTCGCAT 

244851 TGCAGAGAAA GCTTCGTGAC CAGGAGTATC TAAAATTGTT ATGTCTCCCA 

244901 CTGGGGTGGA GCAGCAGAAG GCTCCCATGT GTTGGGTAAT CGCTCCAGCT 

244951 TCTGTTGCAG CGACATTACT TTTCCTTAAG GAGTCAATGA GTGTTGTTTT 

245001 TCCGTGGTCG ACGTGACCCA TAAACGCAAC AATAGGGGAG CGAATCACAA 

245051 GCTTGCTGGG ATCTGTAGAT TGAATTTCGT CTCTTACAGT GTCATTGCTT 

24 5101 AGGCACAACT TATCTTGCTC AGAATAGTCG ATGTCAATTG TACATCCAAA 

24 5151 CTCTAAGCCA ATAAATTGTA CTGCAGTTTC GCTGTCTAGA ATATCATTGA 

24 5201 CTACATAGGT CATTCCATGA ATGAATAACT TTTGAATGAC TTCTGAAGCC 

24 5251 TTGAGCTTCA TTTCTGCTGC CAGATCTTTG ACGGTAATTG GCAAGGAAAT 

.245301 TTTGATATGC GTAGGTCGCT GGATAGAGGC TTCGTCATAG TGTTTTTTAG 

245351 GCTTATAAAC ACGTTTTTTT CGCCATCTGT CTTCTTCTCC GCCTTCATTT 

24 5401 AATCCGTAAC GATCTCTTCC TGTAAAAGCC TTTAGGCTTT CATCAGATTT 

245451 CTTAGAACGA TCACGAAAGT CGGTAAGATT TTTCTTCCCA GCATCTCTTT 
24 5501 TTGGACCAGA AGCAGGACTA CGGTTAGCAG GATTGAATTG TTTTTCTCGA 
245551 TTATTCTGTT CACCACCTTC AGATGTTCCA GGTTTCCCTG TTTTATCTGA 
24 5601 AGCAACGGGC TTTGTGCTTT TCGAGCCAGC TACGACTTTC TCTTCCTTGG 
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24 5651 CAGGAGCCTT GAATGTTTTT GCTAGGAGAT GATTGATATG CTTTCCTGTA 

24 5701 GGGCCGAACT TAGATTTAAT CATTACGACG CTTTTAGGTT CAGCAGGCTT 

24 5751 CACAGGCTTA GGCTCTAATT CTTTTTCTTG AGGTGGGGTT TCGGGCAATA 

24 5801 CGGGTTGCTC AGGAAGAACT TCAGCAACTG GATGAACCTC AGGACTTTCG 

24 5851 TCACAAACCT CATCGACTAC TTCTAACTCA GGCTCAGGAT CTGCTATGGA 

24 5901 GACTGGAGCA GGTTCAGATG TATCCACTGG AATATGAGCA GAAGACTCTT 

245951 CTTCGGATGA TGAGAACGAC GAACGATTTT TAGCACGAAT GCGACGTGAA 

246001 GTAGACTCTG GTGAAGCTTG TTCCGCACTT GCCGTAGGGG TAGAAGTTGC 

24 6051 GGCAAGAGCT ACTTTTACAG ACTTTTCTTT CGCAGAAGGT TTTTCTGAAG 

24 6101 AAGATTTAGC TTCAGAAGAT CCTGCTTGGG CAAGTTTTTG TTTTAGCTTA 

246151 TCCAGCCCCG CGGCTTTCGT TAATTGAGCA TTCTTAATCT TCAATTTCAG 

246201 GTTTTTCGTC AACTTTACTT TCTCCATATT TGCTGACTTG CTCAAGGATC 

24 6251 TTATAAGCAA GCTCTAAACT GATCCCAGGA ACAGATGCCA GATCATTAGC 

24 6301 ACTCGCTAAT AATACTCTTC TAATTGTGTC ATATCCAGCA TGTTCTAAAT 

246351 TTTGGATGAC TAGCTTACTA ATCCCTTCCA TTTCTAAGGG TTGATCTAAG 

24 6401 TGCGGACTAT CGAATTCTGC TAATTGAAGG CGTTGAATTT CTAGCAACTT 

246451 ATTGTACTCA CTCATACGTT GTACTTCGAG CTCGTAGTCT AGAATGTGGC 

246501 TAATTAAACG AGCGTTAATT CCTCGTTTAC CAATAACAGT AGCGTAGTCT 

246551 G CATC ATT AA CGACAATTGC AATCACTTTG TCGTCTTCTA AAATAGCAAT 

246601 CTTTTGGATT TCTATTGGAT AAAGAAGATT CTGTAATAAC TCTGTAGAGA 

24 6651 CGGGGGAGTA ATTGACAATG TCAATTTTCT CATCGTTCAA TTCTCGAATG 

246701 ATATTTTTTA CTCGAGAACC TCGCATACCT ACAAAAGCTC CAACAGGATC 

246751 AGTTTTAGGG TCTGACGATC TTACAGCTAG TTTCGTGCGG TACCCAGCTT 

246801 CACGAGCTAT CTTAACAATC TCCACAGAAC CTTCTTCTAG TTCTGGGACT 

24 6851 TCTTGAATAA ATAATTGTTT AACAAATTCT GCGTGACTAC GACTGAGGAT 

246901 AACTTCCGCT CCACCATTTT CAGACTCTTG AACTTCATAG AGTAGGGCGT 

24 6951 AAATTTTATC ACCGATCTTA TGTTTTTCTG TTTTAGGATA AAACCGGGTA 
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247001 GGAAGAATTG CTTCAACTTT TCCTAAGTCA ATAATTAAAT TAGAACCTTT 

247 051 AGCAAAACGT TTGACAACAC CAGATAAAGT TTCATTTACG CGATGGCGAT 

247101 ATTCTTCATA AATAACGTCT CTCTCAGCAT GTCTTAGCTT TTGACCGATA 

247151 ATTTGTCGTG CTGCGTGAGC AGCTATCTTC CAAAATTATC AGAAACAAAA 

247201 GGGACATCCA TGTACTGACC AATCTGACAG TCCGGATCGT ATTCTCTGGC 

247251 TTTATCTAAA GGAATTTCTT TGCTAGGATT CTGACAAATT TCTACTATTT 

247301 CCTTTTCACA AAAGACTTCG ATGTCACCAG TACGAGAATT AATGTTTACA 

247351 GATATGTTCG CGTCATCTCT TAAGGTTTTT TTAGCAGCAA TTTTTAAAGC 

247401 AGATTCGATA GCTCCTATAA TAGTAGAGCG CTGAATCCCT TTTTCTTTCT 

247451 CCATGTAGTC AAAAATAGCT ACAAGATTTT TATTCATTAT ACTCCTCTTG 

247501 TAAATAAGGA AATCAGTAAA TAGCTAAATT AAGGACGTAT TATTTAGAAC 

247 551 TAAATAATAC GATCCTCTGC ATTACCAGCA TTAGATGCTA TTTTCCTTTT 

247601 TTCTTTCTCT CTTTAGGGCC TTGAGAATCC TTGAAATCTA ATTCAGTCCT 

247651 AGAGTCTTGA TCATAAGCAT TGTCAGCTAA GTATTCTTTT ACAGAAAGAG 

247701 AAACTTTTTT ATGATCTGGA TCTAGCTTAA TTACTTTTGC AGAAACATTT 

2477 51 TCTCCAATGG AGATAATATC TTCAATTTTT GCAAAGGGCT TGTCAGAAAG 

247801 TTCTGAAACG TGAATCAATC CTTCAATCCC GTTTTGTAGC TCAACAAAGG 

247851 CTCCAAATGC AGTGATTTTA GTCACAACTC CTGAAATTAC TGTGCCAGCA 

247901 GGGAACATAG CTTCAATTTC ATTCCAAGGA TTAGAACTTA ATTGCTTAAC 

247951 TCCTAAAGTA ATTTTTTTAC TTTCTTTGTC TACTGATAAA ATAACAGCCT 

248001 CTACAGAATT TCCTTTTTTG AATAGTTCTG AAGGGTGAGA GACTTTTTTA 

248051 ATCCAACTCA TGTCAGAAAT ATGAATCAGA CCCTCAATTC CTGGTTCTAA 

24 8101 TTCAACGAAA GCACCGTAAT TGGTTAAGTT CTTGATTTCA GCATTGACAT 

248151 GGAGACCTAT AGGATATTTT TCTTCGATAT TGTCCCAAGG ATTACGTTCT 

248201 GTTTGCTTTA ATCCTAGAGA AATTTTTCCT TCGTCCTTCT GAATAGATAG 

24 8251 AACAATGGCT TCAACTTCAT CGCCTTTATT TACGACTTCA CTAGGATCTA 

248301 CAATATTTTT CACCCAAGAC ATTTCAGAAA TGTGAATTAG ACCTTCAATG 
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248351 CCCTCTTCAA TTTCAATGAA AGCTCCGTAG GGGAGAAGCT TCACAATTTT 

248401 ACCAAGAACT CGTTTTCCAG GAGGGTATTT CTTCTCAATA TCTTCCCAAG 

24 8451 GATTATGCTC TTTTTGTTTG AGACCTAGAG CAACTCGTCC TTTTTCTTTA 

248501 TCTACGCTTA AAATAATTAC TTCCAACTCT TGATTCAATT CGACCATTTC 

248551 GGAAGGATGT CGTATGCGCT TCCAGGTCAT ATCGGTAATG TGGAGAAGAC 

248601 CGTCAATACC ATCGAGATCT AAGAATACAC CAAAGTCAGT AATGTTTTTA 

248651 ACAACTCCTT TGCGGTATTC TCCGATAGAA ATTTGTTCAA TAAGTTCGGC 

248701 TTTCTTAGAG ATTCTCTCAG CTTCTAAGAG TTCTCTTCTT GAGACAACAA 

248751 TATTGCGACG TTCAACGTTA ATTTTTAAAA TTTTGAATTC ACAAACTTTT 

248801 CCGACATAAT CATCTAAATT TTTGATTTTC TTGTTGTCAA TTTGTGATCC 

248851 AGGTAGGAAG GCTTCCATTC CAATATCTAC AATAAGGCCG CCTTTGACTT 

248901 TACGTGTAAT TTGACCTTTA ACAATAGAAC CTTCTTCACA ATGAGCTAAG 

248951 ATGTATTCCC ATTGACGTTG TCGTGTGGCT TTTTCTCTAG AAAGGACAAC 

249001 TTTGCCCTCT TCGTCTTCGG CTTGGTCGAG ATAGACTTCT ACTTCAGCTC 

249051 CAAGCACTAA ACCTTCTGAA GAGTCTATGA ACTCTGACAT AGGGATCACT 

249101 CCCTCAGACT TCAGACCAAC ATCAACTACG ACAAAGTCTT TATTAATATC 

249151 AACTACGGTA CCTTTTAGGA TGGCGCCAGG CTGTATTTGG TTATCAGATT 

249201 CTTCTTCGCT CGAAGTAATT CTGTGTGCCG TATAAAGCAA ATCTTTAAAT 

249251 TCGGCAACGT CTTCTGTGAG GCATTCTATA TTGTCCAGAA TTTTTTTAGA 

249301 TCCCCAAGTA TATTCAGCTT GTTTTGGCAT TTATTGTTAT TCTCCTAAAA 

249351 ACTACAAAGT CAAAAAAGAC AGTGTAAATA TAAACCTTAA AAAAGGCAAG 

249401 ACCTCCCTTG CAGGAGACGA TTAGTTCTAT ATAATCGAAT TAATCGTAAA 

249451 ATCCTATACT 'TCCGTTCTAG AGATGTATAA GGCCTCTTTT TTAACTTTTC 

249501 TATATTACCA GATTTTATAA AATACGTGTG GTAATTATCA TAAATGTGCC 

249551 TTGTCATATG TAGAGAAACA GGATTTGGCG CATAGAAAAC ATATCGTTTT 

24 9601 TTTCATAAGC ACCCCATAAA TTCTCGTCCC AGATGGGCTA CATCAGCTTT 

24 9651 GTTCTTAAGC CTGATAGATC TTTTTGGTTG CTGCTCAAGA ACCAAGTGCA 
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24 9701 TTCAGAAGTA GAACCATACT TATTGAGGGT TAAATAATAG 

2497 51 GGTTTCTTAG CTCTAAGAAA ATTCTAAAGA TTAATCACAG 

249801 GTTATTACTG GAAGATGAAG ATCTTGAGAT TTGAAACTTT 

249851 CCGTATAAAT TAAAACTAGA CAAAAGCTAT TCAAAAGCAA 

249901 TAGAGAAACT GACGCATAGA CGCTAAGTGT TGTTTATTAG 

249951 ATTAAGTGTC TTTTTTCTAG TAGATCAAGA ACAGTGTTCT 

250001 AGTTAGATAA GGAAATTATA AATGATTCAT TCCCGGTTAA 

250051 TTCAGGTCCA TCTGGATATA CAGCGGC AAT TTATGCATCA 

250101 TGCATCCTCT TTTATTTGAG GGGTTTTTCT CTGGGATCTC 

250151 CTTATGACTA CAACAGAAGT TGAGAATTTT CCAGGGTTTC 

250201 TCTTGGGCCA AAACTTATGA ATAATATGAA GGAGCAGGCT 

250251 GGACCAAGAC ACTAGCTCAA GATATTATTT CCGTAGATTT 

250301 CCTTTTATTT TGAAATCAAA AGAAGAAACC TATTCTTGTG 

250351 CATAGCTACA GGAGCTTCTG CTAAACGTTT AGAAATTCCT 

250401 ACGATGAATT TTGGCAAAAA GGAGTGACTG CTTGTGCCGT 

250451 GCTTCTCCTA TTTTTAAAAA TAAAGATCTT TATGTGATTG 

2 50501 TTCTGCTTTA GAAGAAGCTC TTTACCTGAC TCGTTATGGA 

2 50551 ATGTAGTTCA TCGTAGAGAT AAACTGCGGG CTTCTAAAGC 

250601 CGGGCGCAAA ACAATGAAAA AATTACATTT TTATGGAATA 

2 50651 AAAAATTTCT GGAGATAGCA TTGTCCGTTC CGTAGATATT 

2 50701 AGACTCAAGA AATTACAACT AGAGAAGCTG CGGGGGTGTT 

250751 GGCCATAAGC CAAATACGGA TTTTCTCGGA GGACAGCTGA 

250801 GTCGGGCTAT ATTGTGACTG AGAAAGGAAC GTCCAAGACT 

250851 GAGTATTTGC TGCTGGAGAT GTTCAGGATA AGTACTATCG 

250901 ACTTCTGCAG GAAGTGGTTG TATAGCAGCA CTAGATGCTG 

250951 AGGCTAATGC GATTGCAGTT GCTGTGGCAT ACTCTTTGCA 

2 51001 GAGAGAATGA CTTTAGAAAT TCCAATTTTT GCATAGACAT 
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251051 GAGAACTTCG GGTCCGTGAG ATACTTTAAA GACTTCGATG TCTTTCCAGG 

251101 CAACAACGCT CCCTATGCCA GTTCCTAAAG CTTTTGCTAC AGCTTCTTTT 

251151 CCAGCAAAGC GACCTGCAAA TGAAGGGATG GGATCGGTCT TTTCTAAGCA 

251201 ATATTTCTGT TCTGCTTCTG TAAAGATTCT ATTGAGTAGT CGATTGCCGT 

251251 GAGTTGCAAT TGCCTCGCGA ATGCGGCTAA TTTCAATAAT ATCGGTTCCT 

251301 ATATGAATGA TTTCCATAGA GTTCGCACTA ATTTCCTTCA GGATTTTCCA 

251351 TTAGGGCATT CTGTAATTTT GAGAAGTAAG CGATGGCAGT TTTAATGAGA 

251401 GCATAGCCCA CTGTAGCTCG AATCACGAAT AAGGCTTGAG GATTGCTGAT 

2514 51 ATAACGACGA AGGACTAAAG AGAAGGCAAT CATAATGACC ATGATGAGAA 

251501 ACCTTCTAGA GATAAAGGTC TGCTTTATAT AGGCAATCCA TGTAATTTTT 

251551 TGTGAAAAGA ATTCAGAAGA TAGACTCAGC TGTTTCCTTA TAGTTTTGCT 

2 51601 TAGCAAGTAG CGATGTTTTA GGGATGCTAA TCCCCAAGAA ACCATAGATA 

251651 GAAGACAGTA GGTAAAAAAA GAAGAAAGGG GGTCTAGGGC TATTGCGGTA 

251701 GCTTTTAGCA AGAGTTTCAT TCCTGCTGAT ATCCACAAAA TACCAGGAAA 

251751 TAGTATCAGG AAATATTTGA TGTTTCTTGC CATAGTGCAT TACACTTAAT 

251801 TTCTAAATAC TCTATTTTAC TTGGAGTATG AATTTTTTGC TAACTGGTAT 

251851 TAAGATATGC GAGCTGAGAT GGCTGTGATC TATTGGGATC GCTCAAAAAT 

251901 TGTCTGGTCT TTCGAGCCAT GGTCTCTAAG ACTTACTTGG TATGGCGTCT 

251951 TTTTTACTGT AGGGATTTTT CTAGCATGTC TCTCAGCAAG GTATTTGGCT 

2 52001 CTTTCCTATT ATGGTTTGAA AGATCATTTA AGTTTTTCCA AAAGCCAGCT 

252051 ACGCGTGGCT TTAGAAAACT TTTTTATATA CTCTATTTTA TTTATTGTCC 

252101 CTGGAGCTAG ACTTGCCTAT GTGATTTTTT ATGGATGGAG TTTTTACTTA 

252151 CAACATCCTG AAGAGATCAT TCAAATATGG CACGGAGGCT TGTCGAGTCA 

252201 TGGAGGCGTT CTTGGCTTTC TTTTGTGGGC GGCCATTTTT TCTTGGATAT 

252251 ATAAAAAAAA GATTTCAAAA TTGACTTTTC TCTTCCTTAC AGACTTGTGT 

252301 GGATCAGTTT TCGGAATTGC AGCGTTTTTT ATTCGTTTGG GTAATTTTTG 

252351 GAATCAAGAA ATTGTAGGAA CACCGACTTC TTTGCCTTGG GGGGTGGTTT 
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252401 TTTCTGATCC TATGCAAGGT GTCCAGGGAG TTCCTGTGCA TCCTGTGCAG 

252451 CTTTATGAGG GAATCAGTTA CTTGGTCGTC TCTGGAATTT TATATTTTCT 

252501 TTCCTATAAG CGTTATTTGC ATTTAGGTAA GGG AT ATGTG ACTTCTATAG 

252551 CCTGTATTTC TGTCGCTTTC ATTCGTTTTT TTGCGGAGTA TGTAAAAAGC 

252601 CATCAAGGGA AAGTTTTAGC AGAGGATTGT CTACTTACAA TTGGTCAAAT 

252651 TTTATCTATA CCTTTATTTC TATTTGGTGT GGCCTTACTT ATCATTTGCT 

252701 CATTGAAAGC TCGAAGGCAC CGTTCACACA TATAGCAGCA TCTTTTTGGC 

252751 ATGAGACTTT AGAATTTTAC AGTCTATCAC AATAAGAGAC CTGTATTGAG 

252801 AGATTCTCCT TTGGGAGATA ATGTGGGTTT ATGATATTTT GACCTCTTAA 

252851 GTTTAATTTT TAGAGTTTAT CAAATGAATA AACGCACTTT GCTTTTTGTT 

252901 TCTTTAATTG GGATTGCTTT TGTAGGATGT CAAATATTTT TTGGTTATAA 

252951 TGAATTTCGT TCCTGCAAAA ATCTAGCAGA GAAACAAAGA AAGATTTCAG 

253001 AACAGACGCT AGCTGCAGTA GAATCTGTAG GGTTAAGTGT AGCTTCATGG 

2 53051 GACACCGATG TAAACGGAGA AGAACATAAG AATAACTACG CAGTTCGTGT 

253101 TGGAGACAAG TTATTTTTAT TACATAATGG AGAAGCTGCT CAGTCTGTTT 

2 53151 ATTCTTCTGG GGAATCTTGG AGCTTTGTAG ATCACAAGTG TGGTTTCGAT 

253201 AATATTCACT TGGCTTTATA TCGTCAGCAG GGTTCTTCTT TCAATCCTAC 

253251 GAATACGGGA AAAGTTTTTC TTCCTACGAA TCATGAAGGT TTACCTGTAC 

253301 TAGTTGTTGA GTTTCGTAAC AATAAAGAGC CTTTAGTATT TCTAGGTGAG 

253351 TACGCACAAG GAAGAATTTC CAATAAAGAT AGCACGATCT TTGGTACAGC 

2534 01 GCTTGTCTTT TGGAGATCAG GAAGCGACTA TATTCCTTTA GGTCTCTATG 

253451 ATTCTCGAGA AGAAAAGTTA GTTTCTTTGG ATCTTCCTAT TACACGAGCT 

253501 GTAATTTTTG GTAATGACCA AGATTCGGCA AAGTCGTCAG ATACTGCGAA 

253551 CCACTATGTT TTATTTAATG ATTACATGCA GATTATTGTT TCTGAAGAGA 

253601 GTGGTTCTAT AGAAGGTATC AATTTACCTT TTGCTTCAAC AAATAATAAA 

253651 AGCATTGTGA ATGAAATTGG TTTTGATAGG GATTTAGCTT CAGAGAAATC 

253701 TCCTGAAGCT CTTTTCCCTG GGCTGTCTTC AAAACTTCCT GATGGCCAAC 
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253751 AAGCCAAAAA CTCGATTGGA GGTTACTACC CTTTATTGCG CAGGGGATTA 

253801 TTAAGTGATT CTAAGAAATT ACTTCCTCTA GAGTATCACG CATTAAATGT 

253851 GGTTTCAGGA AGAGAGCTAG CGACTCCTGT GGCTTTAAGA TACCGAGTTC 

253901 TTTCCTATAC CCCCCATTCC ATTCAATTGG AAAGCTTAGA TAGATCGGTT 

2 53951 CAGAAGGTAT ACAAACTTCC AGAGAATCCG GAAGAAAAGC CCTATGTTTT 

254001 TGAAACTGCA ATTACTTTAA CGAAAGAAAC CGAAGATGTA TGGGTAACTT 

254051 CAGGAGTTCC TGAAGTGGAG ATCATGTCAA ATGCTTCAGC CCCAACCATT 

2 54101 AAATACAGGG TTATCAAAAA AAATAAGGGG TCTTTAGATA AAGTTAAGCT 

254151 TCCAAAAGTA AAAGAGCCTT TAGCTATACG TCGTGGTGTT TATCCTCAAT 

254201 GGATTTTAAA TTCGAATGGA TATTTCGGTA TTATTTTAAC TCCGTTGTCT 

254251 GAAATTGCTT CTGGCTATGG ATCTCTCTAC ATTTCGGGTT CTACGGCTCC 

254 301 GACAAGATTG TCTGCTATTT CTCCTAAAAA TCAACTGTAT CCAGTATCAA 

254351 AATATCCTGG ATATGAGACC TTGCTTCCTT TGCCAAAAGA TGCAGGGACA 

254401 CATCGATTTT TAGTGTATGC AGGTCCCTTG GCAGAGCCTA CACTTAAAGT 

2544 51 ATTAGATAAG ACAATTACTC AGGAGAAGGG AGAAAATCCT GAGTATCTTG 

254501 ATAGCATTTC TTTCCGTGGT GTTTTTGCAT TTATTACAGC TCCTTTTGCA 

2 54 551 GCACTCCTAT TTATTATTAT GAAGTTCTTC AAATTGGTTA CGGGTTCTTG 

254601 GGG AATTTCC ATTATTTTAC TTACTGTATT TTTGAAATTG CTTCTCTATC 

254651 CTTTAAATGC ATGGTCCATA CGATCTATGA GGCGTATGCA GATTTTATCT 

254701 CCTTATATTC AGCAAATTCA GCAAAAGTAT AAGAACGAAC CTAAGCGTGC 

254751 TCAGATGGAA ATCATGGGCT TGTATAAGAC AAACAAAGTG AATCCTATCA 

254801 CGGGTTGTTT ACCTTTATTG ATACAGCTTC CTTTCCTAAT TGCGATGTTT 

254851 GATTTATTAA AGTCATCATT CTTATTACGA GGAGCCTCGT TTATTCCTGG 

254901 GTGGATTGAT AACTTAACAG CTCCTGATGT GTTGTTTTCT TGGCAGACAT 

254951 CGATATGGTT TATTGGAAAT GAGTTCCACT TACTTCCTAT TCTATTAGGT 

255001 ATAGTGATGT TCTTACAACA GAAGGTCACG AGTTTGCATA AGAAAGGACC 

255051 TGTTACGGAT CAGCAGAAAC AGCAACAAGT TATGGGGAAC ATGATGGCGA 
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255101 TTTTATTTAC CGCTATGTTC TATAACTTCC CTTCAGGATT AAACATCTAT 

255151 TGGCTTTCGT CTATGATTTT AGGAGTCGTC CAGCAGTGGA TCACTAATAA 

255201 GATATTAGAT AGCAAACATC TTAAAAATGA AGTGGTTTTA AATAATAAAA 

255251 AACATCGATA ATACTGAAAA AAAAGTCGAA TTGTCTTAAT ACAGGCTTAT 

255301 TGAAACAAGC GATTTATATG AGCTCATAGT TTATGCGAGC ATGGGAAGAA 

255351 TTTCTTTTGC TACAAGAGAA AGAAATTGGC ACAAATACTG TAGACAAGTG 

255401 GTTGCGATCT TTAAAGGTCT TATGTTTTGA TGCTTGTAAT TTGTATCTTG 

255451 AAGCTCAAGA TTCTTTTCAA ATTACTTGGT TTGAGGAGCA TATAAGACAT 

255501 AAGGTTAAAT CTGGTCTTGT AAATAATAAC AATAAGCCCA TTCGTGTTCA 

2 55551 CGTTACTTCG GTAGATAAAG CAGCTCCTTT TTATAAGGAG AAGCAGATGC 

255601 AGCAAGAGAA GACAGCATAC TTTACCATGC ATTATGGAAG TGTGAATCCT 

255651 GAGATGACCT TCTCTAATTT TTTAGTTACC CCTGAAAATG ATCTTCCTTT 

255701 TCGTGTTTTA CAGGAATTTA CTAAGAGTCC TGATGAAAAC GGAGGAGTTA 

255751 CTTTTAATCC AATTTATCTG TTTGGACCTG AGGGATCTGG AAAAACTCAC 

255801 TTAATGCAGT CAGCTATCAG TGTTCTTCGT GAATCTGGAG GTAAGATTCT 

255851 CTATGTTTCT TCGGATTTGT TTACAGAGCA CTTAGTCTCT GCTATCCGTT 

2 55901 CAGGAGAAAT GCAAAAATTC CGTTCTTTTT ACCGCAATAT TGATGCTCTA 

255951 TTCATTGAGG ATATCGAGGT TTTTTCAGGA AAGTCGGCAA CTCAAGAAGA 

256001 GTTCTTCCAT ACGTTTAATT CTCTTCATTC TGAAGGGAAG TTGATTGTAG 

256051 TGTCTTCATC CTATGCGCCT GTGGATCTCG TTGCTGTTGA AGATAGATTG 

256101 ATCAGCAGGT TTGAATGGGG AGTTGCAATT CCGATACATC CTTTGGTTCA 

256151 GGAAGGATTG CGCAGTTTCT TAATGAGACA GGTAGAGCGC TTATCTATTC 

256201 GCATTCAAGA AACGGCCTTA GATTTTTTAA TTTATGCGCT ATCTTCCAAC 

2 56251 GTAAAGACCT TACTGCATGC ACTGAATCTT TTAGCAAAGA GGGTAATGTA 

256301 TAAAAAACTC TCTCACCAAT TACTATATGA AGATGATGTG AAAACTCTTT 

256351 TAAAAGATGT TTTAGAAGCA GCAGGAAGCG TTCGTTTAAC TCCTTTAAAG 

2 56401 ATTATTCGTA ATGTTGCTCA ATATTATGGG GTCTCTCAGG AGAGTATTTT 
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256451 AGGACGTTCT CAGTCCCGAG AATATGTATT GCCACGTCAG GTAGCCATGT 

256501 ACTTTTGTCG TCAGAAGCTT TCACTATCAT ACGTGAGAAT AGGCGATGTC 

256551 TTTTCAAGAG ATCATTCGAC GGTAATCTCA TCCATACGAT TGATTGAACA 

256601 AAAAATAGAA GAAAATAGCC ATGATATTCA CATGGCTATT CAAGATATTT 

256651 CTAAGAATTT AAATTCCTTG CATAAGAGTT TGGAATTTTT CCCTAGCGAA 

256701 GAGATGATTA TTTAGAGGAG TAACGATCCT CTAATTTTTG CTTCTGGACA 

256751 GTTGCAGCAG CCGGCGCAGA TATCATTTTC ATTGAGTTCA ACTCCGCTGT 

256801 TCTAGCTTGT TTAAATTGTC TAATATGATT GAAAGCAAGG ACAAGAGCAA 

256851 TACCAGCAAG TACTATCTGT ATAGCAAGAA CAATGCCGAT TCCTAAGGAT 

256901 AGGGTGACTC CACTGGCAAT AACCAGTCCT AGGGTGGTTA AGGAAGCAAG 

256951 GAGGAGACCT AGACTAACTA AAATAGGCTT AAAAAATACC CAGGAACAAT 

257001 AACGATGTGT TGCCTTGTGA GATACTGATG GTTGTGGTTG TGTAGTCTGA 

2 57 051 GGTGTTTGTG CTACTGTAGC CATGTTGTTC TCCTGAGTAA AAAATTAAAT 

257101 AATTTTAATC TGTCTATATA ATTGTTGTCA ATAATATTTT TATAGATATT 

257151 ATTTTAATTT TATTAAAATT TATCAACTAA TTTTATTTAT AGTCTTTCTC 

257201 TAGTTCAGAA CTATTAAGCT CTCTTAATAG GCTCTCTTTT ATTTACCAAG 

2 57251 GCGAAGTTCT TTTTTTTTTT TCAAATTTCG CATATGGAGG GTTCTAGATG 

257301 GTATTTGAAA TGGTTGCATT GTGGAAGATT TTTCGAGTTT TGATAAGAAC 

257351 AAAGTCAGTG TTGACTCTAT GAAACGGGCG ATTTTAGATC GTCTGTATTT 

257401 AAGTGTTGTA CAATCACCAG AGTCCGCATC TCCTAGAGAT ATCTTCACAG 

2574 51 CTGTTGCAAA AACTGTTATG GAATGGTTGG CCAAGGGGTG GCTGAAAACT 

2 57 501 CAAAATGGCT ACTATAAAAA TGATGTAAAA AGAGTTTATT ACCTTTCCAT 

257551 GGAATTTCTC TTAGGGAGAA GTCTAAAAAG CAATCTTTTG AATTTAGGAA 

257601 TTCTAGATTT AGTAAGGAAG GCACTAAAAA CTTTAAATTA TGACTTTGAC 

257651 CACCTTGTAG AAATGGAATC CGATGCAGGA TTAGGAAATG GTGGTTTGGG 

257701 GAGACTGGCA GCTTGTTACT TGGATTCTAT GGCTACATTA GCAGTTCCAG 

2 57751 CCTACGGCTA CGGTATACGC TATGATTATG GTATTTTTGA TCAGAGGATC 
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257801 GTCAACGGGT ATCAAGAGGA AGCTCCTGAC GAGTGGCTAC GTTATGGAAA 

257851 TCCTTGGGAA ATCTGTAGGG GAGAGTACCT CTATCCCGTA CGATTTTATG 

257901 GAAGGGTCAT TCATTATACC GATTCTCGAG GGAAACAGGT GGCAGATCTT 

257951 GTCGATACCC AAGAGGTATT GGCGATGGCT TATGATATTC CGATTCCTGG 

258001 GTACGGTAAT GATACTGTAA ATTCTCTAAG GCTATGGCAA GCACAATCTC 

258051 CGCGAGGCTT TGAATTCAGC TATTTTAACC ACGGGAACTA TATCCAGGCT 

258101 ATAGAAGATA TCGCCTTGAT AGAAAACATC TCTCGCGTCC TCTATCCTAA 

258151 TGATTCTATT ACTGAGGGGC AGGAATTGCG TCTCAAACAA GAGTATTTTT 

258201 TAGTTTCAGC AACCATTCAA GATATTATCC GCAGATATAC AAAGACACAT 

258251 ATTTGTTTGG ATAACCTTGC GGATAAAGTC GTAGTACAAT TAAACGATAC 

258301 CCATCCCGCT TTAGGGATTG CTGAAATGAT GCATATTTTA GTCGATAGGG 

2 58351 AAGAATTACC TTGGGATAAG GCTTGGGAGA TGACTACAGT CATCTTTAAC 

258401 TATACCAATC ATACAATCCT CCCAGAGGCT TTAGAGAGAT GGCCTCTCGA 

258451 TTTATTCTCT AAGTTATTAC CTCGGCATTT AGAGATTATT TATGAAATAA 

258501 ATTCCCGTTG GTTAGAAAAA GTTGGCTCTC GCTATCCTAA AAATGATGAT 

258551 AAGCGCCGGT CTTTATCCAT TGTTGAAGAA GGGTATCAAA AGCGTATCAA 

258601 TATGGCAAAC CTTGCCGTAG TAGGTTCTGC AAAAGTAAAT GGAGTTTCGT 

258651 CATTCCACTC TCAGCTGATT AAAGATACTC TCTTTAAAGA GTTTTATGAG 

2587 01 TTTTTCCCTG AGAAGTTTAT CAATGTGACC AATGGGGTGA CTCCACGACG 

2 587 51 ATGGATTGCT CTCTGTAATC CTCGTTTGAG TAAGCTTCTC AATGAAACTA 

258801 TAGGGGATCG TTATATCATT GATCTTTCTC ATCTTTCATT GATCCGTTCC 

258851 TTTGCCGAAG ATAGTGGTTT CCGAGATCAT TGGAAAGGGG TAAAATTAAA 

258901 AAATAAGCAG GATCTAACAA GTAGAATTTA TAATGAAGTT GGAGAAATAG 

258951 TAGACCCTAA TTCTCTCTTT GACTGTCATA TTAAGCGTAT TCATGAGTAT 

259001 AAACGACAAC TAATGAATAT TTCTTAGAGT CATCTATGTT TATAATGACT 

259051 TGAAAGAAAA CCCTAATCAA GATGTCGTCC CTACAACAGT AATTTTTTCT 

259101 GGTAAGGCGG CTCCTGGCTA TGTCATGGCC AAACTCATTA TCAAGTTAAT 
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259151 CAATAGCGTT GCTGACGTTG TAAATCAAGA TTCTCGAGTT AATGATAAGC 

259201 TTAAGGTTCT TTTTTTACCT AACTATCGAG TTTCTATGGC TGAGCATATC 

259251 ATTCCTGGTA CAGATCTTTC AGAACAGATT TCTACAGCTG GAATGGAGGC 

259301 TTCTGGAACA GGAAATATGA AATTTGCTTT GAATGGAGCT CTGACTATAG 

259351 GAACTATGGA CGGTGCAAAT ATAGAAATGG CAGAGCATAT TGGTAAGGAG 

259401 AATATGTTTA TTTTTGGTCT TTTGGAGGAG CAAATTGTAC AACTGCGGAG 

259451 GGAATACTGT CCTCAGACAA TTTGTGATAA GAATCCTAAG ATCCGTCAGG 

259501 TTTTAGATTT GCTAGAACAG GGATTTTTCA ATAGCAATGA TAAAGATCTG 

259551 TTTAAACCGA TAGTACATCG CCTACTGCAT GAAGGAGATC CCTTTTTTGT 

259601 CTTGGCTGAC TTGGAGTCTT ATATCGCTGC CCATGAAAAT GTGAACAAAC 

259651 TCTTTAAGGA ACCAGATTCA TGGACTAAGA TTTCTATTTA TAATACTGCA 

259701 GGAATGGGCT TTTTCTCTAG TGACAGAGCC ATTCAGGATT ATGCCAGAGA 

259751 TATTTGGCAT GTTCCTACAA AATCTTGCTC TGGAGAAGGA AATTAAGAAA 

259801 TAGACAGGAT GGAATCAGGG ACTCTTTCCA TAGCCAAGGA GCTATAGAAA 

259851 GAGTCCTTTT TGTTCAAAGA TTGCTAGTTT AATAGTAGGA CAGCCGGAGC 

259901 TTCTAAGATC TTTTGTAATC GTTTCATAAA CATCGCAGCA GGATAACCAT 

259951 CAATCACTCT ATGATCTACA GATAGGGTAA GATTGCAGGT AGATCCTATA 

260001 GTAATTTCTC CGTCAAGAAC AAGAGCTTGT TCTGTAACAC TTCCTACGGC 

260051 AAGAATCGCC GCTTGAGGAG GATTGACAAT CGCTGTAAAT TCAGTGATTC 

260101 CTGTCATTCC TAAGTTAGAG ACACAGAAGG ACCCTCCTTT GTATTCAGTG 

260151 TCTTGAAGAG ATTGATTTCT TGCTTTTAAC GCTAAGCTCT TAATTTCTGC 

260201 TGAAATCATG CCGAGATTTT TACGGTCTGC GCAGCGTATA ATTGGCGTAA 

260251 TAATTCCATC TGGAATGGCC ACAGCTATCG AGATATCGAT AGTATCAAAA 

260301 CGGACGATTT TATTATCGAC ACTGTTAAAT CCTGAATTGA TAGAAGGGAA 

260351 CTCTTTGAGC GCCAGAGCAC AGGCACGTAC AATGCAATCG TTAATAGAGA 

260401 GTTTGATTCC CTGAGCTTGA AGTTCTTTGA GCAGATTAAG GAGAGGTGAG 

2604 51 GCGTAGACCT GCTGCCTTAC ATAGAAGTGA GGAATAGAGA TCTTAGCAGC 
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260501 TTGTAGGCGT GCAGCAATCA CTTCCCGAAT CGGAGAGAGA TTCTCCTCAT 

260551 GATAGGAACC TGGAGGCACT TCGGGAGACT CAGGATAGCC AAAACCAGCA 

260601 ATGCTTTTAG GAGGAGCTTT CTCTAAATCT TTTTTTACTA TACGTCCTCC 

260651 AGGACCACTC CCTTGAATTG ATGAGACATC TATGTTTTTC TCTTTTGCTA 

260701 GTTGTCTAGC TAATGGAGAG AGATTATTCG TAGTGCCTAC GTGTTTGAAG 

260751 ACTAAAGGCG AGGAGAGAGG TGGCTCTGGC TTAAAAGTTA CTGCTGTGAA 

2 60801 TGTTGCTGAG GCAGCTTGTG GAGTTGTTGC AGGCGAGACC TCTTCAGAAG 

260851 AACCTTTTGG AGATGCTTCA AGGTTAGAAG GTTCTGTCTT AGGAAGAAGT 

260901 TCTTCTAGAT TAAAGGGCTC GTTGGCTTCT GTAGAGAGTA CCGCAATAGG 

260951 GGTGCCTATA ACGATTTTCT CGCCTTCATG ACGTAAGATT TCACGAATCC 

261001 AGCCATCTTC ATTTGCTGTA TGTTCTAAAA TAGCTTTGTC TGTAGAGATC 

261051 TCTACAATGA CGTCTCCAAA ACTGACCTGA TCATTACTTT TTTTATGCCA 

261101 TTTCACTATA GTGCCCACTT CCATAGTTGG AGAAAGCTTT GGCATTTTCA 

261151 ATAAGGAGAT CACAAACTTA CCTCATGACT TTTTCAATGG TATCTAAGAT 

261201 TCGGTTAACA TTAGGCAAAG TGGCCTGTTC TAAGATTTTA CTATAGGGCA 

261251 TAGGCGTTTC TTTTTGGCAT ACCCTTAAGG GGGGAGCATC AAGAGAATCA 

261301 AAAACATGCT CAGTAATCAG GGCAATAATT TCAGAAGAAA TCCCAGCGAA 

261351 GTAGTGGCCC TCTTCAATTA CAATACAGCG TGAAGTTTTT CGTACCGATG 

261401 ATAAAATTGT TGATATGTCT AAAGGTTTGA TCGTTCTTAG ATCAATAATT 

261451 TCTATAGACA AGCCCCAACG TTTTTTGGCT AGAGAACACG CTTCTTTTGT 

261501 AATGGAAACC ATACGGCTAT AAGTAATAAT TGTAAGGTCA TTTCCTTCTT 

261551 GAACTCTATG TGCTTTCCCA ATAGGAACGA GATATTCTTC GGTGGGGACT 

261601 TCCCCTTTTA AGTTATATTC TAGCTCGTTT TCTAAAAAAA GAACGGGGTT 

261651 ATTATTTCTG ATTGCTGATT TTAATAAGCC TTTAGCGTCG TAAGGGTTCG 

261701 AAGGGGCTAT AATAATAAGA CCTGGAATAT TAGCATACAA CGACTCAACG 

261751 CAATGAGAAT GCTGGCAAGA TACCTGGGCT GCAGCACCAT TAGGGCCACG 

261801 AAAAACTATA GGAACGGAAA ACTTCCCTCC AGTCATAAAA TGCATCTTAG 

194 



WO0027994 f file:/A\dcwas03\firmdata\lp\FoleYPat\PatentDocuments\WOQQ27994.cpc 1 



Page 322 of 330 



WO 00/27994 PCT/US99/26923 

261851 CTGCATGAGA AATGATTTGG TCCAAGGCTA CAAAGGAAAA GTTCCAGCTC 

261901 ATAAATTCTA TAATAGGGCG CAGGCCTGAC AATGCGGCTC CTATTCCAAT 

261951 TCCAGAGAAG GCTGCTTCAC TAATAGGAGC ATCAATGACT CTCTTAGGGC 

2 62001 CCCATTTATC TAATAAGCCT TTGGTGACTT TATAAGCACC ATTGTAGTCA 

262051 CCAACCTCTT CACCAAGAAT ACAGACATTA GGATCGCGAG ACATCTCTTC 

2 62101 GTCAATTGCT TCTCGGAGAG CTTCTCGAAT TTCTAATGTT TTATGTTTAG 

262151 GCATAGACTC CTTCCTCTAA TGTGGTGACG GATGGATCTG ATGAGAGTTT 

262201 TGCGTTAGAG AACGCTTCTA AAACAGCAGT TTTGCATTCT TGGCGTATAT 

262251 TTTGAAATTC CTCTTCAGTC AGAACCTCTA ATCGAATTAG CCAATCTTTA 

262301 GCTAGGACAA TAGGATCTTT TTTAAATAAA CACTGCATTT CTTCTTTCGA 

262351 TCTATATAAA TTAGGATCTG ATATAGAATG CCCTCGAAAT CGGGAGCAGA 

262401 GACACTCAAC TAAAACCGGA GATTCGGTAT CAACCATATA GCGATAAGCC 

2624 51 TCTCTAAATC CTAAAAGAGA GTTAAATAGA TCAAAACCAT TGACTGTGAC 

262501 TGCACGGATA TCGTAGGAAC TTCCTTGAGA CTCTGCTATG GGCTGTTTTG 

262 551 CAACAGCACG ATTTAATGAC GTTCCCATAC TCCAGCCGTT ATTTTCAATA 

262601 ATAAGCATTA GAGGGAGTTG GTGAAGAGAA ACAAAGTTCA GAGTTTCATG 

262651 GAATACACCT TGAGCTACCG CACCATCTCC GATAAAGCAT AGAGAAACTC 

262701 TATTTTTTTG TTCTTGATAT TTGATGGTAA ATGCGGCTCC AGCTGCGAGG 

262751 GG AATTTGTC CTCCGACAAT ACCAAATCCT CCAGGGAAAT TAGGCCCACA 

262801 CATATGCATG GATCCTCCAC GACCTAAAGC GCATCCAGTT TCTTTCCCTA 

262851 AAAGTTCAGC AGCAATTTCT TGAAGGGGAA TGTTGAGAAG AATCGCAAGT 

2 62901 GCGTGGCAGC GGTATGAAGA GAACACCCAG GGATCTAGTC CTGTGTTTGC 

262951 GATTGCAGCA GTTGCTACAG CTTCTTGGCC AGCGTAAGAG TGGTAAAATC 

263001 CACCCACTAG CCCTTCTAGA TAGGCTTCTT CTCCTCGGGC TTCGAATTCA 

263051 CGAATCAGAA CCATCTGTTT TAAAAATTTA ATACAGGAAG CGGGCCCGTA 

263101 AAGGTCTAAG ATCCTTTCTA CTGTGGATTT CTCTGTGCCC TGAGAAGCTA 

263151 TATTATAAGG TGCTGAACTA TCCATAACTT TTTTATAAGG GAGACGCTTC 
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263201 • GGGAGCGGTT TTTGATCTCA TTATAGGAGC CTAGGTTTCA AAAGACAAGG 

263251 GCTATCCGAA GAAAAAATAA TTAAGTTTTT TTAAAAAATC GAAAAAATTT 

263301 TAATATGTTT CTCGTGGCAA TTTAAAGAGA ACAAAAAGGA TTACATCACA 

263351 TGGACGTTTC TCGTAAAATT AATCGACACA CTCAGTTTTA CGTGGATTCG 

2 634 01 ATAGACGGTG TCATCAAAAA CTTTGATCAT AAGCCTAGTG AAGATAAATC 

263451 TCGAGATCAT GAAGAATTAG AGGAAAAGCT TTTAACTATA ACAAAACGTA 

263501 TTGTTGCTTC TGCTCAAGAA TTTCAGAATC GTAAGACGGA CTCTAAAAAC 

263551 TACTACTTAA AAAAAACTCA ATGGTTGCCA TTCAAAAATG AAGAGTTGGA 

263601 GCAAACTAAG GAATTGTTTG CCATGTTAAC TTCGATGGAT AAAAAAATAG 

263651 CTCAGTTATT TTTTTATTCT CCCGGATGTA GTAGCGACTG GGTAGAATTT 

263701 ACCGAAGTAA TTTGTCATTT AAATGATTCT ATAGGTTTGG GAGGGGTTTT 

2637 51 ACTGTGTTGC GGATTATTCG AACAACAGTG TGAGCATGTT GTTACTGTGA 

263801 ATAAAAAGCT AGACCTACCG CTTCTTTTAG GAACTACCGT TGTAAATAGT 

2 63851 CTACGTTATT ATTTAACCTA TAGGAATATC TCTCTTTTGA ATTGTCAGAG 

263901 CATGAGTGAG CTAGGCAAGG AGCTCGGGGA TGTTTTAAAG CAACATGGAG 

263951 TTGCTTTCAC TTTAATTTTT AAAGAGATTG TGGATATAGA TTTATTAAAC 

2 64001 TATGTGAAAC TAATTCAAGG ACTCAAGCGT AGTGGAAATA TTCAAGCACG 

264 051 TATTTATGAT AATGATGTGC CTACTTTACC TTCAGTTTCA TCGAGTCCTA 

264101 TTGCGTTGAG ATATAGCTTA GCGAATACAA TTCGGGGCCT TGCTTTACAT 

264151 GTAGATTTTT CTTCATTGAA GTTTATAAGC CCGTCTATAC TTTCCAATAC 

264201 AGAGCACACT GCAAAAGCTT TAAACTCTGG CGGAGAATGT TTTATCTTTT 

264251 CTAATCTGGA TGAATTCAAT CTTGGAATGA AAATAGTCAT GCAACTATTG 

264301 CGGACAGGGA AGATTAGCCC AGAAATTTTA AATAAAAATA TCATGAAAAT 

2643 51 TCTCATGATA AAAAGAAGAG TACGTTCTTT ATATATTTAA TTATTTGGTC 

264401 TTTTTTTTGA TTTCAAAACT AATAAAATAA ATAGAATTTT AGATCTCTGA 

264451 AGGAATCTCA GCAAGGCTGG GAGTCGATAG ATCTCTTACT TGTTTTTCTA 

264 501 ACTTACTTAG TCTTTCTTCA GTTTTAGGAA GGTTCCGAAT TTTAGCAATC 
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264551 AACCGATGTG TTTCTTGATA AGGTCGTGCT GGAGCGCCTC CATAAATGCC 

264 601 TGGAGAGGTG ATAGATTTTG TGACTCCAGT TTGAGCAATC ATGATCACAT 

264 651 GGTCTGCAAT AGAAATATGC CCAGTAATTC CGGTTTGCCC TCCAATGATG 

264701 ACATGTTCAC CAATTTTTGT AGAACCTGCA ATGCCTGCTT GGGCAACAAT 

264751 AATACTATGC TTTCCAATTT CTACGTGATG AGCTACTTGT ACTTGGTTAT 

264801 CTATTTTAGT TCCTTCATGG ATCACGGTGT TCTTGAATCG ACCACGATCT 

264851 ATCGTAGTGT TGGCTCCGAT TTCTACATCA TCACCTACAA TCACATAGCC 

264901 TAGATGCTTT AAAGGTTTGT GATGACCAAA AGCATTTGTA ATATAACCAA 

264951 AACCACAGGA TCCTAAAACA GCTCCAGGTT GAACAACTAC ACGGTTTCCC 

265001 ATGAGGACTC TTTCTCGAAT CACCACCTTA GGGTGAATCA GACAGTTAGC 

265051 ACCTAGAACG CTGTGAGCTC CAATGACACT TCCAGCTCCG ATGTATGTGT 

265101 CAGAGCCGAT ATGGGCATGT TGACTAATGA CAACGTAAGG TTCTATGGTT 

265151 ACATTTTTCT CAATACGTGC AGTAGGATGA ATCACTGCAG TAGGATGAAT 

265201 ACCAGGAAAC CCTGATGTTA CGGGTTCAAT AAACAACTCT ATGCACTTTT 

265251 GAAATGTTAG AGAAGGGGAT TCATTGGTAA TAAGAAAGTT TTTCTTTAGG 

265301 TGGGCATGTT GCATTGCCTG AGATCTAGAT AAAATAATAG CACCAGCTTT 

265351 GGTGTTTTTT AGAAAGCTAG AGTATTTCTC ATTATCTAAA AAAGCAATAT 

265401 GGTGAGGTTG CGCCTGACTA ATATCTTCAA CACCTGAAAT AGGAGTTTCT 

265451 ATATTTCCTT GAACTTCGAC TTGTAGTAGC TCAGCTAACT GTTTAAGAGT 

265501 GTAGACTGGT GCTTCGGACA TAGAAAACTC CTTAAACTTG GACTAGTTTT 

265551 GTTTT-TTGAA AGATTCGTTA AGAATAGCAA TAATTTCGGT TGTTTTATCA 

265601 GTCCCAGGTG CTATTGCTAA GACAGGTTCT TCATTAAGGA TAGCTTCTAG 

265651 TTTTTCTTTG "GAC'CGCACTG ATTCTGCAGC TATTTTTACT TCTTGAATGA 

265701 GTTTTTGAAT GCGTTTTACA TGTACTTTGA TTGATAGATT GATAGTACTG 

265751 AGACTGGTAC GCATTGTACT CTCCTGAAAG ATCTTCGAAT TTCTTTCGCA 

265801 ACTCTTCAGA GGCAGAATCC GATAGGCfTTT CCATGTAATC TTCATCTTGC 

265851 AACTTATTAT AAATAGAAGT GAGTTCTTCT TCTATTTTCT CAGCATTTTT 
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265901 TACAAACTGC TGTTTCATAG CTTCCAATTC TTCAGTTTCC TTTTTACCTA 

265951 GATCGGATTC TTCAAGACAT CGCTTTAAAT TAACATAGCC TAAATTTGCA 

266001 TGAGCTGCGC TTGTTGATCC TAAAACAAGA AGAAATGTAG AAAATAATAA 

266051 TTTTTTCATA ATTACCTGTA CTAACTTGGG TATAGAAAAA GGGTACCAAA 

266101 AGCCTTTTCT GAAAACAACA AAGATTTCCT TCGATAAGTC CTTAATTTAT 

266151 ATCTTAGAAC ATGCCCCCTA AAGCAAAGAA GAATCGCTGA GATACATCAA 

266201 TTTTTTCTCC ATTCAAAGTC TCGGTTGGAC GGAAGGGCCA ACCAAATCCT 

2 66251 AACATAACAG GAACATTATT CATTACATCG AAGCGCAGAC CAAATCCAGC 

266301 ACTACTACGT AGATCTTTTA ACGAAATCTT ATACTCTTGT AAACCGACAA 

266351 AACCTGAGTC TAAGAATACA AAGGCACTAA TATTAGGTTG TCTGATGAGA 

266401 GGGTATTGAA ACTCTTCTGA AATAAGGAGC GAAGAGAGTC CTCCCTGAGG 

2664 51 TTCTGTAGCA GAGTATTTTG GACCGATAAT AAAGGATTTA TATCCCCGAA 

266501 CTGTAGTCTC TCCACCTAGG AAGAAGCGCT CACTGACAGG AACTCCTTCA 

266551 GCTGTAGTAT TGCTATAGGG TTTAATAAAT TGAGCTTCCC CTTTGATTTT 

266601 CAAAATACCT TTACGCGTAA GTTTTCTATA GATAGAGCTG TTTAAAGAGA 

266651 GTTTTGTAAA ATGATAAGTT CCTCCCAAAC CAGAAACCTC AAAAGTCACC 

266701 CCCCCGCGAA TCCCTGTAGT TGGAGTTCTA GGACTATCTA CAGAATCGTA 

266751 ATTCAAGTTG ACACCTGCAG CAGAGACAAA TCCTTTATTG CTGTCTATAT 

266801 TTGGCCCTAG GAGGAACTTA CGTTTTTCAT GTAAACTCGT TTGACTTCCT 

266851 CGATAAAATA GACCGTATTT CAGGTGTTCG TTCAAGATAT ACGTTGTGCT 

266901 GACGTTCCCG CCATAGGTTT GGACAGCATA ATCTTTAGAT AATGCTCTGT 

266951 TAATTGATTT ATCTAATTCA ATTCCTAAAA TCCAAGGAGT GTTTAGAAAA 

2 67001 TGAGGTTTGG TCCACTTCAA AGTATAGTCT GTGACTTTGT CCCCGAAGTT 

267051 GGCTTTTAAG AATAGATGTT CTCCACCGCC TCTTAGACAA CGAAAACCTT 

267101 TAGAAAATAT ATTTCTAGCT CCAAATAGAT CAAAATTACT TTCAGATAGT 

267151 TCAATTCCTC CAAAAAGATT GTCAAGAGAA CTAAATCCTA AGAATAAGCC 

267201 TAAGTTTCCT GTTGTTGTTT CTTTGACTTC TACAAAAATA TCTCGGTATT 
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267251 GATCCGCATT GCCCATAGGA TCAAGTTGAG AACGAACTGT ATAGACACTA 

267301 ACGCTTTGGA AGTAGCCTGT ATTTCTTAAA CGTTGCTCAG TATCTTCTAG 

267351 CTTTAAGCGA TTGAATGTAT CTCCTGGGAA GAGACTGGTT TCGTGTAAAA 

2674 01 TAACGTCAGA TTTTGTATGG GTATTCCCAG TAATTTTAAT TAACCCAACT 

267451 TTATAAGGAG ACCCTTCACT TACCTCATAA GTTACATCAT AAATAGGGCG 

267 501 GGTTGCGTGA GGGATGAAGA GAACGTCTAC ATTGGTATTG ATGTAGCCAT 

267 551 ACTTTGCATA AGTTTGTTTG ATCTTATGAG CCCCATCCCA TATTTTATCG 

267601 GGGCAATAAA GATCATTGGG GCCGACTTGG GATTGCTTTT CTATAAGGCG 

267651 TTTTGGCAAA ACCTCAAACC CTTGGATATG GACGTGTCCT AAGGTATATC 

267701 GCGACCCTCG ATCAATATCC ATGTAAAGAA GAATATTCCC TTTGTCGTCA 

2677 51 AGGTCATAGT GAGAGTTGAC TATAGCATCA GCGTACCCGT TATTATGTAG 

2 67801 GTAATTCGTA ATTGCCAAGC TATCTTGTTC AACAATATCT GGGTGATAGA 

267851 GTCCAGCTCC AGTAAACCAA CTTGTAGTTG TAGAGTGCTG CTTGGTTTGA 

267901 ATAAATTCTT GGATATCTGA TTTTTCTGAT CGAGAGATTC CTGAGAACGT 

267951 AAGCTGTTTA ATTTTCCCGC AAGGACCTTC ATTGATTTTA ATTAAAACAT 

268001 CGATGTGACC TTTTTCTTGA TTGTGTTCCA GACTGTAGTC TACACTGGAT 

268051 GCGAAATATC CTCGCTTGAG ATAATACGTT CTTAGATCAT CAAGACCCTT 

268101 AAGAAATTTT TCTCGTTCAA AGAGATCATT ACGGTAAATT TGTAGGGTTT 

268151 TAAGAATTTT ATGTTCAGGA ACGACTTGAT TTCCTGAGAT ATGAATATTT 

268201 CGAATTGAGG GTTTAGCTAT TAGGTGAAGG GCTATGTTAG TTTTCCCTTC 

268251 AGAAAATTCT ACTTTAGGCT CAACAGAGTC GTATTCTTTA GCTAGAATTC 

2683 01 TCAAGTCTTC ATCAAAATCT AATTGAGAAA AAAGAGCCCC ACTTCTGGTC 

268351 TTTAATTTGG GTAAGGGATG TTTATTTGAA GCATTTTCTC CTTCCGTTAT 

268401 GATTGTGATA GAGTCTACCA CCACATGGCC TTCTTTAACT TTTTCAGTAG 

268451 AAAATAAAGT TAAAGGGGTT TGGATTAACG CTAGAATAGA TATTTGCAAG 

268501 ATAACTTTAT TTCGCATGAT GAGCATTCCC AAGAGTCTTC CCTAGATAGA 

268551 AAGCTTGTTT ATCAGATAAT GAGGTAAGCA CAAAAATAGG GACAAAGAAG 
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268601 TTTTTTAACA AGACCTTACT ACTTAGATTA TTTTTGTATT TCTAAAGAAA 

268651 TTTAAATTGT GTTTGATTTT TTCCCGTCCT AACACCGGGA TTATTACTGT 

2687 01 GCTCAGACTA GAGCCTGAAT ATTTCGGCGG GTTTTTTGCT TCCTTAAAAA 

2687 51 GCTTAATAAA CCTTGTCGCT CAAATAACTC TACTGATTAC ATGGTTTTAA 

268801 GGCAAATTCG ATAATTCACT GTGTCCTAGG GGAAAGTAAG AAAGACTAGC 

268851 ATAGAAGAGA ACCTTGTTTA CTTTTAGGAT AAGAGAGCTG CTAATAGGAG 

268901 TGTCGTCCAG AAAAAGCTCT TGCCAGTGTC CCTGAATCTA CATAATCAAA 

268951 AGATAAGCCT ATAGGAAGAC CTAAAGCTAG ACGGGAAATA TTTACAGAGA 

269001 AATGTTGTAA TTCTTGTTTT AGAAAAAGGG CAGTAGCATC TCCTTCTAAG 

269051 GTTGCATCAA TGGCTAGGAT AATTTCTTTT GGGCATAGCG TTTCTATGCG 

269101 TGATTTTAAA ATGGAGAGAC GCTCGTTTTC TATATGTTTC CCTGTAATGG 

269151 GCGATAAGAG TGAACCAAGA ACATGATAAC GTCCCTTGAA TACTTTAGAA 

269201 CGTTCTAGAA AGAAAACATC TTTTGGAGAA GCGACAATAC ATAGACTTTG 

269251 GTTATCTCTT TCTTCTCTAC AAAAGTGACA GTCTGCCTCT TTAGATTCTT 

2693 01 TGAGAGTAAA ACATAGGGGA CAGTGACTAC GCTCACTAGC AACATTATGA 
269351 AAAGCGTTAC CTAATATTTT TAATTGTTCG CTGTCCCAAG AGATGAGTTC 

2694 01 AAAAGCAAGT TTTTCTGCTG TTTTAAATCC AATTCCTGGA AGTTTTCGTA 
2694 51 AAAAGAAAAT TAATTTAGAT AAGTAATCTG GATATCTTGT CATAGTGATG 
269501 TGAATTTTAT TTTTACATTC GGGTCTGGGG CCTAAATTAA GGTTAGAGTA 
269551 TAAACTCTCT GAATAGTATA CTAGCTTTTT TCTTTACATG TGGTTCTCTG 
269601 TGAATAAAAA CAAAAAAGCA GCAATTTGGG CAACGGGTTC CTATTTGCCT 
269651 GAGAAAGTTC TTTCAAACGC AGATTTAGAA AAAATGGTAG ATACCTCTGA 
269701 TGAGTGGATC GTGACCAGAA CGGGGATCAA AGAGCGTCGT ATTGCTGGAC 
269751 CTCAGGAGTA CACTTCTCTT ATGGGAGCCA TCGCTGCAGA GAAAGCTATA 
269801 GCAAATGCGG GTTTAAGCAA GGATCAGATT GACTGTATCA TTTTCTCGAC 
269851 AGCAGCACCA GATTATATTT TCCCATCAAG CGGAGCTCTT GCTCAAGCAC 
269901 ATTTAGGCAT TGAGGATGTC CCTACATTTG ATTGCCAGGC GGCTTGTACT 
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269951 GGGTATTTGT ATGGTTTGTC TGTAGCTAAG GCTTATGTAG AATCAGGTAC 

270001 ATATAACCAT GTATTGTTAA TTGCTGCTGA TAAGTTGTCT TCTTTTGTAG 

270051 ATTATACAGA TCGGAATACC TGTGTGTTGT TTGGAGATGG AGGAGCTGCT 

270101 TGTGTCATAG GGGAGAGTCG GCCAGGATCT TTAGAGATTA ATAGGTTGTC 

270151 TTTAGGCGCA GATGGTAAGC TAGGAGAGTT ATTAAGCCTT CCTGCTGGAG 

27 0201 GTAGTCGTTG TCCTGCTTCT AAAGAGACTT TACAATCAGG CAAACATTTT 

270251 ATTGCTATGG AGGGAAAAGA AGTTTTTAAG CATGCTGTGA GACGTATGGA 

270301 AACGGCAGCT AAACATTCGA TAGCCCTGGC AGGCATTCAG GAAGAGGATA 

2703 51 TAGATTGGTT TGTACCTCAT CAAGCTAATG AAAGAATAAT AGATGCTTTA 

2704 01 GCGAAGCGTT TTGAGATTGA TGAGTCTAGA GTGTTTAAGA GTGTACATAA 
2704 51 GTATGGAAAT ACTGCGGCCT CGTCTGTGGG CATTGCTTTG GATGAATTAG 
270 501 TTCATACAGA ATCCATTAAG CTTGATGATT ATTTACTTTT AGTTGCCTTT 
27 0551 GGGGGCGGTT TGTCTTGGGG CGCAGTAGTT TTAAAGCAGG TCTAATAAGG 
270601 ACGATAATTT CATGAAAAAA CGTTATGCTT TTTTGTTCCC AGGACAAGGG 
270651 AGCCAATATG TAGGTATGGG ACAAGACCTA TATATGGAGT ATCCTGAGGT 
270701 TAGAGAGCTT TTTGATTTTG CTAATGAAAG GTTAGGATTT TCTCTGACTT 
270751 CAATTATGTT TGAAGGTCCT GAGGATCTTT TGATGGAAAC AGTACATAGT 
27 0801 CAGCTAGCTA TTTATCTTCA TAGCATGGCT GTGGTAAAGG TTCTATCTCA 
270851 GCGTTCTTCT ATTCAGCCTT CTTTAGTCTC TGGATTAAGT TTAGGGGAGT 
270901 ATACTGCTTT AGTTGCTTCC GATAGAATCT CCGTGCTCGA CGGCCTTGAG 
270951 CTTGTTAGAA AGCGTGGTCA GTTAATGAAT GAAGCTTGTA ATCAGAGCCC 
271001 AGGGGCTATG GCGGCTTTAT TAGGGCTTCC CTCTGAAGTT ATAGAGGAAA 
271051 ATATAACAAG TCTTGGTCAA GGAATTTGGA TTGCTAATTA TAATGCACCC 
271101 AAACAGCTTG TAGTGGCTGG AATAGCAGAA AAAGTAGACC AAGCGATTGA 
271151 GTTATTTCGT GATTTAGGAT GTAAAAAAGC AGTTCGTTTA AAGGTGTCTG 

• 271201 GAGCATTTCA TACTCCTTTA ATGCAAGTTG CTCAAGATGG CTTAGCTCCA 

271251 GACATTTATG CTTTATGCAT GAAAGATTCT AGCCTTCCCT TAGTGTCACA 



201 



WO0027994 ffile:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\WQ0Q27994. cDc1 



Page 329 of 330 



WO 00/27994 PCT/US99/26923 

271301 CGTGGTAGGA AAATCTTTAG TAAATACTGA AGAAATGCGA GAGTGTTTAG 

271351 CTCGGCAAAT GACATCACCT ACGTTATGGT ATCAGAGTTG TTACCATATC 

271401 GAATCAGAGG TGGATGAGTT TTTAGAATTA GGTCCAGGAA AAGTTTTGGC 

271451 TGGTTTAAAT CGCTCTATAG GGATTTCTAA ACCGATTACA AGTCTTGGTA 

271501 CTTTTGCTCA GATTGAAAAA TTCCTATCAG AGGTATGATT TGTATGGATA 

271551 TAACATTAGT AGGCAAAAAA GTTATAGTAA CTGGAGGATC TCGAGGAATT 

271601 GGACTCGGGA TAGTTAAGCT TTTTCTTGAG AACGGAGCAG ATGTAGAAAT 

271651 TTGGGGATTG AATGAGGAGC GAGGTCAGGC TGTTATAGAA AGTTTAACAG 

271701 GCTTGGGTGG CGAAGTTTCT TTTGCTCGTG TGGATGTGAG TCATAATGGT 

271751 GGAGTGAAAG ATTGCGTGCA GAAATTTTTA GATAAGCACA ACAAAATAGA 

271801 TATTTTGGTA AATAATGCAG GCATTACCAG GGATAATTTG TTGATGCGTA 

271851 TGTCTGAGGA CGACTGGCAA TCGGTGATTA GCACCAACTT GACTTCCTTG 

271901 TATTATACAT GTTCCTCAGT GATTCGCCAT ATGATTAAGG CGCGTTCAGG 

271951 ATCTATTATA AATGTGGCTT CTATTGTTGC TAAGATCGGT AGTGCGGGCC 

272001 AGACCAACTA TGCTGCTGCT AAAGCTGGGA TTATTGCTTT CACAAAATCT 

272051 TTAGCTAAGG AAGTAGCTGC AAGAAATATT CGTGTCAACT GCCTTGCTCC 

272101 AGGCTTTATT GAAACAGACA TGACAAGCGT GTTGAATGAC AATTTAAAAG 

272151 CTGAGTGGCT TAAGTCGATC CCTTTAGGTA GGGCTGGCAC TCCAGAAGAT 

272201 GTTGCTCGTG TGGCGTTGTT TTTAGCCTCG CAGTTATCGA GCTATATGAC 

272251 CGCGCAGACA CTGGTTGTTG ATGGGGGATT GACTTACTAA GACAATAGAA 

272301 GAAAGGGATT TGAAAATTTC TCTTCGAGAA CTAATTAAGT AACCGTCGAA 

272351 TAAAAAATGA TTTTTTGCGA TACTAATTCT CTTTCTCTTT GTCCCTAGGG 

272401 AATAGTGAAG TATTGTATAG TTTAAATAGT AAAAGGATAT AAGCAATGAG 

2724 51 TTTAGAAGAT GATGTAATAG CAATTATTGT TGAGCAGTTA GGAGTGGATC 

272501 CAAAAGAAGT TAATGAGAAC TCTTCTTTTA TTGAAGACTT GAATGCTGAT 

272551 AGTTTAGATT TAACAGAATT GATTATGACT TTAGAAGAAA AATTTGCTTT 

272601 TGAAATTTCA GAAGAAGATG CTGAGAAGCT TCGTACTGTC GGGGATGTAT 

202 



WO00279 94 [ file:/A\dcwas03\firmdata\lp\FoleyPat\PatentDocuments\WO0 027994.cpcl 



Page 330 of 330 



WO 00/27994 PCT/US99/26923 

272651 TTACTTATAT TAAGAAACGT CAAGCTGAAC AATAAACTTT CTTATATTCT 

272701 GGGTGCGTCG TAATCTTTTT TAAAGCTATT GTTTTTCAAT AGTGATTACG 

272751 GCGTGCCTTT TTTTCTATAG AGTAGCTCAT CTAGAAAGAT TTATTTTGTC 

272801 TTTTTAAGGT TCTCTGAACT TGATTTGTTT AGCATAGAGC TCTAAAAAAG 

272851 ATAAAGCTAC GGATGGGCAC TCTTCCACAA TGTTTAGAAT TTGTCCTTTG 

272901 CTAAGAACTA GCATGCGGAC TTGTGTATTT GCAGAAGCAT TGTATTCCCT 

272951 GGGCTTATTA TTGAATAAGC TTTCCTCTCC AAAACAATCT AAAGGTTTTA 

273 001 AATTTAGAGG AGACTCTAGT TTTTCTTTAG AGATCGTAAT GTATCCTTCT 

273051 ACAATGATAT AAAAGCTGAA TCCAGGTTGT CCTATAGAGA ATACATTGCT 

273101 GCCAGGCTTA AATATTATCG TTTCAGTTTT ATCGGCAATT GTTAAAAGAA 

27 3151 GGTCCATGTC TAAAGATTGG AATATAATCG TTTTTTTTAG TAGAAAGGCG 

273201 CGATCGATCA AATTCATAAA AAAGTTCCTT ATTCACACCA TAGTTTTAGT 

273251 TTTT 
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